# aft-pytorch

Unofficial PyTorch implementation of the Attention Free Transformer layers by Zhai et al. [abs, pdf] from Apple Inc.
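For context, here is a sketch of the operations these layers compute, transcribed from the paper (notation follows Zhai et al.; see the paper for the exact definitions, including the learned pairwise position biases $w_{t,t'}$):

```latex
% AFT-Full: for each target position t, keys are combined with learned
% pairwise position biases w_{t,t'} instead of query-key dot products.
Y_t = \sigma(Q_t) \odot
      \frac{\sum_{t'=1}^{T} \exp(K_{t'} + w_{t,t'}) \odot V_{t'}}
           {\sum_{t'=1}^{T} \exp(K_{t'} + w_{t,t'})}

% AFT-Simple: the special case w_{t,t'} = 0, which reduces to a
% softmax over keys shared by all target positions.
Y_t = \sigma(Q_t) \odot \sum_{t'=1}^{T} \mathrm{softmax}(K)_{t'} \odot V_{t'}
```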
## Installation

You can install `aft-pytorch` via `pip`:

```
pip install aft-pytorch
```
## Usage
You can import the AFT-Full or AFT-Simple layer (as described in the paper) from the package like so:
### AFTFull

```python
import torch
from aft_pytorch import AFTFull

layer = AFTFull(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps of dimension 512
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]
```
### AFTSimple

```python
import torch
from aft_pytorch import AFTSimple

layer = AFTSimple(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps of dimension 512
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]
```
These layers are 'plug-and-play' with your existing networks / Transformers: you can swap out a self-attention layer for one of the layers in this package with minimal changes, as in the sketch below.
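As an illustration, here is a minimal sketch of such a swap: a pre-norm Transformer block whose attention sublayer is an `AFTFull` layer. The block structure, class name, and MLP sizes are hypothetical choices for this example, not part of the package; only the `AFTFull` constructor arguments come from the usage shown above.

```python
import torch
import torch.nn as nn
from aft_pytorch import AFTFull

# Hypothetical example block: a pre-norm Transformer block where the
# self-attention sublayer has been replaced by AFTFull. The class name
# and the MLP sizes are illustrative, not part of this package.
class AFTBlock(nn.Module):
    def __init__(self, max_seqlen, dim, hidden_dim, mlp_dim):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.aft = AFTFull(max_seqlen=max_seqlen, dim=dim, hidden_dim=hidden_dim)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, mlp_dim),
            nn.GELU(),
            nn.Linear(mlp_dim, dim),
        )

    def forward(self, x):
        # AFTFull keeps the [batch, seqlen, dim] shape, so the usual
        # residual connections work unchanged.
        x = x + self.aft(self.norm1(x))
        x = x + self.mlp(self.norm2(x))
        return x

block = AFTBlock(max_seqlen=20, dim=512, hidden_dim=64, mlp_dim=2048)
x = torch.rand(32, 10, 512)
y = block(x)  # [32, 10, 512]
```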
## TODO

- Add full AFT architecture
- Add variants like `AFTConv`, `AFTLocal`
## Contributing

If you like this repo, please leave a star! If you have any amendments or suggestions, feel free to raise a PR/issue.
## Credits

```bibtex
@misc{attention-free-transformer,
  title  = {An Attention Free Transformer},
  author = {Shuangfei Zhai and Walter Talbott and Nitish Srivastava and Chen Huang and Hanlin Goh and Ruixiang Zhang and Josh Susskind},
  year   = {2021},
  url    = {https://arxiv.org/pdf/2105.14103.pdf}
}
```
## License