The transformer architecture is notoriously inefficient on long sequences, a problem when processing images, which are essentially long sequences of pixels.
What’s new: Zizhao Zhang and colleagues at Google and Rutgers University simplified an earlier proposal for using transformers to process images. They call their architecture NesT.
Why it matters: Transformers typically bog down when processing images. NesT could help vision applications take fuller advantage of the architecture’s strengths.
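A rough sketch of why transformers bog down on images, and why a blocked (local) attention scheme like the one NesT builds on helps: full self-attention scores every token pair, which is quadratic in sequence length, while restricting attention to local blocks makes the cost linear. The function names below are illustrative, not from the NesT paper.

```python
def full_attention_pairs(num_tokens: int) -> int:
    # Full self-attention: every token attends to every token,
    # so the number of score computations grows quadratically.
    return num_tokens ** 2

def blocked_attention_pairs(num_tokens: int, block_size: int) -> int:
    # Local attention: tokens only attend within their own block,
    # so cost grows linearly with sequence length.
    num_blocks = num_tokens // block_size
    return num_blocks * block_size ** 2

# A 224x224 image split into 16x16 patches yields 196 tokens.
print(full_attention_pairs(196))         # quadratic cost
print(blocked_attention_pairs(196, 14))  # much cheaper linear cost
```

The gap widens quickly at higher resolutions, which is why local-attention variants matter for vision.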
We’re thinking: Computational efficiency for the Swin!
Great summary of the latest news from The Batch. The NesT architecture and the computational efficiency it brings are good news for the vision community.
Two other things I found interesting in this week’s Batch are Andrew’s notes on academia vs. industry and the news about the Multitask Unified Model (MUM) that Google plans to introduce in Google Search and Google Lens.
The comparison between academia and industry is definitely a must-read for anyone who would like to transition from one to the other.
Here is the link to this week’s Batch to read more about academia and industry, NesT, Google MUM, and other AI news.
We look forward to next week’s newsletter!