Hey,
As a matter of personal interest I’m not sure if anyone else here saw the chat he gave yesterday on LinkedIn.
At one point he mentioned that some researchers are now experimenting with Diffusion, rather than Transformer based LLMs. He didn’t mention any names, but I’d love to know more about it/see reference to papers.
Has anyone else heard about this ?