Teaching LLMs to understand images and videos in addition to text...
An introductory, simple, and functional implementation of MoE LLM pretraining...
Understanding reasoning models and their relation to standard LLMs...
Understanding models like DeepSeek, Grok, and Mixtral from the ground up...
Understanding the current state of LLM scaling and the future of AI research...