Show HN: Landmark AI and ML research explained, redrawn, animated
Interactive, animated, visual explainers of landmark AI & ML papers — attention & transformers, GPT-3, FlashAttention, Mixtral, Mamba, DeepSeek, and more.…
Rudrite Research: the frontier, made legible

Interactive, animated, visual explainers of landmark AI & ML papers: the systems and ideas behind the models you use, redrawn and made legible. Free and open.

Browse all 100 explainers · Guided reading tracks

Attention Is All You Need
FlashAttention
PagedAttention (vLLM)
Megatron-LM
DeepSeek-R1
GPT-3: Language Models are Few-Shot Learners
ZeRO: Zero Redundancy Optimizer
Mixtral of Experts
Training Compute-Optimal Large Language Models
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
BERT: Pre-training of Deep Bidirectional Transformers
DeepSeek-V3
Qwen3
OLMo 2
MiniMax-01
Gemma 4
Scaling Laws for Neural Language Models
Adam: A Method for Stochastic Optimization
Deep Residual Learning for Image Recognition
Denoising Diffusion Probabilistic…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Rudrite Research.