Novita AI Evaluates FlashMLA on H100 and H200
DeepSeek has officially kicked off its five-day open-source release initiative, and the first featured project is FlashMLA. FlashMLA is an efficient MLA decoding kernel designed specifically for NVIDIA Hopper GPUs (e.g., H800 SXM...
novita.hashnode.dev · 4 min read