kedar kolluri
kktw
AI & ML interests
None yet
Recent Activity
published an article about 4 hours ago
1.7x Faster on a 218B Model: EAGLE3 Speculative Decoding for GLM-4.7
published an article 6 days ago
2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5
published an article 8 days ago
Google Released Gemma-4 Four Days Ago. We Already Made It 1.72× Faster.