The models for the paper: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts
Yingfa Chen
chen-yingfa
AI & ML interests
Long-context modeling, continual learning, architectures
Recent Activity
updated a collection 1 day ago: HypeNet
liked a model 1 day ago: chen-yingfa/HypeNet-2B
updated a model 1 day ago: chen-yingfa/HypeNet-2B
Organizations
None yet