arxiv:2411.19096
Haiyue Song
shyyhs
AI & ML interests
machine translation, subword
Recent Activity
liked a model 1 day ago
Fugaku-LLM/Fugaku-LLM-13B submitted a paper 6 days ago
OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-TrainingOrganizations
None yet