AI & ML interests
LLM
Recent Activity
View all activity
-
OpenMOSS-Team/MOSS-TTS
Text-to-Speech ⢠8B ⢠Updated ⢠74.2k ⢠362 -
OpenMOSS-Team/MOSS-TTS-Realtime
Text-to-Speech ⢠2B ⢠Updated ⢠67k ⢠72 -
OpenMOSS-Team/MOSS-TTS-Local-Transformer
Text-to-Speech ⢠3B ⢠Updated ⢠51.4k ⢠23 -
OpenMOSS-Team/MOSS-Audio-Tokenizer
Feature Extraction ⢠2B ⢠Updated ⢠66.6k ⢠38
-
OpenMOSS-Team/MOSS-TTSD-v1.0
Text-to-Speech ⢠8B ⢠Updated ⢠8.69k ⢠53 -
OpenMOSS-Team/MOSS-TTSD-v0.7
Text-to-Speech ⢠2B ⢠Updated ⢠224 ⢠18 -
OpenMOSS-Team/MOSS-TTSD-v0.5
Text-to-Speech ⢠2B ⢠Updated ⢠694 ⢠54 -
OpenMOSS-Team/MOSS-TTSD-v0
Text-to-Speech ⢠2B ⢠Updated ⢠4 ⢠28
True Speech-to-Speech Langugage Model
First Omni-modal Future Forecasting Benchmark
https://github.com/OpenMOSS/FRoM-W1
Proactive Robot Manipulation in Omni-modal Context
Open source weights of Lorsa modules introduced in "Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition".
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
-
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
Paper ⢠2502.14837 ⢠Published ⢠4 -
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_16
Text Generation ⢠6B ⢠Updated ⢠6 ⢠1 -
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_32
Text Generation ⢠6B ⢠Updated ⢠6 ⢠1 -
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_64
Text Generation ⢠7B ⢠Updated ⢠21 ⢠1
Opensource Lorsas and Transcoders
A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.
Evaluating Agentic Backend Coding Capabilities in Real-World Development Scenarios
-
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Paper ⢠2601.11077 ⢠Published ⢠67 -
OpenMOSS-Team/ABC-Bench
Viewer ⢠Updated ⢠224 ⢠176 ⢠4 -
OpenMOSS-Team/Qwen3-32B-ABC
Text Generation ⢠33B ⢠Updated ⢠18 ⢠2 -
OpenMOSS-Team/Qwen3-8B-ABC
Text Generation ⢠8B ⢠Updated ⢠5 ⢠3
[ICLR 2026] Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning
An Efficient Training Framework for Diffusion Language Models
-
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
Paper ⢠2503.10480 ⢠Published ⢠57 -
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
Paper ⢠2506.23127 ⢠Published ⢠2 -
World-aware Planning Narratives Enhance Large Vision-Language Model Planner
Paper ⢠2506.21230 ⢠Published ⢠1 -
OpenMOSS-Team/Embodied_R1-ScienceWorld
8B ⢠Updated ⢠2 ⢠1
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
-
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_8-refactor
Text Generation ⢠0.1B ⢠Updated ⢠2 ⢠1 -
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32-refactor
Text Generation ⢠0.1B ⢠Updated ⢠19 ⢠1 -
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_16-refactor
Text Generation ⢠0.1B ⢠Updated ⢠3 ⢠1 -
OpenMOSS-Team/SmolLM-360M-MLA-d_kv_8-refactor
Text Generation ⢠0.3B ⢠Updated ⢠6 ⢠1
-
OpenMOSS-Team/moss-moon-003-sft-plugin
Text Generation ⢠Updated ⢠30 ⢠71 -
OpenMOSS-Team/moss-moon-003-sft
Text Generation ⢠Updated ⢠739 ⢠128 -
OpenMOSS-Team/moss-moon-003-base
Text Generation ⢠Updated ⢠697 ⢠132 -
OpenMOSS-Team/moss-moon-003-sft-int4
Text Generation ⢠Updated ⢠43 ⢠41
Opensource Lorsas and Transcoders
-
OpenMOSS-Team/MOSS-TTS
Text-to-Speech ⢠8B ⢠Updated ⢠74.2k ⢠362 -
OpenMOSS-Team/MOSS-TTS-Realtime
Text-to-Speech ⢠2B ⢠Updated ⢠67k ⢠72 -
OpenMOSS-Team/MOSS-TTS-Local-Transformer
Text-to-Speech ⢠3B ⢠Updated ⢠51.4k ⢠23 -
OpenMOSS-Team/MOSS-Audio-Tokenizer
Feature Extraction ⢠2B ⢠Updated ⢠66.6k ⢠38
-
OpenMOSS-Team/MOSS-TTSD-v1.0
Text-to-Speech ⢠8B ⢠Updated ⢠8.69k ⢠53 -
OpenMOSS-Team/MOSS-TTSD-v0.7
Text-to-Speech ⢠2B ⢠Updated ⢠224 ⢠18 -
OpenMOSS-Team/MOSS-TTSD-v0.5
Text-to-Speech ⢠2B ⢠Updated ⢠694 ⢠54 -
OpenMOSS-Team/MOSS-TTSD-v0
Text-to-Speech ⢠2B ⢠Updated ⢠4 ⢠28
A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.
True Speech-to-Speech Langugage Model
Evaluating Agentic Backend Coding Capabilities in Real-World Development Scenarios
-
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Paper ⢠2601.11077 ⢠Published ⢠67 -
OpenMOSS-Team/ABC-Bench
Viewer ⢠Updated ⢠224 ⢠176 ⢠4 -
OpenMOSS-Team/Qwen3-32B-ABC
Text Generation ⢠33B ⢠Updated ⢠18 ⢠2 -
OpenMOSS-Team/Qwen3-8B-ABC
Text Generation ⢠8B ⢠Updated ⢠5 ⢠3
First Omni-modal Future Forecasting Benchmark
[ICLR 2026] Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning
https://github.com/OpenMOSS/FRoM-W1
An Efficient Training Framework for Diffusion Language Models
Proactive Robot Manipulation in Omni-modal Context
-
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
Paper ⢠2503.10480 ⢠Published ⢠57 -
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
Paper ⢠2506.23127 ⢠Published ⢠2 -
World-aware Planning Narratives Enhance Large Vision-Language Model Planner
Paper ⢠2506.21230 ⢠Published ⢠1 -
OpenMOSS-Team/Embodied_R1-ScienceWorld
8B ⢠Updated ⢠2 ⢠1
Open source weights of Lorsa modules introduced in "Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition".
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
-
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_8-refactor
Text Generation ⢠0.1B ⢠Updated ⢠2 ⢠1 -
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32-refactor
Text Generation ⢠0.1B ⢠Updated ⢠19 ⢠1 -
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_16-refactor
Text Generation ⢠0.1B ⢠Updated ⢠3 ⢠1 -
OpenMOSS-Team/SmolLM-360M-MLA-d_kv_8-refactor
Text Generation ⢠0.3B ⢠Updated ⢠6 ⢠1
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
-
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
Paper ⢠2502.14837 ⢠Published ⢠4 -
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_16
Text Generation ⢠6B ⢠Updated ⢠6 ⢠1 -
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_32
Text Generation ⢠6B ⢠Updated ⢠6 ⢠1 -
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_64
Text Generation ⢠7B ⢠Updated ⢠21 ⢠1
-
OpenMOSS-Team/moss-moon-003-sft-plugin
Text Generation ⢠Updated ⢠30 ⢠71 -
OpenMOSS-Team/moss-moon-003-sft
Text Generation ⢠Updated ⢠739 ⢠128 -
OpenMOSS-Team/moss-moon-003-base
Text Generation ⢠Updated ⢠697 ⢠132 -
OpenMOSS-Team/moss-moon-003-sft-int4
Text Generation ⢠Updated ⢠43 ⢠41