Collections

Discover the best community collections!

Collections including paper arxiv:2503.05236

UnifiedReward 2.0 Qwen3.5 Models

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6, 2025 • 94
CodeGoat24/UnifiedReward-Think-qwen35-9b

9B • Updated Mar 9 • 121
CodeGoat24/UnifiedReward-Think-qwen35-27b

3.05M • Updated Mar 15 • 387

UnifiedReward 1.0 LLaVA Model

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6, 2025 • 94
CodeGoat24/UnifiedReward-Think-7b

8B • Updated Aug 29, 2025 • 7 • 10
CodeGoat24/UnifiedReward-7b-v1.5

8B • Updated Nov 5, 2025 • 9.79k • 7

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5, 2025 • 52
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 191
Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23, 2025 • 53
Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124

UnifiedReward 1.0 Qwen2.5 Models GGUF

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6, 2025 • 94
mradermacher/UnifiedReward-qwen-32b-i1-GGUF

33B • Updated Jul 10, 2025 • 182 • 1
mradermacher/UnifiedReward-Think-qwen-7b-i1-GGUF

8B • Updated Jul 10, 2025 • 269

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Paper • 2503.10615 • Published Mar 13, 2025 • 17
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Paper • 2503.10630 • Published Mar 13, 2025 • 6
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10, 2025 • 88

UnifiedReward 2.0 Qwen3VL Models

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
CodeGoat24/UnifiedReward-Think-qwen3vl-32b

1.14M • Updated Mar 9 • 147
CodeGoat24/UnifiedReward-Think-qwen3vl-8b

9B • Updated Mar 9 • 1.07k • 2
CodeGoat24/UnifiedReward-Think-qwen3vl-4b

4B • Updated Mar 9 • 7

UnifiedReward 2.0 Qwen2.5VL Models

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
CodeGoat24/UnifiedReward-2.0-qwen-3b

4B • Updated Sep 17, 2025 • 7 • 2
CodeGoat24/UnifiedReward-2.0-qwen-7b

8B • Updated Sep 17, 2025 • 483 • 2
CodeGoat24/UnifiedReward-2.0-qwen-32b

33B • Updated Sep 24, 2025 • 208

Unified Multimodal Model

A curated list for Multimodal Model Generation papers.

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 78
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115
Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

Paper • 2502.05415 • Published Feb 8, 2025 • 20
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5, 2025 • 233
Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 172
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14, 2025 • 148

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published Feb 28, 2025 • 133
Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published Mar 7, 2025 • 46
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7, 2025 • 27
