🔄 In a Training Loop

Zinan Tang

Word2Li

·

https://zinantang.pages.dev/

AI & ML interests

NLP、LLM、Data4LLM、LLM4Data

Recent Activity

liked a dataset 6 days ago

PolarSeeker/OpenSeeker-v1-Data

liked a dataset 6 days ago

miromind-ai/MiroVerse-v0.1

authored a paper 7 days ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

View all activity

Organizations

None yet

liked 2 datasets 6 days ago

PolarSeeker/OpenSeeker-v1-Data

Viewer • Updated Mar 17 • 11.7k • 1.73k • 49

miromind-ai/MiroVerse-v0.1

Viewer • Updated Jan 16 • 228k • 218 • 238

authored a paper 7 days ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

Paper • 2607.01104 • Published 12 days ago • 19

upvoted a paper 11 days ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

Paper • 2607.01104 • Published 12 days ago • 19

submitted a paper to Daily Papers 11 days ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

Paper • 2607.01104 • Published 12 days ago • 19

upvoted a paper 20 days ago

BioMatrix: Towards a Comprehensive Biological Foundation Model Spanning the Modality Matrix of Sequences, Structures, and Language

Paper • 2606.22138 • Published 23 days ago • 24

upvoted 6 papers 26 days ago

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Paper • 2605.02290 • Published May 4 • 42

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

Paper • 2605.15565 • Published May 15 • 17

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published May 26 • 41

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published Jun 12 • 19

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 28 days ago • 122

Ling and Ring 2.6 Technical Report: Efficient and Instant Agentic Intelligence at Trillion-Parameter Scale

Paper • 2606.15079 • Published about 1 month ago • 87

commented a paper 28 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published May 29 • 20 •

upvoted 7 papers 28 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published May 29 • 20

MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

Paper • 2605.30288 • Published May 29 • 23

STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations

Paper • 2606.05165 • Published Jun 3 • 4

LLM Explainability with Counterfactual Chains and Causal Graphs

Paper • 2606.05972 • Published Jun 4 • 18

PaperFlow: Profiling, Recommending, and Adapting Across Daily Paper Streams

Paper • 2606.07454 • Published Jun 5 • 14

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published Jun 9 • 41

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published Jun 11 • 93