LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios Paper β’ 2310.08348 β’ Published Oct 12, 2023 β’ 4
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law Paper β’ 2507.18576 β’ Published Jul 24, 2025 β’ 10
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze Paper β’ 2404.16364 β’ Published Apr 25, 2024 β’ 1
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning Paper β’ 2509.07945 β’ Published Sep 9, 2025 β’ 1
The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training Paper β’ 2603.10444 β’ Published Mar 11 β’ 12
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning Paper β’ 2602.10575 β’ Published Feb 11 β’ 4
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning Paper β’ 2602.10575 β’ Published Feb 11 β’ 4
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Paper β’ 2406.05862 β’ Published Jun 9, 2024 β’ 4
view post Post 1412 The #1 trending AI/ML dataset today πMassive scale, diversity and end-to-end potential from nvidia ! nvidia/PhysicalAI-Autonomous-Vehicles See translation π₯ 1 1 + Reply
view post Post 836 The new King πhas arrived! Moonshot AI now the top model on Hugging Face π₯ moonshotai/Kimi-K2-Thinking See translation π₯ 1 1 π€ 1 1 + Reply
view post Post 2888 πΈπ€You donβt need 100 GPUs to train something amazing!Our Smol Training Playbook teaches you a better path to world-class LLMs, for free! Check out the #1 trending space on π€ : HuggingFaceTB/smol-training-playbook See translation π€ 7 7 π 3 3 π₯ 2 2 + Reply
RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies Paper β’ 2510.17950 β’ Published Oct 20, 2025 β’ 9
view post Post 2363 Cool stuff these past weeks on huggingface! π€ π !β’ πTrackio, local-first W&B alternativehttps://github.com/gradio-app/trackio/issuesβ’ πEmbeddingGemma, 300M-param, multilingual embeddings, on-devicehttps://huggingface.co/blog/embeddinggemmaβ’ π»Open LLMs in VS Code (Inference Providers)https://x.com/reach_vb/status/1966185427582497171β’ π€Smol2Operator GUI agentshttps://huggingface.co/blog/smol2operatorβ’ πΌοΈGradio visible watermarkinghttps://huggingface.co/blog/watermarking-with-gradio See translation π₯ 4 4 π€ 3 3 + Reply
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Paper β’ 2406.05862 β’ Published Jun 9, 2024 β’ 4
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper β’ 2505.17019 β’ Published May 22, 2025 β’ 4