Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 10 days ago • 151
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published 10 days ago • 153
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing Paper • 2511.19963 • Published Nov 25, 2025 • 2
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 19 days ago • 18
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published Feb 20 • 30
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 493
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 222
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17, 2025 • 50
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17, 2025 • 51
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 145
Lynx: Towards High-Fidelity Personalized Video Generation Paper • 2509.15496 • Published Sep 19, 2025 • 13
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper • 2506.23552 • Published Jun 30, 2025 • 10
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published Jun 9, 2025 • 27
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper • 2504.00557 • Published Apr 1, 2025 • 15
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12, 2025 • 42
SANA-Sprint Collection 🏃SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated 26 days ago • 44
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published Dec 23, 2024 • 41