SketchingReality: From Freehand Scene Sketches To Photorealistic Images Paper • 2602.14648 • Published Feb 16
CountLoop: Training-Free High-Instance Image Generation via Iterative Agent Guidance Paper • 2508.16644 • Published Aug 18, 2025
BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation Paper • 2312.15363 • Published Dec 23, 2023
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published Mar 11, 2025 • 73
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation Paper • 2501.04144 • Published Jan 7, 2025 • 19
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation Paper • 2501.04144 • Published Jan 7, 2025 • 19
Scaling Transformers for Low-Bitrate High-Quality Speech Coding Paper • 2411.19842 • Published Nov 29, 2024 • 12
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion Paper • 2411.18552 • Published Nov 27, 2024 • 18