Submitted by KouSiqi 32 Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders DENG Lab @ SJTU 113 3
Submitted by xuchenkai 17 LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding DENG Lab @ SJTU 38 2
Submitted by Yi Yang (SII) 12 Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight DENG Lab @ SJTU 92 2