AI & ML interests
Efficient AI
Papers
RelayGen: Intra-Generation Model Switching for Efficient Reasoning
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning
SNU-VLSI's datasets
None public yet