Ming Zhang
konglongge
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 16 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards
liked a dataset about 1 month ago
llmeval-fdu/LLMEval-Logic
upvoted a paper about 1 month ago
LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening