I’m a first-year PhD @CityUHK, advised by Ning Miao. Previously, I worked as a master student @Fudan DISC (affiliated to FudanNLP), advised by Zhongyu Wei and Siyuan Wang. I was a research intern @XLANG, advised by Tao Yu. I earned my bachelor’s degree from @Beijing Institute of Technology.
My research focuses on reward design (ALaRM, HAF-RM), evaluation (DS-1000, ARKS), and math/code reasoning (VHG, SSAE). I’m particularly interested in the efficient oversight of human-level/superhuman models.