About

I’m an incoming PhD student @CityUHK, advised by Ning Miao. Previously, I was a master’s student @Fudan DISC (affiliated with FudanNLP), advised by Siyuan Wang and Zhongyu Wei. I was also a research intern @XLANG, advised by Tao Yu. I earned my bachelor’s degree from @Beijing Institute of Technology.

My work mainly focuses on reward design (ALaRM, HaF-RM) and evaluation (DS-1000, ARKS), with an emphasis on reasoning. I’m particularly interested in the efficient oversight of human-level and superhuman models.

Misc