About me

Hi! I am a Statistics Ph.D. student at the Wharton School, University of Pennsylvania. Previously, I obtained my M.S. in Computer Science (2020 – 2023) and B.S. in Mathematics (2016 – 2020) from Tsinghua University.

I am broadly interested in the theoretical aspects of modern machine learning, with a recent focus on foundation models and transformers. Feel free to reach out if you’d like to have a chat!

News:

  • Oct 2023: New paper on a theoretical analysis of the in-context learning dynamics of a one-layer transformer!