Hi! I am a Statistics Ph.D. student at the Wharton School, University of Pennsylvania. Previously, I obtained my M.S. in Computer Science (2020 – 2023) and B.S. in Mathematics (2016 – 2020) from Tsinghua University.
I am broadly interested in the theoretical aspects of modern machine learning, recently focusing on foundation models and transformers. Feel free to reach out if you’d like to have a chat!
- Oct 2023: New paper on the theoretical exploration of the in-context learning dynamics of the one-layer transformer!