Yangsibo Huang

Yangsibo Huang

Welcome! I am a Ph.D. candidate and Wallace Memorial Fellow at Princeton University. I am very fortunate to be co-advised by Prof. Kai Li and Prof. Sanjeev Arora. I have been doing research at the intersection of machine learning, systems, and policy, with a focus on auditing and improving machine learning systems’ compliance with policies, from the perspectives of

I also believe in the power of community efforts to enhance the trustworthiness and transparency of machine learning systems. Recently, we (with researchers from 13 institutes) advocate for A Safe Harbor for AI Evaluation and Red Teaming, encouraging AI companies to provide legal and technical protections for good faith research on their AI models. We also release an open letter (signed by 300+ researchers, and reported by The Washington Post, VentureBeat, AIPwn, and Computerworld).

I did my undergrad in Computer Science at Zhejiang University. I also spent a great semester at Harvard Medical School under the supervision of Prof. Quanzheng Li.

News

Selected Publications and Manuscripts

Please refer to publications or my Google Scholar profile for the full list. ("(α)" stands for alphabetical order)


  1. Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
  2. Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation
    Yangsibo Huang, Samyak Gupta, Mengzhou Xia, Kai Li, and Danqi Chen
  3. Detecting Pretraining Data from Large Language Modelss
  4. (α) LabelDP-Pro: Learning with Label Differential Privacy via Projections
  1. (α) Sparsity-Preserving Differentially Private Training
  2. Privacy Implications of Retrieval-Based Language Models
    Yangsibo Huang, Samyak Gupta, Zexuan Zhong, Kai Li, and Danqi Chen
  1. Recovering Private Text in Federated Learning of Language Models
  2. A Dataset Auditing Method for Collaboratively Trained Machine Learning Models
    Yangsibo Huang, Chun-Yin Huang, Xiaoxiao Li, and Kai Li
  1. Evaluating Gradient Inversion Attacks and Defenses in Federated Learning
    Yangsibo Huang, Samyak Gupta, Zhao Song, Kai Li, and Sanjeev Arora

      Service

      Contact me

      You are welcome to reach out to me via email.

      MISC

      In my spare time, I mainly stay with my four cats 😺😻😼😽.

      rss facebook twitter github gitlab youtube mail spotify lastfm instagram linkedin google google-plus pinterest medium vimeo stackoverflow reddit quora quora