andyzoujm

Follow

Andy Zou andyzoujm

Follow

PhD student at CMU

185 followers · 0 following

Berkeley, CA
https://andyzoujm.github.io/

Achievements

Achievements

Pinned Loading

llm-attacks/llm-attacks llm-attacks/llm-attacks Public

Universal and Transferable Attacks on Aligned Language Models

Python 3.5k 479
representation-engineering representation-engineering Public

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 728 86
autocast autocast Public

Forecasting Future World Events with Neural Networks (NeurIPS 2022)

Jupyter Notebook 179 48
hendrycks/test hendrycks/test Public

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1.2k 92
pixmix pixmix Public

PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures (CVPR 2022)

Python 101 7
aypan17/machiavelli aypan17/machiavelli Public

Python 122 22