Recent Publications
-
How Does Code Pretraining Affect Language Model Task Performance?
Jan 2025 • Jackson Petty, Sjoerd van Steenkiste and Tal Linzen -
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Oct 2024 • David Rein, Betty Li Hou, Asa Cooper Stickland, Jackson Petty, Richard Yuanzhe Pang, Julien Dirani, Julian Michael and Samuel R. Bowman -
The Illusion of State in State-Space Models
Apr 2024 • William Merrill, Jackson Petty and Ashish Sabharwal -
Debate Helps Supervise Unreliable Experts
Nov 2023 • Julian Michael, Salsabila Mahdi, David Rein, Jackson Petty, Julien Dirani, Vishakh Padmakumar and Samuel R. Bowman -
In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax
Nov 2023 • Aaron Mueller, Albert Webson, Jackson Petty and Tal Linzen
Writing
-
Building a Slackbot to DM Users
Ingredients: two API calls and a server; Time: 1hr; Serves anonymized DMs to facilitate asynchronous data collection. -
Nearer to G-d are We
A brief reflection on the roots of sacrifice for Pesach. Published in the “Passover 2022” issue of Shibboleth.