Giordano Rogers
Publications
Blog
CV
Giordano Rogers
AI Interpretability Researcher
Follow
Boston, MA
Northeastern University
Email
GitHub
LinkedIn
Publications
Agents of Chaos
Natalie Shapira,
Giordano Rogers
, David Bau, et al.
[Under Peer Review]
PDF
Project Page
LLMs Process Lists With General Filter Heads
Arnab Sen Sharma,
Giordano Rogers
, Natalie Shapira, David Bau
ICLR
PDF
Project Page
Code
Do Natural Language Descriptions of Model Activations Convey Privileged Information?
Millicent Li, Alberto Mario Ceballos Arroyo,
Giordano Rogers
, Naomi Saphra, Byron C Wallace
PDF