Archit Manek

👋🏼 Hi, I’m Archit. I work on AI safety, specifically on whether we can trust AI systems to do what we think they’re doing. This is where I share some of that work.

2026

March

Do Models Know They're Being Tested? Probing Eval-Awareness Across Scale and Architecture

March 27, 2026 · 15 min

ICML '26 Mech Interp Workshop · OpenReview · arXiv

© 2026 Archit Manek