Our “Clone Brain” architecture allows you to create a digital representation of your mind—reflecting your knowledge, tone, ways of thinking, and even the purpose that drives your conversations. (For example, a leadership coach might direct their clone to mentor emerging managers, while a consultant might want their clone to focus on sales strategy and client onboarding.)

Up until now, many of our improvements have come from intuition, first principles, and a very basic testing suite. We want to increase the fidelity of each Clone Brain, ensuring it captures its owner’s unique style, knowledge, and conversational aims, while also being able to reason in new situations. But to do that, we need rigorous measurements and interpretability tools that transform “it feels right” into “we have metrics & benchmarks that prove it.”

Enter the Research Engineer – Evals & Interpretability. You’ll develop frameworks that quantify how well each digital clone mirrors the authenticity and expertise of its human counterpart, while also building the tooling to open the black box and figure out why the clone behaves the way it does. If you’re curious about cognitive science, neural network interpretability, and the essence of what makes a human mind unique—this role has your name on it.

What You Will Work On

  1. Frontier Eval Systems & Metrics
  2. Interpretability & Debugging
  3. Collaboration & Deployment
  4. Infrastructure & Tooling

Preferred Abilities

Why You Might Like This Role