All open roles
Candidates are matched on Reeval. Get started on the homepage — create an account and complete your professional profile so we can reach out when it's a fit.
Role focus
AI systems behind automated evals, prompt optimization, model routing, and AI-powered analysis of traces and logs—production AI engineering, not thin API glue.
Responsibilities
- Design and build AI features (eval systems, prompt optimization).
- LLM-as-judge pipelines for quality assessment.
- Routing, caching, and fallbacks for models.
- Experiment with new models and ship to product.
- Work with customers on AI workflow pain points.
Minimum qualifications
- Strong Python and TypeScript.
- Production LLM experience beyond prototypes.
- Prompt engineering, eval methods, model behavior.
- In person in the Bay Area.
Preferred qualifications
- Fine-tuning, RAG, or agents.
- ML infrastructure or MLOps.
- Observability or developer tools.
- Open-source AI/ML.