Closing the feedback loop in production.
NeoSigma is a product-driven research lab building the intelligence layer that helps close the feedback loop between your customers, products, and AI systems.
We are an intensely technical team of researchers, engineers, and designers pushing the frontier of agentic systems and redefining the interface between humans and AI.
If you are interested in our mission, we would love to hear from you to join us!
Backed by angels and leaders like Jeff Dean and others from OpenAI, Mercor, Sierra, World Labs, Skild AI, Decagon, Databricks and others.
Victor Barres
Tau bench co-creator, Researcher at Sierra
Intelligence in an agent is as much the ability to solve problems as it is the ability to learn from experience and adapt to an ever-changing environment. Neosigma is paving the way towards making this an operational reality.
Shyamal Anadkat
ex-OpenAI, Applied Evals
Evals grounded in real usage are the foundation of systems that compound in quality over time. Companies that close the loop between production signals and evaluation will win.
Reah Miyara
Senior Director, Google · ex-OpenAI Post-Training Lead
Transforming performance in production environments requires much more than better models. It requires systems that learn from their own mistakes at scale.
Chirag Mahapatra
Director of Engineering, Mercor
The future of agent systems is automated evals driven by real-world failures. NeoSigma brings that to life: turning production issues into a continuous feedback loop that improves reliability without manual overhead.
Manoj Soundararajan
Product @Decagon
In production, the real challenge is making agents reliable across the long tail of constraints and user behavior. NeoSigma is addressing this by catching regressions, debugging failures, and maintaining evaluations and reliability as systems evolve and user behaviors drift.
Victor Barres
Tau bench co-creator, Researcher at Sierra
Intelligence in an agent is as much the ability to solve problems as it is the ability to learn from experience and adapt to an ever-changing environment. Neosigma is paving the way towards making this an operational reality.
Shyamal Anadkat
ex-OpenAI, Applied Evals
Evals grounded in real usage are the foundation of systems that compound in quality over time. Companies that close the loop between production signals and evaluation will win.
Reah Miyara
Senior Director, Google · ex-OpenAI Post-Training Lead
Transforming performance in production environments requires much more than better models. It requires systems that learn from their own mistakes at scale.
Chirag Mahapatra
Director of Engineering, Mercor
The future of agent systems is automated evals driven by real-world failures. NeoSigma brings that to life: turning production issues into a continuous feedback loop that improves reliability without manual overhead.
Manoj Soundararajan
Product @Decagon
In production, the real challenge is making agents reliable across the long tail of constraints and user behavior. NeoSigma is addressing this by catching regressions, debugging failures, and maintaining evaluations and reliability as systems evolve and user behaviors drift.
Victor Barres
Tau bench co-creator, Researcher at Sierra
Intelligence in an agent is as much the ability to solve problems as it is the ability to learn from experience and adapt to an ever-changing environment. Neosigma is paving the way towards making this an operational reality.
Shyamal Anadkat
ex-OpenAI, Applied Evals
Evals grounded in real usage are the foundation of systems that compound in quality over time. Companies that close the loop between production signals and evaluation will win.
Reah Miyara
Senior Director, Google · ex-OpenAI Post-Training Lead
Transforming performance in production environments requires much more than better models. It requires systems that learn from their own mistakes at scale.
Chirag Mahapatra
Director of Engineering, Mercor
The future of agent systems is automated evals driven by real-world failures. NeoSigma brings that to life: turning production issues into a continuous feedback loop that improves reliability without manual overhead.
Manoj Soundararajan
Product @Decagon
In production, the real challenge is making agents reliable across the long tail of constraints and user behavior. NeoSigma is addressing this by catching regressions, debugging failures, and maintaining evaluations and reliability as systems evolve and user behaviors drift.