Cajal

Roadmap & Position in Formal Verification

Deploys AI mathematicians that formally verify proofs, grounding outputs in truth not guesses.

Company Overview

Massively scaling formal verification to accelerate scientific discovery. Deploys superhuman AI mathematicians to high-impact applied domains, starting with quantum computing and finance. Uses Lean, a framework for formally verifying mathematical statements, to ground AI in truth and validate discovered tools.

What They're Building

The company's public product roadmap & what they're committed to building.

Tau multi-agent system that collaborates to discover and verify new mathematical proofs in Lean. Given a research direction, Tau autonomously formalizes large corpora of applied mathematics, discovering novel results with real-world applications. Every result is machine-verified by Lean's type-checking kernel. Also partners with frontier AI labs and research institutes through datasets, evals, and RL environments.

Latest Intelligence

Zeitgeist tracks private signals to determine where the company is heading strategically.

Verified Math Becomes Scientific Infrastructure

May 13, 2026

Confidence:

High

New Intel: Cajal is building AI mathematicians that verify proofs in Lean, starting with quantum computing and finance. The wedge is high-stakes research where plausible answers are useless unless they can be checked.

Founder and Key Execs

Luke Johnston, Co-Founder

Co-Founder

Pedro Nobre, Co-Founder

Co-Founder

Founder Force Multiplier

Luke Johnston has a background in machine learning and neuroscience research at Oxford, Cambridge, and UCL. Pedro Nobre published what he believes to be the first major error in a physics paper found through the process of formalization.

Funding History

2025 | Luke Johnston and Pedro Nobre co-found Cajal.
2026 | Accepted into Y Combinator W26 batch.
2026 | Tau multi-agent system development; partnering with frontier AI labs.

Competitors

Formal Verification

Lean community, Coq, Isabelle/HOL.

AI Math

AlphaProof (DeepMind), LEGO-Prover.

AI Research

Sakana AI (AI Scientist), Aemon (YC W26).

Quantum Computing Software

Xanadu (PennyLane), Qiskit (IBM).

Cajal

's Moat:

Formal verification via Lean produces machine-checked proofs, not probabilistic outputs. Once a research team's proof library is built on Cajal's framework, migrating means re-proving everything from scratch. The intersection of LLM generation and formal verification is a narrow talent pool that Cajal's team already occupies.

How They're Leveraging AI

Autoformalization via LLMs

Tau uses LLMs to automatically translate informal mathematical statements and proofs into formally verified Lean 4 code, eliminating manual formalization bottlenecks.

Multi-Agent Proof Search

Tau deploys multiple specialized AI agents that collaboratively search for novel mathematical proofs, coordinating exploration strategies to discover results no single agent could find alone.

Synthetic Proof Data Generation

Tau generates synthetic formal proof datasets by systematically mutating and recombining verified proofs, creating a self-improving training flywheel that continuously enhances its own theorem-proving capabilities.

AI Use Overview:

Using LLM autoformalization into verified Lean 4 code, multi-agent collaborative proof search, and synthetic proof generation for a self-improving training flywheel.

More Similar Companies

Arena (formerly LLMArena)

Crowdsourced human-preference benchmarking platform for LLMs and generative AI models.

Neutral third-party evaluation becomes critical infrastructure as model proliferation outpaces any single lab's ability to grade itself credibly.

Ashr

Catches AI agent failures before users see them by stress-testing across text, voice, and images.

AI agents are shipping to production faster than anyone can test them. Ashr generates synthetic users that stress-test agents across text, voice, and images before real users hit the failure modes.

Cascade

Evaluates and certifies AI agents for safe deployment with red teaming and formal guarantees.

Red teaming and guardrails exist as separate tools. Cascade combines them into one platform with adaptive scaffolding that learns from production runs, already deployed across legal reasoning and customer support agents. The CEO researched graph reasoning and agentic safety at UC Berkeley's BAIR Lab.

Envariant

Lets model builders inspect and steer AI behavior inside the latent space to catch failures.

Most AI safety tools work on model outputs. Envariant operates inside the latent space itself, detecting hallucinations and drift at the representation level before they surface. Beta SDK launched with applications in text LLMs, robotic agents, and protein models.

Back To All Companies >