Jarrod Barnes jbarnes850

Jarrod Barnes

I'm a researcher and founder working on open-ended scientific discovery. I run Dynamical Systems, where I build environments, evaluations, and verification systems that make scientific work trainable.

My work sits across RL, post-training, agent evaluation, and scientific ML, turning search, uncertainty, revision, tool use, and verification into learning signals for models operating in long-horizon environments.

Research threads

Training scientific judgment

Verified campaign environments convert search, trust, escalation, and revision into a multi-turn RL problem with physics-grounded oracle reward.

Scaling test-time verification for novel materials

Probe-gradient guidance extracts band-gap signal from an unconditional crystal diffusion model and steers sampling without retraining.

Self-improving pretraining as a substrate for agentic post-training

Synthetic thinking traces and self-improvement loops as a substrate for training models that can revise, critique, and extend their own work.

ATLAS: Adaptive Test-Time Learning for Agentic Systems

A continual learning framework that converts production agent trajectories into inference-time adaptation and on-policy distillation loops.

Open source

I contribute to inference and training infrastructure in the open-source ML stack:

SGLang: inference infrastructure
Slime: RL trainer

Primary stack: Python, Rust, PyTorch, Ray, SGLang.

Website | LinkedIn | X | Email

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jarrod Barnes jbarnes850

Achievements

Achievements

Highlights

Block or report jbarnes850

Jarrod Barnes

Research threads

Open source

Pinned Loading

Uh oh!