Skip to content
View jbarnes850's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report jbarnes850

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
jbarnes850/README.md

Jarrod Barnes

I'm a researcher and founder working on open-ended scientific discovery. I run Dynamical Systems, where I build environments, evaluations, and verification systems that make scientific work trainable.

My work sits across RL, post-training, agent evaluation, and scientific ML, turning search, uncertainty, revision, tool use, and verification into learning signals for models operating in long-horizon environments.

Research threads

Training scientific judgment

Verified campaign environments convert search, trust, escalation, and revision into a multi-turn RL problem with physics-grounded oracle reward.

Scaling test-time verification for novel materials

Probe-gradient guidance extracts band-gap signal from an unconditional crystal diffusion model and steers sampling without retraining.

Self-improving pretraining as a substrate for agentic post-training

Synthetic thinking traces and self-improvement loops as a substrate for training models that can revise, critique, and extend their own work.

ATLAS: Adaptive Test-Time Learning for Agentic Systems

A continual learning framework that converts production agent trajectories into inference-time adaptation and on-policy distillation loops.

Open source

I contribute to inference and training infrastructure in the open-source ML stack:

Primary stack: Python, Rust, PyTorch, Ray, SGLang.

Website | LinkedIn | X | Email

Pinned Loading

  1. Arc-Computer/ATLAS Arc-Computer/ATLAS Public

    A Continual Learning Framework for Production LLM Agents

    Python 42 6

  2. deepseek-r1-finetune deepseek-r1-finetune Public

    A step by step guide to fine-tuning the DeepSeek R1 Distilled models on Apple Silicon machines.

    Python 59 9

  3. near-horizon/near-ai-agent-studio near-horizon/near-ai-agent-studio Public

    A production-ready starter kit for building AI agents and multi-agent swarms on NEAR.

    Python 28 12

  4. near-horizon/near-protocol-rewards near-horizon/near-protocol-rewards Public

    A transparent, metric-based rewards system for NEAR projects that directly ties incentives to development activity and user adoption.

    TypeScript 33 12

  5. test-time-training test-time-training Public

    Code for "Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation"

    Python 1

  6. mlx-disitrubted-training mlx-disitrubted-training Public

    A privacy-first distributed training framework built on MLX for Apple Silicon, enabling secure and efficient AI model training across multiple devices while preserving data privacy.

    Python 12 1