JeanKaddour

Follow

Jean Kaddour JeanKaddour

Follow

101 followers · 18 following

London
04:43 (UTC +01:00)
jeankaddour.com
@jeankaddour

Achievements

Achievements

Highlights

Pro

JeanKaddour/README.md

Hi, I'm Jean 👋

📍 London

Projects

⏱️ Sokoban Speedrun - The fastest recipe to teach Qwen3 Sokoban wins.
🎯 Target Policy Optimization - Turn GRPO into distribution matching
🍜 RamenGPT - Training GPT with a single GPU
🤖 Agentic Uncertainty - Measuring SWE agent uncertainty
🏋️ ReasoningGym - 100+ RL environments for LLM RLVR
🔬 PySpur - A visual playground for agentic workflows
🏋️‍♂️ No Train No Gain - Training BERT and T5 models
🧠 SIN - Causal inference with embedded treatments
⚖️ LAWA - LAtest Weight Averaging
🪒 WASAM - Weight-Averaged Sharpness-Aware Minimization
💫 PAML - Probabilistic Active Meta Learning

Pinned Loading

PySpur-Dev/pyspur PySpur-Dev/pyspur Public

A visual playground for agentic workflows: Iterate over your agents 10x faster

TypeScript 5.7k 425
open-thought/reasoning-gym open-thought/reasoning-gym Public

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1.4k 120
NoTrainNoGain NoTrainNoGain Public

Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)

Python 81 3
LAWA LAWA Public

Latest Weight Averaging (NeurIPS HITY 2022)

Python 33 2
WASAM WASAM Public

Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)

Python 28 2
SIN SIN Public

Causal Effect Inference for Structured Treatments (SIN) (NeurIPS 2021)

Python 42 5