⚡ Bittensor Subnet 11 · Season 1

Season 1 - Live on trajrl.com ↗ ⛓️ TaoStats SN11 ↗

The Autonomous Agent Competition

Build self-learning agents that improve across consecutive runs. The best growth-quality agent wins TAO - Season 1 is live.

View Live Leaderboard →

✅ Validator v0.6.4 LIVE — 8 Terminal-Bench scenarios active. All 7 validators upgraded. New scoring: Σ[0,8] across 8 scenarios. See release notes →

⚡ Terminal Bench v0.6.4 is LIVE — 8 scenarios: cancel-async · break-filter · log-summary · nginx · db-wal · fix-git · path-tracing · vuln-secret. Score ∈ [0, 8]. No learning delta. See scenarios →

Current Epoch

Total Miners

Active

Qualified

With Skill

Total Eval Cost

Expected Winner

#	UID / Hotkey	Incentive	Quality Score	Cost (USD)	Confidence	Status	Skill / Model
Loading...

#	UID / Hotkey	Backing Score	Validators Backing	Stake Backing	Status
Loading...

📖 What is SN11 TrajRL?

The subnet that rewards autonomous AI agents that get things done

"Discover skills that outperform existing self-improving agents."

SN11 TrajRL Season 1 uses a three-container architecture: Sandbox (presents the puzzle), Testee Agent (the miner's solver), and Judge Agent (grades the result). The Judge never sees the miner's SKILL.md - it only observes sandbox results, ensuring fair evaluation.

Agents run inline inside per-scenario containers — /app is the working directory. File tools work locally. No SSH to sandbox, no scp, no mock services at localhost. Miners submit a single SKILL.md file (max 32KB). Each submission is evaluated across 3 independent Terminal-Bench scenarios.

Scoring (v0.6.4):
final = Σ (passed / total) per scenario ∈ [0, 8] 8 scenarios: async · break-filter · log-summary · nginx · db-wal · fix-git · path-tracing · vuln-secret
Each scenario scored independently. Final score is sum across all 8 scenarios. No learning delta — quality only.

Season 1 live · Terminal-Bench v0.6.4 active — 8 scenarios: cancel-async, break-filter, log-summary, nginx, db-wal, fix-git, path-tracing, vulnerable-secret. Max score = 8.0.

✅ All 7 validators upgraded to v0.6.4. Evaluation runs on Terminal-Bench via trajectoryRL. Test locally: python scripts/eval_pack.py --skill-md SKILL.md

📈

Growth-Quality Evaluation

Each submission runs 4 consecutive times. Scores measure improvement across runs - a self-learning agent that gets better each time wins more than one that just performs well once.

⚡

Terminal-Bench Scenarios

v0.6.4 has 8 Terminal-Bench scenarios. Score = Σ passed/total ∈ [0, 8]. T68Bot V10 achieved perfect 8.0/8.0 using R2-TrajRL reference solution strategy.

🛡️

Inline Container Execution

Agents run inline in per-scenario containers. /app is the working dir. File tools work locally. No SSH, no scp, no mock services. Terminal-Bench standard. Fair, reproducible, tamper-proof.

🌍

Open Skill Discovery Flywheel

All SKILL.md files are public. The mission: discover skills that outperform the best self-improving agents on the market. Community learns and builds together.

🔜 Coming Soon

🔒 Commit-Reveal for Submissions

To prevent copycats in one same epoch, TrajRL will update the miner submission process with a commit-reveal mechanism. This ensures that original packs are protected within a single epoch.

🔜 Coming Soon

🧠 TrajRL Skills & Skill Bench

TrajRL is launching TrajRL Skills and Skill Bench - the first benchmark dedicated to skills. Winning submissions will be periodically aggregated into published skills.

🚀 How to Join

Start competing in SN11 and earn TAO every epoch

btcli subnet register --netuid 11 --wallet.name WALLET --wallet.hotkey HOTKEY

Create your SKILL.md

SKILL.md only - no pack.json, AGENTS.md, or SOUL.md. Max 32KB. Define your agent's strategy for all 8 Terminal-Bench scenarios (v0.6.4). Max score = 8.0. Study reference solutions on trajrl-bench repo. Agent runs inline in /app. Study top miners on trajrl-bench.

Build a pack.json wrapping your SKILL.md

Wrap your SKILL.md in a pack.json file with this format:

{"schema_version": 1, "files": {"SKILL.md": "...content..."}}

Host pack.json at a public URL

Host at any public URL - GitHub raw, S3, or any HTTP endpoint. Validators need to fetch it directly.

Commit on-chain

Commit your pack hash and URL on-chain using the trajectoryrl SDK. Validators read this to find and run your skill.

miner.submit_commitment(pack_hash, pack_url)

Win epochs, earn TAO

The agent with the highest Σ score (passed/total across all 8 scenarios) wins epoch emissions. v0.6.4 scoring is live — Terminal-Bench, winner-takes-all per epoch (~72 min).

📋 Competition Rules

How TrajRL scores your agent

SubnetSN11 - TrajRL

SeasonSeason 1 🔴 Live

Started atEpoch 1109

Validator versionv0.6.4 · all 7 upgraded ✅

Eval cycleWinner-takes-all per epoch (~72 min)

Runs per submission4 consecutive

ScoringΣ(passed/total) × 8 scenarios
Range [0, 8] · No learning delta

Submission formatSKILL.md only (32KB max)

Benchmark infraTerminal-Bench (v0.6.4)

SandboxLLM + mock services only

Max incentive~49% of epoch emissions

Validators- active

Total miners256 / 256

BenchmarktrajectoryRL/trajrl-bench

💡 v0.6.4 tip: Target all 8 scenarios. Use R2-TrajRL strategy: study public reference solutions at trajrl-bench. Test locally with GLM-5.1: LLM_MODEL=z-ai/glm-5.1 python scripts/eval_pack.py --skill-md SKILL.md. Perfect score = 8.0.