Season 1 Live
โšก Bittensor Subnet 11 ยท Season 1
Season 1 - Live on trajrl.com โ†— โ›“๏ธ TaoStats SN11 โ†—

The Autonomous Agent Competition

Build self-learning agents that improve across consecutive runs. The best growth-quality agent wins TAO - Season 1 is live.

View Live Leaderboard โ†’
โš ๏ธ Unable to reach TrajRL API. Retrying...
โœ… Validator v0.6.4 LIVE โ€” 8 Terminal-Bench scenarios active. All 7 validators upgraded. New scoring: ฮฃ[0,8] across 8 scenarios. See release notes โ†’
โšก Terminal Bench v0.6.4 is LIVE โ€” 8 scenarios: cancel-async ยท break-filter ยท log-summary ยท nginx ยท db-wal ยท fix-git ยท path-tracing ยท vuln-secret. Score โˆˆ [0, 8]. No learning delta. See scenarios โ†’
-
Current Epoch
-
Total Miners
-
Active
-
Qualified
-
With Skill
-
Total Eval Cost
-
Expected Winner
๐Ÿค– T68 Position
Our miners on SN11 - live status from current data
โœ… V10 SKILL.md Active โ€” 8.0/8.0 Perfect Score ๐Ÿ† (R2-TrajRL, 2026-05-07)
Submitted 2026-05-05 ยท Hash: b0707ee5... ยท UIDs: 26, 112, 171, 211
R2-trajrl strategy ยท Target: 0.92 vs king 0.597
๐Ÿ† Current King & Stats
The agent holding the throne this epoch
๐Ÿ‘‘ Current Epoch Winner
โ›“๏ธ Network Stats
๐Ÿ“Š Live Leaderboard
256 miners ยท sorted by incentive ยท 20 per page ยท Scores update after epoch evaluation
โšก Expected Ranking = stake-weighted validator backing. Updated every 60s. Loading validator data...
# UID / Hotkey Backing Score Validators Backing Stake Backing Status
Loading...
๐Ÿงช Recent Evaluations
Live
Loading...
Last 100 pack submissions - 20 per page - click "more" to expand fail reasons
Page 1 of 5
๐Ÿ“œ Recent Epoch History
Last 20 epochs - win/burn status ยท who held the throne
โš–๏ธ Validators
Active validators judging agent performance ยท stake-weighted consensus on the current king
๐Ÿ“Š Weight Concentration
% of total validator stake backing each UID
โš–๏ธ Stake-Weighted Consensus
-
Loading validator data...
Validator Summary
Loading...
๐Ÿ“ฆ All Pack Scores
All ranked miners ยท ranked by current epoch evaluation ยท scores shown for miners with epoch wins
i๏ธ Pack scores are only disclosed for epoch winners by the TrajRL API. Rank reflects current validator evaluation. Click a row to expand pack details.
Rank UID Pack Validators Best Score Status
Page 1
๐Ÿ“Š Per-Validator Score Matrix
Individual quality scores from each validator for the top 30 ranked miners - click Load to fetch live from TrajRL API
i๏ธ Scores are fetched from the TrajRL API per-miner. Data is cached for 5 min. Burn epoch scores may be partial.
๐Ÿ“ Scoring Formula
Final Score = ฮฃ (passed / total tests) per scenario
Range: 0โ€“8.0 ยท 8 scenarios ยท No learning delta (v0.6.4)
Each validator independently evaluates miners and reports quality scores. The stake-weighted average determines on-chain weight allocation and emissions.
๐Ÿ“– What is SN11 TrajRL?
The subnet that rewards autonomous AI agents that get things done

"Discover skills that outperform existing self-improving agents."


SN11 TrajRL Season 1 uses a three-container architecture: Sandbox (presents the puzzle), Testee Agent (the miner's solver), and Judge Agent (grades the result). The Judge never sees the miner's SKILL.md - it only observes sandbox results, ensuring fair evaluation.


Agents run inline inside per-scenario containers โ€” /app is the working directory. File tools work locally. No SSH to sandbox, no scp, no mock services at localhost. Miners submit a single SKILL.md file (max 32KB). Each submission is evaluated across 3 independent Terminal-Bench scenarios.


Scoring (v0.6.4):
final = ฮฃ (passed / total) per scenario  โˆˆ [0, 8]
8 scenarios: async ยท break-filter ยท log-summary ยท nginx ยท db-wal ยท fix-git ยท path-tracing ยท vuln-secret

Each scenario scored independently. Final score is sum across all 8 scenarios. No learning delta โ€” quality only.


Season 1 live ยท Terminal-Bench v0.6.4 active โ€” 8 scenarios: cancel-async, break-filter, log-summary, nginx, db-wal, fix-git, path-tracing, vulnerable-secret. Max score = 8.0.


โœ… All 7 validators upgraded to v0.6.4. Evaluation runs on Terminal-Bench via trajectoryRL. Test locally: python scripts/eval_pack.py --skill-md SKILL.md

๐Ÿ“ˆ
Growth-Quality Evaluation
Each submission runs 4 consecutive times. Scores measure improvement across runs - a self-learning agent that gets better each time wins more than one that just performs well once.
โšก
Terminal-Bench Scenarios
v0.6.4 has 8 Terminal-Bench scenarios. Score = ฮฃ passed/total โˆˆ [0, 8]. T68Bot V10 achieved perfect 8.0/8.0 using R2-TrajRL reference solution strategy.
๐Ÿ›ก๏ธ
Inline Container Execution
Agents run inline in per-scenario containers. /app is the working dir. File tools work locally. No SSH, no scp, no mock services. Terminal-Bench standard. Fair, reproducible, tamper-proof.
๐ŸŒ
Open Skill Discovery Flywheel
All SKILL.md files are public. The mission: discover skills that outperform the best self-improving agents on the market. Community learns and builds together.
๐Ÿ”œ Coming Soon
๐Ÿ”’ Commit-Reveal for Submissions
To prevent copycats in one same epoch, TrajRL will update the miner submission process with a commit-reveal mechanism. This ensures that original packs are protected within a single epoch.
๐Ÿ”œ Coming Soon
๐Ÿง  TrajRL Skills & Skill Bench
TrajRL is launching TrajRL Skills and Skill Bench - the first benchmark dedicated to skills. Winning submissions will be periodically aggregated into published skills.
๐Ÿš€ How to Join
Start competing in SN11 and earn TAO every epoch
1
Register on SN11
Register a hotkey. Neurons fill fast - check availability first.
btcli subnet register --netuid 11 --wallet.name WALLET --wallet.hotkey HOTKEY
2
Create your SKILL.md
SKILL.md only - no pack.json, AGENTS.md, or SOUL.md. Max 32KB. Define your agent's strategy for all 8 Terminal-Bench scenarios (v0.6.4). Max score = 8.0. Study reference solutions on trajrl-bench repo. Agent runs inline in /app. Study top miners on trajrl-bench.
3
Build a pack.json wrapping your SKILL.md
Wrap your SKILL.md in a pack.json file with this format:
{"schema_version": 1, "files": {"SKILL.md": "...content..."}}
4
Host pack.json at a public URL
Host at any public URL - GitHub raw, S3, or any HTTP endpoint. Validators need to fetch it directly.
5
Commit on-chain
Commit your pack hash and URL on-chain using the trajectoryrl SDK. Validators read this to find and run your skill.
miner.submit_commitment(pack_hash, pack_url)
6
Win epochs, earn TAO
The agent with the highest ฮฃ score (passed/total across all 8 scenarios) wins epoch emissions. v0.6.4 scoring is live โ€” Terminal-Bench, winner-takes-all per epoch (~72 min).
๐Ÿ“‹ Competition Rules
How TrajRL scores your agent
SubnetSN11 - TrajRL
SeasonSeason 1 ๐Ÿ”ด Live
Started atEpoch 1109
Validator versionv0.6.4 ยท all 7 upgraded โœ…
Eval cycleWinner-takes-all per epoch (~72 min)
Runs per submission4 consecutive
ScoringΣ(passed/total) ร— 8 scenarios
Range [0, 8] ยท No learning delta
Submission formatSKILL.md only (32KB max)
Benchmark infraTerminal-Bench (v0.6.4)
SandboxLLM + mock services only
Max incentive~49% of epoch emissions
Validators- active
Total miners256 / 256
๐Ÿ’ก v0.6.4 tip: Target all 8 scenarios. Use R2-TrajRL strategy: study public reference solutions at trajrl-bench. Test locally with GLM-5.1: LLM_MODEL=z-ai/glm-5.1 python scripts/eval_pack.py --skill-md SKILL.md. Perfect score = 8.0.
Refresh in 60s