December 23rd 2025 βοΈ
Live Previews (beta)
- Live preview available in beta to all users.
- Start new template project from cto.new.
- Add build and run commands to task runner.
- Setup agent attempts build and run commands.
Smarter model selection
- Route tasks based on quality, speed, capacity, and cost.
- Expanded model support, including:
- GPT-5.2
- Sonnet 4.5 - GPT 5.2 alloy
- Claude Opus 4.5
- Gemini 3 Flash preview)
- GLM-4.7
- MiniMax M2.1
- Devstral 2
Engine-agent chat improvements
- Faster, more reliable task planning.
- Better visibility into tool usage during runs.
- Tasks started from chat are now properly scoped and isolated.
- Simplified planning and draft task UX.
MCP overhaul
- Unified MCP system with clearer setup and fewer edge cases.
- Global MCPs can now be configured once and reused in agents and chats.
- Per-chat MCP selection for more control.
- Improved support for HTTP, SSE, and stdio MCPs.
- Better error handling and clearer tool naming.
Reliability & performance
- Faster VM startup times.
- More robust VM lifecycle handling.
- Better handling of temporary provider outages.
Benchmark
- Introduced cto bench, measuring real-world task success rates across models.
- Live chart and leaderboard updated every day.
UI & UX polish
- Mobile improvements.
- Keyboard shortcut: β + Enter throughout the app.
- Create empty repo from cto.new.
- Clearer task and preview states.
- General responsiveness and stability improvements.
November 26th 2025
Product & UX
- Add βget codeβ buttons for easily copyable git checkout commands for a task.
- Add task preview in alpha.
- Clearer draft PR/MR status.
- Chats moved to their own page.
- Add privacy mode for zero data use.
Models & Routing
- Added GPT-5.1 and GPT-5.1 Codex and removed GPT-5 and GPT-5 Codex.
- Added GPT-5.1 Codex Mini.
- Added Gemini 3 Pro Preview.
- Added Kimi K2 Thinking.
- Added MiniMax M2.
- Improved and updated auto model routing.
Infra, Events & Git
- Harden events queueing and retries.
- Update GitHub webhooks and improve git reliability.
- Status page: status.cto.new.
November 15th 2025
Models
- Added auto-model selection.
- Haiku set as chat default for improved speed and tool calling.
- Tasks started from drafts use preferred model.
Chat
- Optimistic send for chat messages.
- Cancel chat messages.
- 20k character message limit.
UI/UX
- Git checkout command in task view.
- Fix chat text area resizing, mobile fixes, dark-mode tweaks.
- Surfaced secrets error messages.
Auth and Invites
- Update invite eligibility to prevent abuse.
- Google sign-in restored.
Reliability and Bugfixes
- Fix diff saving.
- Race fix on user/org creation.
- Validate env/secret format.
- Fix βghost queuedβ tasks.
October 31st 2025 π
Chat & UX
- Fixed chat text area resize.
- Optimistic send - show user messages instantly.
- Chat summaries: agent can now reference last 10 chats.
Models & Memory
- Added memory to chat agent.
- Improved chat label generation.
- Added Haiku, improved Anthropic prompt caching.
October 29th 2025
Tasks & Workflow
- Shorter draft titles.
- Chat agent now aware of tasks.
Models & Execution
- Anthropic fully restored (incl. Alloy).
- Add GLM-4.6.
- Improve prompt caching.
Abuse & Rate Limits
- 20k-char cap on first chat message.
- Domain blocks for obvious botting.
- Added CAPTCHA and bot blocks.
Git & Repos
- Hardened git token fetching/retries.
Reliability & Ops
- Removed legacy ticket plans/speculative mode; removed context7 MCP and Anthropic web_search tool.
- Start routing tasks to Northflank for scalability.
UI
- Dark mode.
- Text area resize fixes.
October 22nd 2025
Access & onboarding
- Invite UX: add invite button; generate all available codes.
- Team invites: invited team members no longer need codes.
- Sign-up protections to block bots.
Draft tasks & task UX
- Prevent multiple tasks per draft.
- Save draft edits on start; show actionable controls only when valid.
Models & routing
- Add AWS Bedrock for Anthropic model failover.
- Better labels & long-chat handling; include model name in system prompt.
GitHub & repos
- Fix Start GitHub issue from @cto tag.
- Branch selector no longer crashes if branch deleted.
Reliability & safety
- Fewer flaky runs: increase events timeout; harden Git token fetching/retries; empty-repo/default-branch fixes.
- Auto-ban abusive agent chat calls (unknown adapter/invalid schema).