Modu
@askModuAI
The secure standard runtime for coding agents. Run, secure, and benchmark Claude Code, Amp, Codex & more — from one CLI or Slack. Join waitlist: http://askmodu.com
Sandboxes or agent runtime (ie e2b, daytona, vercel, modal, cloudflare) will be the fastest growing piece of infra due to agents the next few years, and we’re only at the tip of the iceberg of what all the possible needs for agents will be: dev environments, ephemeral, gpu…
Common flow we see: run a task across multiple agents (factory, amp, etc.) & pick what to merge The UX to review agentic work will be crucial as agents are used more but we also want to be as lightweight as possible since great interfaces will quickly evolve (ie cursor, github)
Side by side benchmarking for coding agents is here ❤️
Meet Agent Bake-Off: blind side by side tests for coding agents like Claude Code, @cursor_ai , @AmpCode , @FactoryAI , Codex, & more. We've been building the largest real world benchmarks for coding agents, and we're excited share an early preview of our open community tooling.
Who is doing really good product reviews of Claude Code/Cursor/VSCode right now? Like - who is actually tracking the ongoing comparison with any kind of rigour?
Still a WIP before we release our real world use coding agent benchmarks broadly but notable things from the past few weeks: 1. Flippening starting to happen with OAI Codex and Claude Code since gpt5-codex released 2. Sonnet 4.5 plus major releases like new…
OpenAI's efforts to catch up with Anthropic's code-writing AI seem to be working: OpenAI's Codex has pulled ahead of Anthropic's Claude Code assistant by some measures, and its popularity with developers is catching up too, based on new data from Modu: theinformation.com/articles/opena…
OpenAI's efforts to catch up with Anthropic's code-writing AI seem to be working: OpenAI's Codex has pulled ahead of Anthropic's Claude Code assistant by some measures, and its popularity with developers is catching up too, based on new data from Modu: theinformation.com/articles/opena…
A few days into this week, even after @FactoryAI and @AnthropicAI big launches, @AmpCode still has the highest merged PR success rates (it's my personal daily driver) My prediction for the next two weeks is that Factory climbs up with CLI support and same with CC as 4.5 gets…
I'm giving it a week before @claudeai Code flips @OpenAI Codex again with 4.5 in merge PR success rates Really good decision making, plus CC is non-trivially better than Codex when it comes to core tool-use (like raising PRs)
Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.
Side to side benchmarking for a given task among coding agents is the most common action for @askModuAI Releasing public benchmarks soon! Please DM if you use amp/codex/claude code/cursor/devin/etc & have strong opinions on patchviewer/diff UX, turns out it’s a heated debate 😊
who's building a tool so i can prompt both claude code and codex at the same time and have them fight to the death on the best solution?
I was OOO for a week and codex suddenly got really good?? @askModuAI will release real world use benchmarks soon
All you need is Modu
open router for coding agents or something like that
Onboarding our largest enterprise customer by a mile onto the @askModuAI research preview today People underestimate just how slow businesses are reacting to AI transformation, Twitter is a bubble What a way to start the weekend!
👀
“Agent coordination” will be the big need on the b2b side as: - businesses need to stay neutral on procurement from different foundational model labs - foundational model labs continue to verticalize by building more end-user interfaces/agents Already happening w code agents!
New domain spotted. Just wrestled with namecheap for 4 hours Time to create shareholder value
Prepping for some announcements... My soldiers, SCREAM OUT! MY SOLDIERS, RAGE!
United States الاتجاهات
- 1. #BUNCHITA 1,377 posts
- 2. #SmackDown 44.9K posts
- 3. Tulane 4,234 posts
- 4. Aaron Gordon 3,569 posts
- 5. Giulia 14.5K posts
- 6. Supreme Court 183K posts
- 7. Russ 13.6K posts
- 8. Frankenstein 77K posts
- 9. Connor Bedard 2,821 posts
- 10. #TheLastDriveIn 3,608 posts
- 11. #TheFutureIsTeal N/A
- 12. Podz 2,937 posts
- 13. #OPLive 2,246 posts
- 14. Caleb Wilson 5,669 posts
- 15. Northwestern 5,011 posts
- 16. Memphis 16.2K posts
- 17. Justice Jackson 5,422 posts
- 18. Keon 1,166 posts
- 19. Scott Frost N/A
- 20. Tatis 1,994 posts
Something went wrong.
Something went wrong.