askModuAI's profile picture. The secure standard runtime for coding agents. Run, secure, and benchmark Claude Code, Amp, Codex & more — from one CLI or Slack. Join waitlist: http://askmodu.com

Modu

@askModuAI

The secure standard runtime for coding agents. Run, secure, and benchmark Claude Code, Amp, Codex & more — from one CLI or Slack. Join waitlist: http://askmodu.com

Modu أعاد

Sandboxes or agent runtime (ie e2b, daytona, vercel, modal, cloudflare) will be the fastest growing piece of infra due to agents the next few years, and we’re only at the tip of the iceberg of what all the possible needs for agents will be: dev environments, ephemeral, gpu…


Modu أعاد

Common flow we see: run a task across multiple agents (factory, amp, etc.) & pick what to merge The UX to review agentic work will be crucial as agents are used more but we also want to be as lightweight as possible since great interfaces will quickly evolve (ie cursor, github)


Side by side benchmarking for coding agents is here ❤️

Meet Agent Bake-Off: blind side by side tests for coding agents like Claude Code, @cursor_ai , @AmpCode , @FactoryAI , Codex, & more. We've been building the largest real world benchmarks for coding agents, and we're excited share an early preview of our open community tooling.



Modu أعاد

Who is doing really good product reviews of Claude Code/Cursor/VSCode right now? Like - who is actually tracking the ongoing comparison with any kind of rigour?



Modu أعاد

Still a WIP before we release our real world use coding agent benchmarks broadly but notable things from the past few weeks: 1. Flippening starting to happen with OAI Codex and Claude Code since gpt5-codex released 2. Sonnet 4.5 plus major releases like new…

brexton's tweet image. Still a WIP before we release our real world use coding agent benchmarks broadly but notable things from the past few weeks:

1. Flippening starting to happen with OAI Codex and Claude Code since gpt5-codex released
2. Sonnet 4.5 plus major releases like new…
brexton's tweet image. Still a WIP before we release our real world use coding agent benchmarks broadly but notable things from the past few weeks:

1. Flippening starting to happen with OAI Codex and Claude Code since gpt5-codex released
2. Sonnet 4.5 plus major releases like new…

OpenAI's efforts to catch up with Anthropic's code-writing AI seem to be working: OpenAI's Codex has pulled ahead of Anthropic's Claude Code assistant by some measures, and its popularity with developers is catching up too, based on new data from Modu: theinformation.com/articles/opena…



Modu أعاد

OpenAI's efforts to catch up with Anthropic's code-writing AI seem to be working: OpenAI's Codex has pulled ahead of Anthropic's Claude Code assistant by some measures, and its popularity with developers is catching up too, based on new data from Modu: theinformation.com/articles/opena…


Modu أعاد

A few days into this week, even after @FactoryAI and @AnthropicAI big launches, @AmpCode still has the highest merged PR success rates (it's my personal daily driver) My prediction for the next two weeks is that Factory climbs up with CLI support and same with CC as 4.5 gets…

brexton's tweet image. A few days into this week, even after @FactoryAI and @AnthropicAI big launches, @AmpCode still has the highest merged PR success rates (it's my personal daily driver)

My prediction for the next two weeks is that Factory climbs up with CLI support and same with CC as 4.5 gets…

Modu أعاد

I'm giving it a week before @claudeai Code flips @OpenAI Codex again with 4.5 in merge PR success rates Really good decision making, plus CC is non-trivially better than Codex when it comes to core tool-use (like raising PRs)

brexton's tweet image. I'm giving it a week before @claudeai Code flips @OpenAI Codex again with 4.5 in merge PR success rates

Really good decision making, plus CC is non-trivially better than Codex when it comes to core tool-use (like raising PRs)

Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.

claudeai's tweet image. Introducing Claude Sonnet 4.5—the best coding model in the world.

It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.


Modu أعاد

Side to side benchmarking for a given task among coding agents is the most common action for @askModuAI Releasing public benchmarks soon! Please DM if you use amp/codex/claude code/cursor/devin/etc & have strong opinions on patchviewer/diff UX, turns out it’s a heated debate 😊

who's building a tool so i can prompt both claude code and codex at the same time and have them fight to the death on the best solution?



Modu أعاد

Compounding retention for @askModuAI That's how it's done

brexton's tweet image. Compounding retention for @askModuAI 

That's how it's done

Modu أعاد

I was OOO for a week and codex suddenly got really good?? @askModuAI will release real world use benchmarks soon


All you need is Modu

This is madness. What are we doing!

shadcn's tweet image. This is madness. What are we doing!


Modu أعاد

open router for coding agents or something like that


Modu أعاد

Onboarding our largest enterprise customer by a mile onto the @askModuAI research preview today People underestimate just how slow businesses are reacting to AI transformation, Twitter is a bubble What a way to start the weekend!


Is something happening today


👀

“Agent coordination” will be the big need on the b2b side as: - businesses need to stay neutral on procurement from different foundational model labs - foundational model labs continue to verticalize by building more end-user interfaces/agents Already happening w code agents!



Big day for me to join a couple of teams's slacks IYKYK

askModuAI's tweet image. Big day for me to join a couple of teams's slacks

IYKYK

New domain spotted. Just wrestled with namecheap for 4 hours Time to create shareholder value

askModuAI's tweet image. New domain spotted. Just wrestled with namecheap for 4 hours

Time to create shareholder value

Prepping for some announcements... My soldiers, SCREAM OUT! MY SOLDIERS, RAGE!

askModuAI's tweet image. Prepping for some announcements...

My soldiers, SCREAM OUT! MY SOLDIERS, RAGE!

United States الاتجاهات

Loading...

Something went wrong.


Something went wrong.