Modu

@askModuAI

The secure standard runtime for coding agents. Run, secure, and benchmark Claude Code, Amp, Codex & more — from one CLI or Slack. Join waitlist: http://askmodu.com

askmodu.com

انضم في مارس 2025

88المنشورات 506المتابعون 8المتابَعون

Modu أعاد

brexton

@brexton

9 س

Sandboxes or agent runtime (ie e2b, daytona, vercel, modal, cloudflare) will be the fastest growing piece of infra due to agents the next few years, and we’re only at the tip of the iceberg of what all the possible needs for agents will be: dev environments, ephemeral, gpu…

Modu أعاد

brexton

@brexton

11 س

Common flow we see: run a task across multiple agents (factory, amp, etc.) & pick what to merge The UX to review agentic work will be crucial as agents are used more but we also want to be as lightweight as possible since great interfaces will quickly evolve (ie cursor, github)

Modu

@askModuAI

٤ نوفمبرم

Side by side benchmarking for coding agents is here ❤️

brexton

@brexton

٤ نوفمبرم

Meet Agent Bake-Off: blind side by side tests for coding agents like Claude Code, @cursor_ai , @AmpCode , @FactoryAI , Codex, & more. We've been building the largest real world benchmarks for coding agents, and we're excited share an early preview of our open community tooling.

Modu أعاد

brexton

@brexton

١٤ أكتوبرم

.@askModuAI

Matt Pocock

@mattpocockuk

٥ أكتوبرم

Who is doing really good product reviews of Claude Code/Cursor/VSCode right now? Like - who is actually tracking the ongoing comparison with any kind of rigour?

Modu أعاد

brexton

@brexton

٩ أكتوبرم

Still a WIP before we release our real world use coding agent benchmarks broadly but notable things from the past few weeks: 1. Flippening starting to happen with OAI Codex and Claude Code since gpt5-codex released 2. Sonnet 4.5 plus major releases like new…

brexton's tweet image. Still a WIP before we release our real world use coding agent benchmarks broadly but notable things from the past few weeks:

1. Flippening starting to happen with OAI Codex and Claude Code since gpt5-codex released
2. Sonnet 4.5 plus major releases like new…

Stephanie Palazzolo

@steph_palazzolo

٩ أكتوبرم

OpenAI's efforts to catch up with Anthropic's code-writing AI seem to be working: OpenAI's Codex has pulled ahead of Anthropic's Claude Code assistant by some measures, and its popularity with developers is catching up too, based on new data from Modu: theinformation.com/articles/opena…

steph_palazzolo's tweet card. OpenAI’s effort to catch up to Anthropic in code-generating artificial intelligence seems to be working.New data show OpenAI’s Codex coding assistant has pulled ahead of Anthropic’s Claude Code...

OpenAI Is Catching Up To Anthropic in AI Coding

المصدر: theinformation.com

Modu أعاد

Stephanie Palazzolo

@steph_palazzolo

٩ أكتوبرم

OpenAI Is Catching Up To Anthropic in AI Coding

المصدر: theinformation.com

Modu أعاد

brexton

@brexton

٢ أكتوبرم

A few days into this week, even after @FactoryAI and @AnthropicAI big launches, @AmpCode still has the highest merged PR success rates (it's my personal daily driver) My prediction for the next two weeks is that Factory climbs up with CLI support and same with CC as 4.5 gets…

brexton's tweet image. A few days into this week, even after @FactoryAI and @AnthropicAI big launches, @AmpCode still has the highest merged PR success rates (it's my personal daily driver)

My prediction for the next two weeks is that Factory climbs up with CLI support and same with CC as 4.5 gets…

Modu أعاد

brexton

@brexton

٢٩ سبتمبرم

I'm giving it a week before @claudeai Code flips @OpenAI Codex again with 4.5 in merge PR success rates Really good decision making, plus CC is non-trivially better than Codex when it comes to core tool-use (like raising PRs)

Claude

@claudeai

٢٩ سبتمبرم

Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.

claudeai's tweet image. Introducing Claude Sonnet 4.5—the best coding model in the world.

It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.

Modu أعاد

brexton

@brexton

٩ سبتمبرم

Side to side benchmarking for a given task among coding agents is the most common action for @askModuAI Releasing public benchmarks soon! Please DM if you use amp/codex/claude code/cursor/devin/etc & have strong opinions on patchviewer/diff UX, turns out it’s a heated debate 😊

Josh Pigford

@Shpigford

٨ سبتمبرم

who's building a tool so i can prompt both claude code and codex at the same time and have them fight to the death on the best solution?

Modu أعاد

brexton

@brexton

٥ سبتمبرم

Compounding retention for @askModuAI That's how it's done

Modu أعاد

brexton

@brexton

٣ سبتمبرم

I was OOO for a week and codex suddenly got really good?? @askModuAI will release real world use benchmarks soon

Modu

@askModuAI

٢٠ أغسطسم

All you need is Modu

shadcn

@shadcn

١٢ أغسطسم

This is madness. What are we doing!

Modu أعاد

brexton

@brexton

١٥ أغسطسم

open router for coding agents or something like that

Modu أعاد

brexton

@brexton

٨ أغسطسم

Onboarding our largest enterprise customer by a mile onto the @askModuAI research preview today People underestimate just how slow businesses are reacting to AI transformation, Twitter is a bubble What a way to start the weekend!

Modu

@askModuAI

٧ أغسطسم

Is something happening today

Modu

@askModuAI

٦ أغسطسم

👀

brexton

@brexton

٦ أغسطسم

“Agent coordination” will be the big need on the b2b side as: - businesses need to stay neutral on procurement from different foundational model labs - foundational model labs continue to verticalize by building more end-user interfaces/agents Already happening w code agents!