rishdotblog's profile picture. Co-Founder @DefogData (YC W23)

Rishabh Srivastava

@rishdotblog

Co-Founder @DefogData (YC W23)

مثبتة

Open-sourcing Introspect: MIT-licensed Deep-Research for your internal data! Works with spreadsheets, databases, PDFs, and web search. Has a remarkably simple architecture – Sonnet agent armed with recursive tool calling and 3 default tools. Best for use-cases where you want to…

rishdotblog's tweet image. Open-sourcing Introspect: MIT-licensed Deep-Research for your internal data!

Works with spreadsheets, databases, PDFs, and web search. Has a remarkably simple architecture – Sonnet agent armed with recursive tool calling and 3 default tools.

Best for use-cases where you want to…

ChatGPT has (finally) started taking credit for the thankless work it does 😅

rishdotblog's tweet image. ChatGPT has (finally) started taking credit for the thankless work it does 😅

Added `grok-4-fast` to my agentic data analysis benchmark – super cheap, super fast, super good

rishdotblog's tweet image. Added `grok-4-fast` to my agentic data analysis benchmark – super cheap, super fast, super good

Haiku 4.5 hits a sweet spot for agentic data analysis workflows Super nice blend of low cost, low latency, and high quality outputs. I found it better than gpt-5. Will try to publish proper evals if I can find the time!

rishdotblog's tweet image. Haiku 4.5 hits a sweet spot for agentic data analysis workflows

Super nice blend of low cost, low latency, and high quality outputs. I found it better than gpt-5. Will try to publish proper evals if I can find the time!


Haiku 4.5 hits a sweet spot for agentic data analysis workflows Super nice blend of low cost, low latency, and high quality outputs. I found it better than gpt-5. Will try to publish proper evals if I can find the time!

rishdotblog's tweet image. Haiku 4.5 hits a sweet spot for agentic data analysis workflows

Super nice blend of low cost, low latency, and high quality outputs. I found it better than gpt-5. Will try to publish proper evals if I can find the time!

You're doing yourself a disservice if you still have not used Codex It worked uninterrupted for 35 mins for a super complex task - and got it right first try Quite nuts - it's already a much better programmer than me (for verifiable tasks) already.

rishdotblog's tweet image. You're doing yourself a disservice if you still have not used Codex

It worked uninterrupted for 35 mins for a super complex task - and got it right first try

Quite nuts - it's already a much better programmer than me (for verifiable tasks) already.

Man OpenAI killed it this DevDay. Tons of startups will have to pivot as a result of this. "Ride the waves caused by constant churn" seems to be the only viable strategy for an early stage co moving forward 😅


Fascinating chart. Survey from April 2024

rishdotblog's tweet image. Fascinating chart. Survey from April 2024

Google's on a roll. That's a lot of performance for that tiny size! I just embedded 1.4 million documents in ~80 mins on my M2 Max for free. Would've been ~$200 with the text-embedding-3-large, with worse quality.

Introducing EmbeddingGemma🎉 🔥With only 308M params, this is the top open model under 500M 🌏Trained on 100+ languages 🪆Flexible embeddings (768 to 128 dims) with Matryoshka 🤗Works with your favorite open tools 🤏Runs with as little as 200MB developers.googleblog.com/en/introducing…

osanseviero's tweet image. Introducing EmbeddingGemma🎉

🔥With only 308M params, this is the top open model under 500M
🌏Trained on 100+ languages
🪆Flexible embeddings (768 to 128 dims) with Matryoshka
🤗Works with your favorite open tools
🤏Runs with as little as 200MB

developers.googleblog.com/en/introducing…


Quick poll - what looks better in dark mode? First image or second image?

rishdotblog's tweet image. Quick poll - what looks better in dark mode? First image or second image?
rishdotblog's tweet image. Quick poll - what looks better in dark mode? First image or second image?

One reason I'm very bullish on Cloudflare Workers and Durable Objects - solves this in a very meaningful way

I don't think every UI will be generated on the fly, but apps will want to be more and more customized per user the problem is our current stack is terrible at this serving 1 site to 100k users is a very different problem to serving 100k slightly modified sites



Rishabh Srivastava أعاد

The most interesting tension in the AI industry: - better lesson pushes models towards end to end solutions with single large models - economic viability pushes industry towards many composable models with broad horizontal applicability


Credit where it's due: seems like OpenAI has fixed a lot of GPT-5 issues in the last 12-24 hours, and Codex CLI works really well in auto mode Still terrible if you use in a "approve before making edits" mode, but hopefully they fix it soon🤞🏼


Loading...

Something went wrong.


Something went wrong.