iamrobotbear (bk)
@iamrobotbear
Product Manager & AI Engineer working on Gen AI & ML. Opinions are my own, not my employer's. RT !=endorsement
You might like
lol.
GPT-5.1 is now available in the API. It’s faster, more steerable, better at coding, and ships with practical new tools. If you’re building apps or agents where intelligence, speed, and cost matter, GPT-5.1 should feel like a meaningful upgrade. openai.com/index/gpt-5-1-…
everyone complained that the GPT5.1 release yesterday had no benchmarks. now you have them. note minor regressions in AIME and Taubench, which increases confidence that this is not benchmarkmaxxing i think more generally model comms for a consumer AI model lab has to be split…
GPT-5.1 is now available in the API. It’s faster, more steerable, better at coding, and ships with practical new tools. If you’re building apps or agents where intelligence, speed, and cost matter, GPT-5.1 should feel like a meaningful upgrade. openai.com/index/gpt-5-1-…
We built a Deep Research demo for the Claude Agent SDK! It's one our most requested use cases: spawn multiple AI agents to research a topic in parallel, then synthesize their findings into a report. 🧵 on how it works:
SIMA 2 is our most capable AI agent for virtual 3D worlds. 👾🌐 Powered by Gemini, it goes beyond following basic instructions to think, understand, and take actions in interactive environments – meaning you can talk to it through text, voice, or even images. Here’s how 🧵
Gamma CEO Grant Lee on the subtle reason why he isn't worried about being replaced by generalist AI models: “You as the creator need to... feel like you have a lot of input. You want to be involved in that because it is your story you're telling. It's not the AI's story.” Good…
Grant Lee: How Gamma Built a 100 Million User AI Presentation Company Despite being one of the most successful AI tools, @GammaApp was not founded as an AI company. Compared with other presentation tool companies, Gamma was building something deeper from day one: building blocks…
Hey @Snowflake how much do you charge for @streamlit hosting when using Snowflake to host the app? It's not in your pricing table.
Introducing Replit AI Integrations ✨ Build AI apps with 300+ AI models instantly - no API keys, no setup! 🔥 Access top models (OpenAI, Gemini, Anthropic, Meta, Grok, Mistral & more) with one click - all inside Replit. You ask. The Agent builds. It just works. 🚀
Today, as shared by The New York Times, we’re announcing two things: >Our Series B at a $2.1B valuation led by @sarahdingwang at @a16z. >Reaching $100M ARR, profitably, with a team of just 50 people. That's $2M ARR per employee. PowerPoint was invented before the first website,…
LLMs often suggest libraries that might not fit your use case or are outdated. When that happens, I ask codex: "i would like to find a more modern better fast alternative to <library> how should i form the research question?" Then I switch to Perplexity or GPT deep research to…
Codemaps can also be taken literally - visual system diagrams with two way linking strongly grounded in the codebase. we heard this feedback from Aiden but also our biggest Fortune 500 customers trying to wrangle their codebases x.com/aidenybai/stat… see it here:…
there's significant alpha in better diagraming/exploration tools for understanding codebases. the ones today suck traditionally done w/ real work or pair programming, but that's slow. practically most engs don't understand the full codebase
Link: viksit.substack.com/p/solving-agen… Coming up soon: GEPA based optimization of tools and routes.
The fact that API decisions for AI use are decided by IT has large downstream consequences for companies with their own internal chatbots. They often don’t know about the business uses for reasoning or tools or web search and default to minimum permissions, hobbling AI value.
Kimi K2 Thinking is the new leading open weights model: it demonstrates particular strength in agentic contexts but is very verbose, generating the most tokens of any model in completing our Intelligence Index evals @Kimi_Moonshot's Kimi K2 Thinking achieves a 67 in the…
🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built…
one of the best ways to make Claude Code a general agent- browserbase's plugin makes it so Claude can actually use your browser (with your cookies) and take actions using language
I've been using Claude Code completely wrong. I gave it a custom skill and Browser CLI tools and letting it do work for me. It can open pages, click buttons, fill in forms all from your authenticated browser. Just published it to the marketplace, install it in 2 commands.
DSPy intersects both in the MCP → DSPy and DSPy → MCP directions. 🧵
very good work, I appreciate the vision of MCP as a standard... based on your knowledge of DSPy, where and how do you see the intersection between MCP and DSPy? And which are the possible interactions?
Today we’re releasing SWE-1.5, our fast agent model. It achieves near-SOTA coding performance while setting a new standard for speed. Now available in @windsurf.
Spent time reading LangChain V1.0 docs today. The create_agent + Middleware is a game-changer: ✅ Controls context length ✅ Routes tools correctly ✅ Removes old results Just simpler, reliable agents. Great work, @LangChainAI ! Docs: docs.langchain.com
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -…
👀New data on the corporate ROI from generative AI from a large-scale tracking survey by my colleagues at Wharton. They found that 75% already have a positive return on investment from AI, less than 5% negative return. Also 46% of businesses leaders now use AI daily themselves.
United States Trends
- 1. #FinallyOverIt 5,061 posts
- 2. #TalusLabs N/A
- 3. Summer Walker 16.2K posts
- 4. Justin Fields 9,926 posts
- 5. 5sos 21.2K posts
- 6. #criticalrolespoilers 3,987 posts
- 7. Jets 68.3K posts
- 8. Jalen Johnson 8,467 posts
- 9. Patriots 150K posts
- 10. Drake Maye 21K posts
- 11. 1-800 Heartbreak 1,281 posts
- 12. Go Girl 25.2K posts
- 13. Judge 202K posts
- 14. Wale 32.3K posts
- 15. Robbed You 3,906 posts
- 16. #BlackOps7 15.7K posts
- 17. #zzzSpecialProgram 2,486 posts
- 18. TreVeyon Henderson 12.8K posts
- 19. AD Mitchell 2,426 posts
- 20. Disc 2 N/A
You might like
-
jansennn
@jansennn__ -
JazzyPboy 🥚👺
@JazzyPboy -
legume.eth
@Legume_tomb -
👺Jesse
@JesseFriedland -
KP🐧✳️
@kpanter18 -
Ral_ontheverse👺
@Ral_ontheverse -
tofu
@greedsgreedy -
GoatCloak👺
@GoatCloak -
Warren
@WarWren_ -
🌭 Zoomair 👺
@Zoomair10 -
PROPAGANDA
@propagandashand -
jeff 👺
@YoMTVRapz -
Drewskiii👺🐲
@Flower_Fuzz -
Swank 👺
@Swxnk47 -
vikefan 👺
@vikefan1180
Something went wrong.
Something went wrong.