iamrobotbear (bk)

@iamrobotbear

Product Manager & AI Engineer working on Gen AI & ML. Opinions are my own, not my employer's. RT !=endorsement

Seattle, WA

Joined November 2021

27KPosts 4KFollowers 7KFollowing

You might like

$jansennn__'s profile picture. Good from far, far from good ¯\_(ツ)_/¯$

@jansennn__

@JazzyPboy

@Legume_tomb

@JesseFriedland

@kpanter18

@Ral_ontheverse

@greedsgreedy

@GoatCloak

@WarWren_

@Zoomair10

@propagandashand

@YoMTVRapz

@Flower_Fuzz

@Swxnk47

@vikefan1180

Pinned

Pat Bergstresser

@PatThePM

Sep 9, 2023

Product Managers when ChatGPT first came out:

iamrobotbear (bk) reposted

GPT-5.1 is now available in the API. It’s faster, more steerable, better at coding, and ships with practical new tools. If you’re building apps or agents where intelligence, speed, and cost matter, GPT-5.1 should feel like a meaningful upgrade. openai.com/index/gpt-5-1-…

OpenAIDevs's tweet card. GPT-5.1 is now available in the API, bringing faster adaptive reasoning, extended prompt caching, improved coding performance, and new apply_patch and shell tools.

Introducing GPT-5.1 for developers

Source: openai.com

iamrobotbear (bk) reposted

swyx🔜 @aidotEngineer CODE 🗽

@swyx

14 h

everyone complained that the GPT5.1 release yesterday had no benchmarks. now you have them. note minor regressions in AIME and Taubench, which increases confidence that this is not benchmarkmaxxing i think more generally model comms for a consumer AI model lab has to be split…

swyx's tweet image. everyone complained that the GPT5.1 release yesterday had no benchmarks. now you have them. note minor regressions in AIME and Taubench, which increases confidence that this is not benchmarkmaxxing

i think more generally model comms for a consumer AI model lab has to be split…

OpenAI Developers

@OpenAIDevs

14 h

Introducing GPT-5.1 for developers

Source: openai.com

iamrobotbear (bk) reposted

Thariq

@trq212

Nov 12

We built a Deep Research demo for the Claude Agent SDK! It's one our most requested use cases: spawn multiple AI agents to research a topic in parallel, then synthesize their findings into a report. 🧵 on how it works:

trq212's tweet image. We built a Deep Research demo for the Claude Agent SDK!

It's one our most requested use cases: spawn multiple AI agents to research a topic in parallel, then synthesize their findings into a report.

🧵 on how it works:

iamrobotbear (bk) reposted

Google DeepMind

@GoogleDeepMind

18 h

SIMA 2 is our most capable AI agent for virtual 3D worlds. 👾🌐 Powered by Gemini, it goes beyond following basic instructions to think, understand, and take actions in interactive environments – meaning you can talk to it through text, voice, or even images. Here’s how 🧵

iamrobotbear (bk) reposted

a16z

@a16z

Nov 11

Gamma CEO Grant Lee on the subtle reason why he isn't worried about being replaced by generalist AI models: “You as the creator need to... feel like you have a lot of input. You want to be involved in that because it is your story you're telling. It's not the AI's story.” Good…

a16z

@a16z

Nov 11

Grant Lee: How Gamma Built a 100 Million User AI Presentation Company Despite being one of the most successful AI tools, @GammaApp was not founded as an AI company. Compared with other presentation tool companies, Gamma was building something deeper from day one: building blocks…

iamrobotbear (bk)

@iamrobotbear

Nov 10

Hey @Snowflake how much do you charge for @streamlit hosting when using Snowflake to host the app? It's not in your pricing table.

iamrobotbear (bk) reposted

Replit ⠕

@Replit

Nov 10

Introducing Replit AI Integrations ✨ Build AI apps with 300+ AI models instantly - no API keys, no setup! 🔥 Access top models (OpenAI, Gemini, Anthropic, Meta, Grok, Mistral & more) with one click - all inside Replit. You ask. The Agent builds. It just works. 🚀

iamrobotbear (bk) reposted

Grant Lee

@thisisgrantlee

Nov 10

Today, as shared by The New York Times, we’re announcing two things: >Our Series B at a $2.1B valuation led by @sarahdingwang at @a16z. >Reaching $100M ARR, profitably, with a team of just 50 people. That's $2M ARR per employee. PowerPoint was invented before the first website,…

iamrobotbear (bk) reposted

Kevin Kern

@kregenrek

Nov 9

LLMs often suggest libraries that might not fit your use case or are outdated. When that happens, I ask codex: "i would like to find a more modern better fast alternative to <library> how should i form the research question?" Then I switch to Perplexity or GPT deep research to…

kregenrek's tweet image. LLMs often suggest libraries that might not fit your use case or are outdated. When that happens, I ask codex:

"i would like to find a more modern better fast alternative to &lt;library&gt; how should i form the research question?"

Then I switch to Perplexity or GPT deep research to…

iamrobotbear (bk) reposted

Windsurf

@windsurf

Nov 4

Codemaps can also be taken literally - visual system diagrams with two way linking strongly grounded in the codebase. we heard this feedback from Aiden but also our biggest Fortune 500 customers trying to wrangle their codebases x.com/aidenybai/stat… see it here:…

windsurf's tweet image. Codemaps can also be taken literally - visual system diagrams with two way linking strongly grounded in the codebase.

we heard this feedback from Aiden but also our biggest Fortune 500 customers trying to wrangle their codebases

x.com/aidenybai/stat…

see it here:…

Aiden Bai

@aidenybai

Nov 2

there's significant alpha in better diagraming/exploration tools for understanding codebases. the ones today suck traditionally done w/ real work or pair programming, but that's slow. practically most engs don't understand the full codebase

iamrobotbear (bk) reposted

Viksit Gaur

@viksit

Nov 7

Link: viksit.substack.com/p/solving-agen… Coming up soon: GEPA based optimization of tools and routes.

iamrobotbear (bk) reposted

Ethan Mollick

@emollick

Nov 7

The fact that API decisions for AI use are decided by IT has large downstream consequences for companies with their own internal chatbots. They often don’t know about the business uses for reasoning or tools or web search and default to minimum permissions, hobbling AI value.

iamrobotbear (bk) reposted

Artificial Analysis

@ArtificialAnlys

Nov 7

Kimi K2 Thinking is the new leading open weights model: it demonstrates particular strength in agentic contexts but is very verbose, generating the most tokens of any model in completing our Intelligence Index evals @Kimi_Moonshot's Kimi K2 Thinking achieves a 67 in the…

ArtificialAnlys's tweet image. Kimi K2 Thinking is the new leading open weights model: it demonstrates particular strength in agentic contexts but is very verbose, generating the most tokens of any model in completing our Intelligence Index evals

@Kimi_Moonshot's Kimi K2 Thinking achieves a 67 in the…

iamrobotbear (bk) reposted

Kimi.ai

@Kimi_Moonshot

Nov 6

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built…

Kimi_Moonshot's tweet image. 🚀 Hello, Kimi K2 Thinking!
The Open-Source Thinking Agent Model is here.

🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%)
🔹 Executes up to 200 – 300 sequential tool calls without human interference
🔹 Excels in reasoning, agentic search, and coding
🔹 256K context window

Built…

iamrobotbear (bk) reposted

Thariq

@trq212

Nov 6

one of the best ways to make Claude Code a general agent- browserbase's plugin makes it so Claude can actually use your browser (with your cookies) and take actions using language

Paul Klein IV

@pk_iv

Nov 6

I've been using Claude Code completely wrong. I gave it a custom skill and Browser CLI tools and letting it do work for me. It can open pages, click buttons, fill in forms all from your authenticated browser. Just published it to the marketplace, install it in 2 commands.

iamrobotbear (bk) reposted

Maxime Rivest 🧙‍♂️🦙🐧

@MaximeRivest

Nov 3

DSPy intersects both in the MCP → DSPy and DSPy → MCP directions. 🧵

Andrea Alberici

@aalberici

Nov 3

very good work, I appreciate the vision of MCP as a standard... based on your knowledge of DSPy, where and how do you see the intersection between MCP and DSPy? And which are the possible interactions?

iamrobotbear (bk) reposted

Cognition

@cognition

Oct 29

Today we’re releasing SWE-1.5, our fast agent model. It achieves near-SOTA coding performance while setting a new standard for speed. Now available in @windsurf.

cognition's tweet image. Today we’re releasing SWE-1.5, our fast agent model.

It achieves near-SOTA coding performance while setting a new standard for speed. Now available in @windsurf.

iamrobotbear (bk) reposted

Paul Raistrick

@PWR_Locus

Oct 29

Spent time reading LangChain V1.0 docs today. The create_agent + Middleware is a game-changer: ✅ Controls context length ✅ Routes tools correctly ✅ Removes old results Just simpler, reliable agents. Great work, @LangChainAI ! Docs: docs.langchain.com

Home - Docs by LangChain

Source: docs.langchain.com

iamrobotbear (bk) reposted

Karan Goel

@krandiash

Oct 28

We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -…

iamrobotbear (bk) reposted

Ethan Mollick

@emollick

Oct 28

👀New data on the corporate ROI from generative AI from a large-scale tracking survey by my colleagues at Wharton. They found that 75% already have a positive return on investment from AI, less than 5% negative return. Also 46% of businesses leaders now use AI daily themselves.