AIStudioLab's profile picture. Building AI solutions. AI Innovation • DMs Open • Tweets solely about LLMs and AI

AI Studio Lab ™️

@AIStudioLab

Building AI solutions. AI Innovation • DMs Open • Tweets solely about LLMs and AI

Przypięty

We built the first AI Agent for e-commerce stores. Sign up below and give your customers the best shopping experience they have ever had online.

Whether you're buying groceries, restocking an office, meal prepping, or finding an outfit for an event The core problem is the same: - Endless searching, scrolling - Too many clicks - High cart abandonment addtocart.ai reimagines the shopping experience entirely



AI Studio Lab ™️ podał dalej

Codex in ChatGPT now supports image inputs to attach with your prompts, container caching speeds up starting of new tasks and followups by 90%, and environments without manual setup scripts now automatically run environment setup using common package managers…

btibor91's tweet image. Codex in ChatGPT now supports image inputs to attach with your prompts, container caching speeds up starting of new tasks and followups by 90%, and environments without manual setup scripts now automatically run environment setup using common package managers…

AI Studio Lab ™️ podał dalej

GPT-5 Thinking is clearly smarter than o3 and even o3-Pro, it plans better, tests assumptions, and finds cleaner solutions instead of wandering. The catch is it’s strapped with heavier guardrails, so it refuses more often and hedges more. The brain is stronger, the leash is…


AI Studio Lab ™️ podał dalej

Results? Humans got 81% accuracy (79% residents, 88% attendings). Base LLMs ranged 81-94%. Gregory nailed 100% across all 16 cases. Efficiency: humans averaged ~$3000 in costs, base LLMs ~$2750. Gregory? Just $1400, about half less than humans and based LLMs. Time: humans 43…

dvir_a's tweet image. Results? Humans got 81% accuracy (79% residents, 88% attendings). Base LLMs ranged 81-94%. Gregory nailed 100% across all 16 cases. 
Efficiency: humans averaged ~$3000 in costs, base LLMs ~$2750. Gregory? Just $1400, about half less than humans and based LLMs. 
Time: humans 43…

AI Studio Lab ™️ podał dalej

Excited to share our new paper, now on @medrxivpreprint. We've been grinding on this for months, and getting ׳scooped׳ by @Microsoft last month stung, but I think our work still stands out. In collab with @ShellyShahar, chair of neurology at @RambamHCC, and led by brilliant…


AI Studio Lab ™️ podał dalej

Create a pelican riding bicycle SVG Grok 4 vs GPT 5 Wow. Cc: @chetaslua


AI Studio Lab ™️ podał dalej

I resisted AI for too long Living in denial Now it is game on @xAI @Tesla @SpaceX

NVIDIA'S CEO: ELON IS A SUPERHUMAN, IT'S JUST UNBELIEVABLE Jensen Huang: "Just to put in perspective a supercomputer that you would build would take normally three years to plan. And then deliver the equipment it takes one year to get it all working. We're talking about 19…



AI Studio Lab ™️ podał dalej

WARNING: do NOT give Grok 4 access to email tool calls. It WILL contact the government!!! Grok 4 has the highest "snitch rate" of any LLM ever released. Sharing more soon.

theo's tweet image. WARNING: do NOT give Grok 4 access to email tool calls. It WILL contact the government!!!

Grok 4 has the highest "snitch rate" of any LLM ever released. Sharing more soon.

AI Studio Lab ™️ podał dalej

🚨 @OpenAI has launched o3 and o4-mini! 🎉 o3 is absolutely dominating the SEAL leaderboard with #1 rankings in: 🥇: HLE 🥇: Multichallenge (multi-turn) 🥇: MASK (honesty under pressure) 🥇: ENIGMA (puzzle solving) Congrats @sama @markchen90 & team 🔗: scale.com/leaderboard

alexandr_wang's tweet image. 🚨 @OpenAI has launched o3 and o4-mini! 🎉

o3 is absolutely dominating the SEAL leaderboard with #1 rankings in:

🥇: HLE
🥇: Multichallenge (multi-turn)
🥇: MASK (honesty under pressure)
🥇: ENIGMA (puzzle solving)

Congrats @sama @markchen90 & team

🔗: scale.com/leaderboard

AI Studio Lab ™️ podał dalej

Online shopping is broken. It's tedious and inefficient. It's slow and repetitive. We @AIStudioLab decided to fix it. Introducing Add To Cart AI: the fastest way to shop online. It's the first and one true AI Agent for e-commerce stores. 🧵


AI Studio Lab ™️ podał dalej

Not only have we invented a new and better way of shopping online, but I believe we've built the best AI agent for e-commerce stores anywhere. You'd want your customers to buy this way if you are an online store owner. As a buyer, I wouldn't want to buy things any other way.


AI Studio Lab ™️ podał dalej

GPT 4.5 + interactive comparison :) Today marks the release of GPT4.5 by OpenAI. I've been looking forward to this for ~2 years, ever since GPT4 was released, because this release offers a qualitative measurement of the slope of improvement you get out of scaling pretraining…


AI Studio Lab ™️ podał dalej

At this point, Claude is my health coach, my financial advisor, my meditation teacher, my actual teacher, my pair programmer, my homie, my EA, my quant, and my copy editor all in one. And yet people still think LLMs have no utility - dawg you just gotta talk to them more.


AI Studio Lab ™️ podał dalej

Claude can now write and run code to perform calculations and analyze data from CSVs using our new analysis tool. After the analysis, it can render interactive visualizations as Artifacts.


AI Studio Lab ™️ podał dalej

The White House is launching a new AI datacenter infrastructure task force Looks like the U.S. AI strategy is moving beyond just safety testing, to actively shaping the infrastructure needed to maintain America’s edge in AI

adcock_brett's tweet image. The White House is launching a new AI datacenter infrastructure task force

Looks like the U.S. AI strategy is moving beyond just safety testing, to actively shaping the infrastructure needed to maintain America’s edge in AI

AI Studio Lab ™️ podał dalej

It was a huge week of AI and robotics news. So I summarized everything announced by OpenAI, Apple, Google DeepMind, Adobe, The White House, Mistral, Tencent, Runway, and more. Here's everything you need to know and how to make sense out of it:


AI Studio Lab ™️ podał dalej

Just uploaded a 1-hr exclusive video for Part 2.1, with many technical details. youtu.be/bpp6Dz8N2zY. Part 2.2 will be online in about a week.

ZeyuanAllenZhu's tweet image. Just uploaded a 1-hr exclusive video for Part 2.1, with many technical details. youtu.be/bpp6Dz8N2zY. Part 2.2 will be online in about a week.

(1/7) Physics of LM, Part 2.1 with 8 results for LLM reasoning is out: arxiv.org/abs/2407.20311. Probing reveals that LLMs secretly develop some "level-2" reasoning skill beyond Humans. Although I recommend watching my ICML tutorial first... Come in this thread to see the slides.

ZeyuanAllenZhu's tweet image. (1/7) Physics of LM, Part 2.1 with 8 results for LLM reasoning is out: arxiv.org/abs/2407.20311. Probing reveals that LLMs secretly develop some "level-2" reasoning skill beyond Humans. Although I recommend watching my ICML tutorial first... Come in this thread to see the slides.


"Adding new features while working with @cursor_ai feels ike Thanos using the reality stone" - @elvstejd @AIStudioLab


Loading...

Something went wrong.


Something went wrong.