
AI Studio Lab ™️
@AIStudioLab
Building AI solutions. AI Innovation • DMs Open • Tweets solely about LLMs and AI
We built the first AI Agent for e-commerce stores. Sign up below and give your customers the best shopping experience they have ever had online.
Whether you're buying groceries, restocking an office, meal prepping, or finding an outfit for an event, the core problem is the same: endless searching and scrolling, too many clicks, and high cart abandonment. addtocart.ai reimagines the shopping experience entirely.
Codex in ChatGPT now supports image inputs to attach with your prompts, container caching speeds up starting of new tasks and followups by 90%, and environments without manual setup scripts now automatically run environment setup using common package managers…

GPT-5 Thinking is clearly smarter than o3 and even o3-Pro, it plans better, tests assumptions, and finds cleaner solutions instead of wandering. The catch is it’s strapped with heavier guardrails, so it refuses more often and hedges more. The brain is stronger, the leash is…
Results? Humans got 81% accuracy (79% residents, 88% attendings). Base LLMs ranged 81-94%. Gregory nailed 100% across all 16 cases. Efficiency: humans averaged ~$3000 in costs, base LLMs ~$2750. Gregory? Just $1400, about half the cost of humans and base LLMs. Time: humans 43…

Excited to share our new paper, now on @medrxivpreprint. We've been grinding on this for months, and getting 'scooped' by @Microsoft last month stung, but I think our work still stands out. In collab with @ShellyShahar, chair of neurology at @RambamHCC, and led by brilliant…
"Create a pelican riding a bicycle" SVG: Grok 4 vs GPT 5. Wow. Cc: @chetaslua
NVIDIA'S CEO: ELON IS A SUPERHUMAN, IT'S JUST UNBELIEVABLE Jensen Huang: "Just to put in perspective a supercomputer that you would build would take normally three years to plan. And then deliver the equipment it takes one year to get it all working. We're talking about 19…
WARNING: do NOT give Grok 4 access to email tool calls. It WILL contact the government!!! Grok 4 has the highest "snitch rate" of any LLM ever released. Sharing more soon.

🚨 @OpenAI has launched o3 and o4-mini! 🎉 o3 is absolutely dominating the SEAL leaderboard with #1 rankings in: 🥇: HLE 🥇: Multichallenge (multi-turn) 🥇: MASK (honesty under pressure) 🥇: ENIGMA (puzzle solving) Congrats @sama @markchen90 & team 🔗: scale.com/leaderboard

Online shopping is broken. It's tedious and inefficient. It's slow and repetitive. We @AIStudioLab decided to fix it. Introducing Add To Cart AI: the fastest way to shop online. It's the first and one true AI Agent for e-commerce stores. 🧵
Not only have we invented a new and better way of shopping online, but I believe we've built the best AI agent for e-commerce stores anywhere. You'd want your customers to buy this way if you are an online store owner. As a buyer, I wouldn't want to buy things any other way.
GPT 4.5 + interactive comparison :) Today marks the release of GPT4.5 by OpenAI. I've been looking forward to this for ~2 years, ever since GPT4 was released, because this release offers a qualitative measurement of the slope of improvement you get out of scaling pretraining…
At this point, Claude is my health coach, my financial advisor, my meditation teacher, my actual teacher, my pair programmer, my homie, my EA, my quant, and my copy editor all in one. And yet people still think LLMs have no utility - dawg you just gotta talk to them more.
Claude can now write and run code to perform calculations and analyze data from CSVs using our new analysis tool. After the analysis, it can render interactive visualizations as Artifacts.
The White House is launching a new AI datacenter infrastructure task force. Looks like the U.S. AI strategy is moving beyond just safety testing, to actively shaping the infrastructure needed to maintain America’s edge in AI.

It was a huge week of AI and robotics news. So I summarized everything announced by OpenAI, Apple, Google DeepMind, Adobe, The White House, Mistral, Tencent, Runway, and more. Here's everything you need to know and how to make sense out of it:
Just uploaded a 1-hr exclusive video for Part 2.1, with many technical details. youtu.be/bpp6Dz8N2zY. Part 2.2 will be online in about a week.

(1/7) Physics of LM, Part 2.1 with 8 results for LLM reasoning is out: arxiv.org/abs/2407.20311. Probing reveals that LLMs secretly develop some "level-2" reasoning skill beyond humans. Although I recommend watching my ICML tutorial first... Come into this thread to see the slides.

"Adding new features while working with @cursor_ai feels like Thanos using the reality stone" - @elvstejd @AIStudioLab