The Full Stack

@full_stack_dl

News, community, and courses for people building AI-powered products.

Pinned

🥞🦜 Full Stack LLM Bootcamp 🦜🥞

tl;dr We're releasing our lectures on building LLM-powered apps, for FREE.

🚀 Launch an LLM App in One Hour
✨ Prompt Engineering
🗿 LLM Foundations
🔨 Augmented LLMs
🤷 UX for LUIs
🏎️ LLMOps
🔮 What's Next?
👷 Project Walkthrough

Learn more:


Would you be interested in a course or workshop on ✨Building Software with AI Agents✨???

Yes 78.8%
No 21.2%

66 votes · Final results


The Full Stack reposted

Is Claude Code still the best coding agent on the market? You can now easily find out by launching Claude, Codex, Gemini, and Amp on every ticket in your codebase:


The Full Stack reposted

Several agents plus three simple baselines were tested on HumanEval.

Agents were mostly worse and always more expensive than the baselines.

The good:
· Evaluating the Pareto frontier
· Strong simple baselines (just repeated calls!)

The bad:
· Clearly saturating the benchmark

The Full Stack reposted

What percentage of your Twitter feed (the stuff you actually read, not just scroll past) do you believe is currently written by AI?


The Full Stack reposted

LLM Provider Comparisons

1. @withmartian
2. @ArtificialAnlys
3. @FixieAI

The Full Stack reposted

Has anyone done comprehensive testing of gpt-4-vision-preview? I want to know stuff like the minimum text size it can read, the radius of the smallest circle it can locate in an image, the number of circles it can count, etc. Could be an automated benchmark for other models too


The Full Stack reposted

Which set of statements do you agree with? 1. AGI is as much or more of a risk to human flourishing as nuclear weapons 2. I have a good idea for what should be done about that


The Full Stack reposted

Has anyone had good experiences with GPT-powered code generation for complete web app features? As in, you describe what should exist, and GPT actually provides the source of all the necessary files and where they should go. Ideally in the context of Ruby on Rails.


The Full Stack reposted

Let's say that a US-based research company has developed an AGI model that was able to use the browser, pass captchas, hire people on Upwork, and lie about its intentions. What should they do after observing this?


The Full Stack reposted

We bring in @full_stack_dl, a venerable boot camp crew that pioneered technical deep dives into deep learning where people fly in from around the world. 🥞 Their #LLM Bootcamp in the spring was sold out and this is your chance to attend the ➡️ version. 👉 scale.bythebay.io/register


The Full Stack reposted

Solutions from replies:
- @OpenPipeAI looks exactly right openpipe.ai
- @PortkeyAI launching feature soon
- @analyticsaurabh building his own

I currently use @helicone_ai, any plans from them?

Is there a service I can use to pipe my GPT-4 calls through, and it automatically finetunes GPT-3.5 (or whatever) on all of them, and lets me know when it's up to par?



The Full Stack reposted

Wow - don't miss this!

This is sadly true! If you want the latest version, come join us in November for our in-person workshop with @ScaleByTheBay scale.bythebay.io/llm-workshop



The Full Stack reposted

Is there a service I can use to pipe my GPT-4 calls through, and it automatically finetunes GPT-3.5 (or whatever) on all of them, and lets me know when it's up to par?


This is sadly true! If you want the latest version, come join us in November for our in-person workshop with @ScaleByTheBay scale.bythebay.io/llm-workshop

Just realized that even the best and most up-to-date #LLMbootcamp from @full_stack_dl is partially outdated! The field is rushing! #LLM #fullstack



The Full Stack reposted

It feels like something to be you. Do you think it feels like something to be GPT-4?


The Full Stack reposted

By what year will there be an AI that is more capable than most humans in most domains of digital work (e.g. you can tell it to do anything you currently hire a white collar professional to do, and it does the job better than the median human)?

