full_stack_dl's profile picture. News, community, and courses for people building AI-powered products.

The Full Stack

@full_stack_dl

News, community, and courses for people building AI-powered products.

Pinned

🥞🦜 Full Stack LLM Bootcamp 🦜🥞 tl;dr We're releasing our lectures on building LLM-powered apps, for FREE. 🚀 Launch an LLM App in One Hour ✨ Prompt Engineering 🗿 LLM Foundations 🔨 Augmented LLMs 🤷 UX for LUIs 🏎️ LLMOps 🔮 What's Next? 👷 Project Walkthrough Learn more:


Would you be interested in a course or workshop on ✨Building Software with AI Agents✨???

Yes %78.8
No %21.2

66 vote · Final results


The Full Stack reposted

Is Claude Code still the best coding agent on the market? You can now easily find out by launching Claude, Codex, Gemini, and Amp on every ticket in your codebase:


The Full Stack reposted

Several agents plus three simple baselines were tested on HumanEval. Agents were mostly worse and always more expensive than the baselines. The good: · Evaluating the Pareto frontier · Strong simple baselines (just repeated calls!) The bad: · Clearly saturating the benchmark

sergeykarayev's tweet image. Several agents plus three simple baselines were tested on HumanEval.

Agents were mostly worse and always more expensive than the baselines.

The good:
· Evaluating the Pareto frontier
· Strong simple baselines (just repeated calls!)

The bad:
· Clearly saturating the benchmark

The Full Stack reposted

What percentage of your Twitter feed (the stuff you actually read, not just scroll past) do you believe is currently written by AI?


The Full Stack reposted

LLM Provider Comparisons 1. @withmartian 2. @ArtificialAnlys 3. @FixieAI

sergeykarayev's tweet image. LLM Provider Comparisons

1. @withmartian 
2. @ArtificialAnlys
3. @FixieAI
sergeykarayev's tweet image. LLM Provider Comparisons

1. @withmartian 
2. @ArtificialAnlys
3. @FixieAI
sergeykarayev's tweet image. LLM Provider Comparisons

1. @withmartian 
2. @ArtificialAnlys
3. @FixieAI

The Full Stack reposted

Has anyone done comprehensive testing of gpt-4-vision-preview? I want to know stuff like the minimum text size it can read, the radius of the smallest circle it can locate in an image, the number of circles it can count, etc. Could be an automated benchmark for other models too


The Full Stack reposted

Which set of statements do you agree with? 1. AGI is as much or more of a risk to human flourishing as nuclear weapons 2. I have a good idea for what should be done about that


The Full Stack reposted

Has anyone had good experiences with GPT-powered code generation for complete web app features? As in, you describe what should exist, and GPT actually provides the source of all the necessary files and where they should go. Ideally in the context of Ruby on Rails.


The Full Stack reposted

Let's say that a US-based research company has developed an AGI model that was able to use the browser, pass captchas, hire people on Upwork, and lie about its intentions. What should they do after observing this?


The Full Stack reposted

We bring in @full_stack_dl, a venerable boot camp crew that pioneered technical deep dives into deep learning where people fly in from around the world. 🥞 Their #LLM Bootcamp in the spring was sold out and this is your chance to attend the ➡️ version. 👉 scale.bythebay.io/register


The Full Stack reposted

Solutions from replies: - @OpenPipeAI looks exactly right openpipe.ai - @PortkeyAI launching feature soon - @analyticsaurabh building his own I currently use @helicone_ai, any plans from them?

Is there a service I can use to pipe my GPT-4 calls through, and it automatically finetunes GPT-3.5 (or whatever) on all of them, and lets me know when it's up to par?



The Full Stack reposted

Wow - don't miss this!

This is sadly true! If you want the latest version, come join us in November for our in-person workshop with @ScaleByTheBay scale.bythebay.io/llm-workshop



The Full Stack reposted

Is there a service I can use to pipe my GPT-4 calls through, and it automatically finetunes GPT-3.5 (or whatever) on all of them, and lets me know when it's up to par?


This is sadly true! If you want the latest version, come join us in November for our in-person workshop with @ScaleByTheBay scale.bythebay.io/llm-workshop

Just realized that even the best and most up-to-date #LLMbootcamp from @full_stack_dl is partially outdated! The field is rushing! #LLM #fullstack



The Full Stack reposted

It feels like something to be you. Do you think it feels like something to be GPT-4?


The Full Stack reposted

By what year will there be an AI that is more capable than most humans in most domains of digital work (e.g. you can tell it to do anything you currently hire a white collar professional to do, and it does the job better than the median human)?


Loading...

Something went wrong.


Something went wrong.