albertzhang36's profile picture. @braintrustdata, prev @ivp @qatalystgroup

Albert Zhang

@albertzhang36

@braintrustdata, prev @ivp @qatalystgroup

Talvez você curta
Fixado

We raised $36M from a16z! In the last 9 months since I joined, it’s been inspiring to see so many of the best AI teams adopt Braintrust - Stripe, Notion, Zapier, Airtable, and many more. It's also incredibly motivating to be part of such an amazing team. I deeply believe that…

Excited to share that we've raised $36m from @martin_casado at @a16z along with @saammotamedi @GreylockVC @eladgil @basecasevc to further our mission of helping developers build AI products that work. A bit more on what we're up to 🧵

ankrgyl's tweet image. Excited to share that we've raised $36m from @martin_casado at @a16z along with @saammotamedi @GreylockVC @eladgil @basecasevc to further our mission of helping developers build AI products that work.

A bit more on what we're up to 🧵


Albert Zhang repostou

I’m excited to announce @DoeLabs coming out of stealth. @YuhuangOu and I are building vertical-specific AI tuned for industries still stuck with legacy software and nuanced workflows. Dental is just the start. Proudly backed by @ycombinator and an exceptional team.


Albert Zhang repostou

The cost of intelligence is going to 0. We evaluated DeepSeek's new R1 model with @braintrustdata: -R1 generally beats o1 mini but lags behind o1 -The value is wild: o1 costs ~20x more for just ~4.5% better accuracy. Game-changing performance for the price.

AdrianBarbir's tweet image. The cost of intelligence is going to 0.

We evaluated DeepSeek's new R1 model with @braintrustdata:
-R1 generally beats o1 mini but lags behind o1
-The value is wild: o1 costs ~20x more for just ~4.5% better accuracy. 

Game-changing performance for the price.
AdrianBarbir's tweet image. The cost of intelligence is going to 0.

We evaluated DeepSeek's new R1 model with @braintrustdata:
-R1 generally beats o1 mini but lags behind o1
-The value is wild: o1 costs ~20x more for just ~4.5% better accuracy. 

Game-changing performance for the price.

Go Corinne!!!!!

🎉Congrats, @CorinneMRiley on making the 2024 @Forbes 30 Under 30 VC list 🎉 @GreylockVC is thrilled to see you get recognized for creating Greylock Edge and the Greylock Scouts program; and for your partnership w/ early stage enterprise cos. bit.ly/3ZgoCZ3

GreylockVC's tweet image. 🎉Congrats, @CorinneMRiley on making the 2024 @Forbes 30 Under 30 VC list 🎉

@GreylockVC is thrilled to see you get recognized for creating Greylock Edge and the Greylock Scouts program; and for your partnership w/ early stage enterprise cos. 

bit.ly/3ZgoCZ3


Albert Zhang repostou

Shipping reliable AI-powered apps isn't just about model performance – it's about delivering consistent value to users. That's why with LLM evals - response quality, task completion rates, and user satisfaction often matter more than pure model performance. Love how…


Albert Zhang repostou

At @TechCrunch Disrupt earlier this week, I was fortunate to speak on the topic of what founders need to know when going from seed to Series A. When it comes to milestones, one of the things that what matters most is quality of the customer base - and @braintrustdata is a perfect…


Albert Zhang repostou

AI Case Study: How do you reduce hallucinations by over 80%? Start with a robust evals framework. A look inside our project teaming up with @zapier on their awesome AI-powered API integration builder: buff.ly/3YzTsg8 Big thanks to @braintrustdata, our go-to evals tool!


Albert Zhang repostou

delighted to see @braintrustdata & @browserbasehq highlighted by @theinformation as 2 of the top 7 most promising ai startups 🙂

alanaagoyal's tweet image. delighted to see @braintrustdata & @browserbasehq highlighted by @theinformation as 2 of the top 7 most promising ai startups 🙂

One of the most common questions we hear from Braintrust users is: "I just ran an eval... what should I do next?" If you're wondering the same (or want to learn more about how many of our customers iterate), check out this blog post!

So you ran an eval... now what?



Albert Zhang repostou

Evals used to be part of my daily life working on Google search. It's a profound change that with the advent of generative AI, running evals has become a part of mainstream software development. We wrote about what evals even are and our process for @v0 vercel.com/blog/eval-driv…


Albert Zhang repostou

Braintrust is lightyears ahead of every other LLM eval tool we’ve tested (and there are lots). Absolute game changer for anyone trying to use LLMs in production. @braintrustdata


Albert Zhang repostou

We're hosting a curated event next week for engineering & product leaders building with Generative AI in SF. Only a couple of spots left - DM us or reply if you're interested!

braintrust's tweet image. We're hosting a curated event next week for engineering & product leaders building with Generative AI in SF. 

Only a couple of spots left - DM us or reply if you're interested!

Albert Zhang repostou

.@NotionHQ has built an incredible suite of AI features, and we’re honored they used Braintrust to evolve their eval workflow. They now triage and fix 30 issues per day! Learn how Notion develops world-class AI features: braintrust.dev/blog/notion


Albert Zhang repostou

If yesterday was not enough Braintrust for you -- I went on No priors with @eladgil to talk about the journey so far, fundraise, and what's ahead. Was especially fun to talk about learnings from Impira, since Elad was an investor there too :)


Albert Zhang repostou

"Omg did you see what this AI sai-" You bolt awake in your SoMA apartment. It is 2024. Your X feed is all Braintrust Series A. You are the CTO of a tech company, and you have changed your mind. The future cannot come to pass. The LLMs must be logged and evaled w @braintrustdata

CorinneMRiley's tweet image. "Omg did you see what this AI sai-"

You bolt awake in your SoMA apartment. It is 2024. Your X feed is all Braintrust Series A. You are the CTO of a tech company, and you have changed your mind. The future cannot come to pass. The LLMs must be logged and evaled w @braintrustdata

Albert Zhang repostou

Watching @braintrustdata skyrocket over the last year has been such a privilege, and excited for the next phase with their $36M Series A. Since the seed Braintrust has: - continued to grow mindshare of leading AI companies like @stripe, @NotionHQ, @Replit, @vercel and so many…

Exclusive: Braintrust, which helps Airtable, Brex, Notion and Stripe build AI products, has raised $36M in a Series A led by a16z. The one-year-old startup offering LLM evaluations and monitoring is now valued at about $150M, a source tells @Forbes. forbes.com/sites/alexkonr…



Albert Zhang repostou

The pace that @ankrgyl and @braintrustdata ship at is breathtaking. And with incredible taste and care for getting the abstractions for AI development right. They’ve quickly become the leading dev platform for AI, powering Stripe, Notion, Airtable, Instacart and more. Proud…

Exclusive: Braintrust, which helps Airtable, Brex, Notion and Stripe build AI products, has raised $36M in a Series A led by a16z. The one-year-old startup offering LLM evaluations and monitoring is now valued at about $150M, a source tells @Forbes. forbes.com/sites/alexkonr…



Albert Zhang repostou

O1 now available in Braintrust playground. Interesting to see how this thing works -- it only supports temperature=1 and stream=false. Also interesting to see how many more tokens it consumes along the way towards a (more reasoned) answer.


Albert Zhang repostou

Super excited to launch server-side online scoring in Braintrust. It's insanely easy to configure. Just pick scorers + sampling rate, and they run on your logs automatically. It supports LLM-as-a-judge, TS/Py code (UI or from your codebase via bundling), or any autoeval.


United States Tendências

Talvez você curta

Loading...

Something went wrong.


Something went wrong.