AfterQuery

@AfterQuery

Investigating the boundaries of AI capabilities

afterquery.com

Joined February 2025

10 posts · 202 followers · 6 following

AfterQuery reposted

Spencer Mateega

@spencermateega

Oct 17

The frontier begets the frontier. I highly recommend reading @jaminball's latest Clouded Judgement article which spells out the AfterQuery thesis (thread)

AfterQuery

@AfterQuery

Sep 18

Excited for UI-Bench to be the leading benchmark for UI/web design! Congrats to the @figma make team for claiming the #2 ranking

Dylan Field

@zoink

Sep 18

while leaderboards are fun and motivating, this is just the start for figma make. can't wait to share all the improvements we are making over coming days / weeks / months!

AfterQuery reposted

Introducing UI-Bench by @afterquery. The first and only rigorous eval of vibe coding tools.
> 4,000+ blinded pairwise judgments
> @orchidsapp, @figma make, and @lovable_dev take the lead
> @v0 and @replit ranked dead last
> performance gaps = differences in LLM orchestration,…

AfterQuery reposted

Spencer Mateega

@spencermateega

Aug 25

finally got the @afterquery team to touch grass

AfterQuery reposted

Spencer Mateega

@spencermateega

Jul 4

🇺🇸 249 years ago, America declared that innovation belongs to the bold. Today, we're writing the next chapter—one dataset at a time. At @AfterQuery, we believe AI's future isn't just about algorithms. It's about the human ingenuity that teaches machines to think, reason, and…

AfterQuery reposted

Spencer Mateega

@spencermateega

Jun 30

Today, we’re pulling back the curtains. After collecting thousands of original, human-written coding problems, @AfterQuery created internal, contamination-free evals to test LLM code generation. No leaderboard tricks. No test-set leakage. Just raw task execution. Thread 🧵

AfterQuery reposted

Ethan Liu

@ethantsliu

May 29

Excited to share VADER, AfterQuery's new, human-evaluated benchmark for evaluating LLMs on real-world vulnerability handling! Paper: lnkd.in/g7EfAi2c All data, evaluation tools & results are open-sourced at: lnkd.in/gYPUKwub [1/4]

AfterQuery reposted

Gary Qi

@gary_qz

May 22

Hey devs! Trae’s back in SF! We’re proud to be a lead partner of AGENTHACKS hackathon 📍 Join us in San Francisco and meet our amazing hosts and candidate. 🗓️ May 23–24 @ AGI House SF 💰 $10K+ in prizes, free AI credits, bounties & more! 🌐 Join now: agenthacks.org…