#benchmarks search results

Microsoft Adoption & Community

May 8

✨ Taking the stage, @Jared_Spataro here to share his thoughts and insights about “A New Frontier: Building the Future Firm with #AI” #BenchMarks 📈📉📊 #M365Con Day Three Keynote

MSFTAdoption's tweet image. ✨ Taking the stage, @Jared_Spataro here to share his thoughts and insights about “A New Frontier: Building the Future Firm with #AI” #BenchMarks 📈📉📊

#M365Con Day Three Keynote

SLLY nomy :3

@nomyfps

Sep 22

Cerulean Complete. #VISCOSE #BENCHMARKS #VISCOSEBENCHMARKS

Gerard Sans | Axiom 🇬🇧

@gerardsans

Aug 6, 2024

Achieving SOTA AI benchmarks in 2024 AI researchers: nobody is gonna know #ai #benchmarks

Open Deep Search (ODS) isn’t theory. It’s already outperforming closed labs: - FRAMES: 75.3% - SimpleQA: 88.3% That’s Sentient’s power: research that’s open, benchmarked, and winning. @SentientAGI @sentient_chat #SentientAGI #Benchmarks

BananaYellow88's tweet image. Open Deep Search (ODS) isn’t theory.

It’s already outperforming closed labs:
- FRAMES: 75.3%
- SimpleQA: 88.3%

That’s Sentient’s power: research that’s open, benchmarked, and winning.

@SentientAGI @sentient_chat

#SentientAGI #Benchmarks

Predibase by Rubrik

@predibase

May 28

The fastest open-source LLM #inference stack just landed. Check out our latest #benchmarks that leave vLLM and Fireworks in the dust. 🏎️💨 Our blog has all the juicy details—but here's the 30-sec version: ⚡ Up to 4× lower P50/P95 latency on the same #H100 & L40S GPUs 📈…

predibase's tweet image. The fastest open-source LLM #inference stack just landed.

Check out our latest #benchmarks that leave vLLM and Fireworks in the dust. 🏎️💨

Our blog has all the juicy details—but here's the 30-sec version:
⚡ Up to 4× lower P50/P95 latency on the same #H100 &amp; L40S GPUs
📈…

the peak district viking

@thepdviking

Oct 9

One for you @martgathercole a few of the benchmarks in my area #benchmarking #ordnancesurvey #benchmarks

ECH Institute Inc.

@ECHInstitute

Oct 6

3️⃣ Kamil Chodoła(@ChodoKamil) provided a deep dive into performance #benchmarks and testing strategies, showcasing the rigorous processes involved in upcoming upgrades. 🔧

ECHInstitute's tweet image. 3️⃣ Kamil Chodoła(@ChodoKamil) provided a deep dive into performance #benchmarks and testing strategies, showcasing the rigorous processes involved in upcoming upgrades. 🔧

iPhone Sickness ®

@iphonesickness

Aug 27

Pixel 10 Pro XL pulls a 95% stability on Wild Life Extreme Stress Test 🔥 Best loop: 3252 | Lowest loop: 3094 Google finally nailed thermal performance – no wild throttling here. 💪📱 #Pixel10Pro #Benchmarks

iphonesickness's tweet image. Pixel 10 Pro XL pulls a 95% stability on Wild Life Extreme Stress Test 🔥
Best loop: 3252 | Lowest loop: 3094

Google finally nailed thermal performance – no wild throttling here. 💪📱 #Pixel10Pro #Benchmarks

Prakash Sangam

@MyTechMusings

Oct 21

Some #benchmarks for #Oryon2ndgen #SnapdragonSummit @Qualcomm

AlpacaLips 🐢

@AlpacaLips_

Jan 31

I camped overnight outside of Microcenter and got my hands on an #RTX5080 #ffxiv #benchmarks #ff14

AlpacaLips 🐢

@AlpacaLips_

Jan 31

I camped overnight outside of Microcenter to get an RTX 5080! Here's how it runs on #FFXIV youtu.be/D_CgrIaw1nU?si…

Benedikt Koehler

@furukama

Sep 20

You can now filter the LLM benchmark list by size. Here's the top XS models (< 2B parameters) furukama.com/llm-bob/?size=… #benchmarks #artificialintelligence

furukama's tweet image. You can now filter the LLM benchmark list by size. Here's the top XS models (&lt; 2B parameters) furukama.com/llm-bob/?size=… #benchmarks #artificialintelligence

Hyke

@0xhyke

Aug 8

GPT-5 vs Grok 4 - SkateBench → GPT-5: 98.6% accuracy | $0.07 cost → Grok 4: 79% accuracy | $4.86 cost GPT-5 is: → 14× cheaper → More accurate → Much faster This is precision at scale. That is burn rate with lag. #GPT5 #LLM #Benchmarks

0xhyke's tweet image. GPT-5 vs Grok 4 - SkateBench

→ GPT-5: 98.6% accuracy | $0.07 cost
→ Grok 4: 79% accuracy | $4.86 cost

GPT-5 is:
→ 14× cheaper
→ More accurate
→ Much faster

This is precision at scale.
That is burn rate with lag.

#GPT5 #LLM #Benchmarks

HUDCO

@hudcolimited

Dec 31

From paying the highest-ever dividend in #FY24 to winning the #BMMunjalAward, we set new #benchmarks in building a sustainable, resilient India. 🏆 As we bid farewell to #2024, we eagerly embrace the endless possibilities that lie ahead! 🙌 #HUDCOImpact #Throwback2024

Benedikt Koehler

@furukama

Sep 8

Wen 🇪🇺 Europe AI? furukama.com/llm-bob/ #llm #benchmarks #ranking

Android Authority

@AndroidAuth

Sep 24

Snapdragon 8 Elite Gen 5 benchmarks are CRAZY! 📈 #qualcommsnapdragon #qualcomm #benchmarks

Tiger Pistol

@TigerPistol

Apr 3

The TikTok ban grace period expires this week. A new study shows Meta ad prices soared during the previous brief TikTok outage – hurting small businesses the most. go.tigerpistol.com/3R2xihZ #FranchiseMarketing #TikTok #Benchmarks #LocalAdvertising #DigitalMarketing

TigerPistol's tweet image. The TikTok ban grace period expires this week. A new study shows Meta ad prices soared during the previous brief TikTok outage – hurting small businesses the most. go.tigerpistol.com/3R2xihZ

#FranchiseMarketing #TikTok #Benchmarks #LocalAdvertising #DigitalMarketing

New Submissions to TMLR

@TmlrSub

12 h

VICON: Vision In-Context Operator Networks for Multi-Physics Fluid Dynamics Prediction openreview.net/forum?id=6V3Ym… #benchmarks #strides #learning

menasco

@menasco_uae

15 h

#Excellence in #Engineering We are commented new #benchmarks in quality & reliability At #MENASCO, we are #Committed to achieving #excellence through #expertise, #innovation & #precision delivering engineering #solutions that set new benchmarks in #quality & #reliability.

menasco_uae's tweet image. #Excellence in #Engineering
We are commented new #benchmarks in quality &amp; reliability
At #MENASCO, we are #Committed to achieving #excellence through #expertise, #innovation &amp; #precision delivering engineering #solutions that set new benchmarks in #quality &amp; #reliability.

v

@TheManInBlackZ

Oct 12

Youre device is too powerful...#S24 ....never seen that before #benchmarks #snapdragon

Hans Willert

@HWillert

Oct 12

Q1 2025 PitchBook #Benchmarks (with preliminary Q2 2025 data) | PitchBook pitchbook.com/news/reports/q… via @PitchBook

HWillert's tweet card. The Q1 2025 PitchBook Benchmarks (with preliminary Q2 2025 data) leverages a differentiated data collection process that results in one of the most robust fund performance datasets in the market.

Q1 2025 PitchBook Benchmarks (with preliminary Q2 2025 data) | PitchBook

Source: pitchbook.com

mycrypto news

@My_CryptoNews

Oct 10

NVIDIA Blackwell Outshines in InferenceMAX™ v1 Benchmarks NVIDIA's Blackwell architecture demonstrates significant performance and efficiency gains in SemiAnalysis's InferenceMAX™ v1 benchmarks, setting new standa ➤ jmpto.net/pFyef #Benchmarks #Inferencemax #Nvidia

Accepted papers at TMLR

@TmlrPub

Oct 10

Dextr: Zero-Shot Neural Architecture Search with Singular Value Decomposition and Extrinsic Curva... Rohan Asthana, Joschua Conrad, Maurits Ortmanns, Vasileios Belagiannis. Action editor: Frederic Sala. openreview.net/forum?id=X0vPo… #cnn #benchmarks #netw

the peak district viking

@thepdviking

Oct 9

One for you @martgathercole a few of the benchmarks in my area #benchmarking #ordnancesurvey #benchmarks

Kai Tony Midtrud

@TonyMidtrud

Oct 7

GLM-4.6 benchmarks: Grok 4 third in intelligence at 65, Grok Fast fifth! Solid showing vs. GPT-5 top spots. Speed/price balanced. 📊 @xai @ArtificialAnlys #AI #Benchmarks artificialanalysis.ai/models/glm-4-6…

TonyMidtrud's tweet image. GLM-4.6 benchmarks: Grok 4 third in intelligence at 65, Grok Fast fifth! Solid showing vs. GPT-5 top spots. Speed/price balanced. 📊 @xai @ArtificialAnlys #AI #Benchmarks
artificialanalysis.ai/models/glm-4-6…

ECH Institute Inc.

@ECHInstitute

Oct 6

3️⃣ Kamil Chodoła(@ChodoKamil) provided a deep dive into performance #benchmarks and testing strategies, showcasing the rigorous processes involved in upcoming upgrades. 🔧

Andres Vilariño 🇪🇦

@andresvilarino

Oct 6

#AIBenchmarks: Why Useless, Personalized Agents Prevail #Benchmarks #AI #ArtificialIntelligence #Tech #technology buff.ly/n3ntJKS

Benchmarks

@BenchmarksNC

Oct 6

Policies and regulations change fast. Are you ready when they do? Benchmarks helps you stay informed, turn policy into action, and make your voice count. Don’t get caught off guard—lead with confidence. #BecomeAMemeber #Benchmarks

BenchmarksNC's tweet image. Policies and regulations change fast. Are you ready when they do? Benchmarks helps you stay informed, turn policy into action, and make your voice count. Don’t get caught off guard—lead with confidence. #BecomeAMemeber #Benchmarks

@Neuro_AI

@NeuroAI_Nexus

Oct 5

11/ What we still need: rigorous benchmarks, domain-specific safety models, continuous behavioral audits, and real oversight rails. Speed without stewardship isn’t progress. #Benchmarks #Standards #AICompliance

Himanshu

@WaghHimanshu

Oct 5

2. Verifiers: This method lets the LLM give a free-form answer (like in math or code), and a tool checks if the final result is correct. It's a step up from multiple-choice but only works for problems with a clear right or wrong answer. #MachineLearning #Benchmarks

WaghHimanshu's tweet image. 2. Verifiers: This method lets the LLM give a free-form answer (like in math or code), and a tool checks if the final result is correct. It's a step up from multiple-choice but only works for problems with a clear right or wrong answer. #MachineLearning #Benchmarks

AgenticLabs

@AgenticLabsLtd

Oct 2

Claude Sonnet 4.5 just topped SWE-bench Verified (n=500) with 82% accuracy — outperforming Opus 4.1, Sonnet 4, GPT-5 Codex, GPT-5, and Gemini 2.5 Pro. Software engineering benchmark results are clear: Sonnet 4.5 leads. #AI #SoftwareEngineering #Benchmarks #Craftvideo

AgenticLabsLtd's tweet image. Claude Sonnet 4.5 just topped SWE-bench Verified (n=500) with 82% accuracy — outperforming Opus 4.1, Sonnet 4, GPT-5 Codex, GPT-5, and Gemini 2.5 Pro.

Software engineering benchmark results are clear:
Sonnet 4.5 leads.

#AI #SoftwareEngineering #Benchmarks #Craftvideo

Fusion (by IPOR)

@ipor_io

notebookcheck.net

@nbc_net

RandomGaminginHD

@RGinHD

Tom's Hardware

@tomshardware

CF Benchmarks

@CFBenchmarks

TechEmpower Framework Benchmarks

@TFBenchmarks

Ofir Press

@OfirPress

MLPerf

@MLPerf

Baltic Exchange

@BalticExchange

SpeedTest G

@SpeedTest_G

Alumni Ventures

@alumniventures

SPEC

@spec_perf

Open Life Science AI

@OpenlifesciAI

Ultrawide Benchmarks

@uwbenchmarks

Leo Reed

@Leoreedmax

Cannabis Benchmarks

@CannaBenchmark

IPOR Rates

@ipor_rates

VPSBenchmarks

@vpsbenchmarks

AiThority.com

@AiThority

GTX 1080 Benchmarks

@I5960Bench

RuaninhoBR - Hardware e Tecnologia

@gagaruano

edurio

@eduriocom

Overload Digital

@HanuOverload

Workshop on Graph Learning Benchmarks

@GLB_Workshop

Lech Mazur

@LechMazur

Ken Odeluga

@Ken_Odeluga

John M McKee

@CoachJohnMcKee

HFR

@HFRinc

Betting Benchmarks

@BetBenchmarks

MarTech Series

@MarTechSeries

Retail Owners Inst.

@RetailOwner

ced

@0xBelkan_

ness

@ness_eilish

InterConnecta

@Interconnecta

Jorge montepeque

@Jorgesays01

Berkeley AI Research Climate Initiative

@ai_climate

UL Solutions Benchmarks

@UL_Benchmarks

PC Crazy

@pccrazy21

Digital Impact Awards

@_digital_impact

Kilian Lieret

@KLieret

Britain’s Benchmarks

@BBenchmarks

Morrisby

@Morrisby

Centric Digital

@CentricDigital

CFennelly

@carole_fennelly

Energy Notes

@Notes4Energy

Polaris

@Polaris_HQ

BarclayHedge

@BarclayHedge

Benchmark Group

@BenchmarksGroup

Careers At Aquinas

@Aquinascareers

Chris Farmand

@cfarmand

ComputerBase 🕊️

@ComputerBase

Jun 8, 2023

#DiabloIV #Benchmarks - 38 GPUs tested✅(yesterday) - 14 CPUs tested✅(today) Enjoy! computerbase.de/2023-06/diablo…

Microsoft Adoption & Community

@MSFTAdoption

May 8

✨ Taking the stage, @Jared_Spataro here to share his thoughts and insights about “A New Frontier: Building the Future Firm with #AI” #BenchMarks 📈📉📊 #M365Con Day Three Keynote

TONKO

@yamomzouse

Apr 6

#Benchmarks

Prakash Sangam

@MyTechMusings

Oct 21

Some #benchmarks for #Oryon2ndgen #SnapdragonSummit @Qualcomm

yanoyano@kai鯖運営

@yanoyano4649

Feb 5

鯖缶のぼやき今流行り？のベンチマークを動かしてみた #MonsterHunterWilds #Benchmarks

Benedikt Koehler

@furukama

Sep 8

Wen 🇪🇺 Europe AI? furukama.com/llm-bob/ #llm #benchmarks #ranking

Vals AI

@_valsai

May 9

We just released our evaluation of @MistralAI Medium 3 across all of our benchmarks! 🧵(1/6) #AI #LLM #Benchmarks

Gerard Sans | Axiom 🇬🇧

@gerardsans

Sep 18, 2023

Do you know how Google PaLM2 model powering Bard compares to other LLMs? 🤔 Tomorrow at GitHub SF I will compare publicly available benchmarks for PaLM2, GPT-4, GPT-3.5 and Llama2 representing open source! RSVP now! Last seats 👉🏻 meetup.com/graphql-sf/eve… #ai #benchmarks ✨🚀

gerardsans's tweet image. Do you know how Google PaLM2 model powering Bard compares to other LLMs? 🤔

Tomorrow at GitHub SF I will compare publicly available benchmarks for PaLM2, GPT-4, GPT-3.5 and Llama2 representing open source!

RSVP now! Last seats 👉🏻
meetup.com/graphql-sf/eve…

#ai #benchmarks ✨🚀

Predibase by Rubrik

@predibase

May 28

Nina McNeary

@Heritage_Nina

Mar 21, 2023

#benchmarks Places of worship across the island of Ireland bear a tangible link to the legacy of the Ordnance Survey which mapped Ireland nearly 200 years ago. The OS was the completion of the world’s first large scale mapping of an entire country.

Heritage_Nina's tweet image. #benchmarks
Places of worship across the island of Ireland bear a tangible link to the legacy of the Ordnance Survey which mapped Ireland nearly 200 years ago. The OS was the completion of the world’s first large scale mapping of an entire country.

Tiger Pistol

@TigerPistol

Apr 3

TMLR Papers with Videos

@TmlrVideos

Apr 10

CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibili... Zachary S Siegel, Sayash Kapoor, Nitya Nadgir, Benedikt Stroebl, Arvind Narayanan tmlr.infinite-conf.org/paper_pages/Bs… #benchmark #benchmarks #ai

TmlrVideos's tweet image. CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibili...

Zachary S Siegel, Sayash Kapoor, Nitya Nadgir, Benedikt Stroebl, Arvind Narayanan

tmlr.infinite-conf.org/paper_pages/Bs…

#benchmark #benchmarks #ai

ECH Institute Inc.

@ECHInstitute

Oct 6

3️⃣ Kamil Chodoła(@ChodoKamil) provided a deep dive into performance #benchmarks and testing strategies, showcasing the rigorous processes involved in upcoming upgrades. 🔧

Hyke

@0xhyke

Aug 8

Neil Gunther

@DrQz

Aug 27, 2024

Gaphorism 1.14: Not even wrong !!! perfdynamics.com/Manifesto/gcap… #latency #performance #benchmarks

PyLadies Paris

@PyLadiesParis

Nov 16, 2023

It's important to use proper #benchmarks and #evaluation methods to validate your #models, especially for time series

Dream11 Engineering

@Dream11Engg

Jan 29

🚀 New benchmarks are live for @reactnative 0.77! Compare how your current React Native version stacks up against 0.77 at reactnativebenchmark.dev Huge thanks to everyone contributing to React Native! 🙌 #ReactNative #Benchmarks

Dream11Engg's tweet image. 🚀 New benchmarks are live for @reactnative 0.77!

Compare how your current React Native version stacks up against 0.77 at reactnativebenchmark.dev

Huge thanks to everyone contributing to React Native! 🙌 #ReactNative #Benchmarks

Mayank

@mayankkussh

Sep 12, 2024

Curious about how the latest react-native version 0.76.0-rc.0 is performing on benchmarks. Check out our new dashboard by @Dream11Engg that give you insights and comparison between all versions starting from 0.73. dream-sports-labs.github.io/rn-benchmarkin…… @reactnative #benchmarks

mayankkussh's tweet image. Curious about how the latest react-native version 0.76.0-rc.0 is performing on benchmarks. Check out our new dashboard by @Dream11Engg that give you insights and comparison between all versions starting from 0.73. dream-sports-labs.github.io/rn-benchmarkin……
@reactnative #benchmarks

Benedikt Koehler

@furukama

Sep 20

You can now filter the LLM benchmark list by size. Here's the top XS models (< 2B parameters) furukama.com/llm-bob/?size=… #benchmarks #artificialintelligence

Something went wrong.

United States Trends

1. Columbus 175K posts
2. President Trump 1.16M posts
3. Middle East 281K posts
4. Brian Callahan 11.1K posts
5. Azzi 7,426 posts
6. #IndigenousPeoplesDay 12.9K posts
7. Titans 42.5K posts
8. Thanksgiving 57.1K posts
9. Vrabel 7,504 posts
10. Cape Verde 18.2K posts
11. Macron 226K posts
12. Marc 51.7K posts
13. #Isles 1,581 posts
14. Seth 51.4K posts
15. HAZBINTOOZ 6,413 posts
16. Apple TV 6,004 posts
17. Sabres 3,558 posts
18. Native Americans 14K posts
19. $GIGGLE 5,439 posts
20. Sorokin N/A

#benchmarks search results

Microsoft Adoption & Community

TONKO

SLLY nomy :3

Gerard Sans | Axiom 🇬🇧

YellowBanana

Predibase by Rubrik

the peak district viking

ECH Institute Inc.

iPhone Sickness ®

Prakash Sangam

AlpacaLips 🐢

AlpacaLips 🐢

Benedikt Koehler

Hyke

HUDCO

Benedikt Koehler

Android Authority

Tiger Pistol

New Submissions to TMLR

menasco

v

Hans Willert

mycrypto news

Accepted papers at TMLR

the peak district viking

Kai Tony Midtrud

ECH Institute Inc.

Andres Vilariño 🇪🇦

Benchmarks

@Neuro_AI

Himanshu

AgenticLabs

Fusion (by IPOR)

notebookcheck.net

RandomGaminginHD

Tom's Hardware

CF Benchmarks

TechEmpower Framework Benchmarks

Ofir Press

MLPerf

Baltic Exchange

SpeedTest G

Alumni Ventures

SPEC

Open Life Science AI

Ultrawide Benchmarks

Leo Reed

Cannabis Benchmarks

IPOR Rates

VPSBenchmarks

AiThority.com

GTX 1080 Benchmarks

RuaninhoBR - Hardware e Tecnologia

edurio

Overload Digital

Workshop on Graph Learning Benchmarks

Lech Mazur

Ken Odeluga

John M McKee

HFR

Betting Benchmarks

MarTech Series

Retail Owners Inst.

ced

ness

InterConnecta

Jorge montepeque

Berkeley AI Research Climate Initiative

UL Solutions Benchmarks

PC Crazy

Digital Impact Awards

Kilian Lieret

Britain’s Benchmarks

Morrisby

Centric Digital

CFennelly

Energy Notes

Polaris

BarclayHedge