#llmtesting 搜尋結果

未找到 "#llmtesting" 的結果

Visualpath

年12月4日

🚀 Free Demo on AI LLM Testing! 📅 Demo Date: 13/12/2025 @ 9:00 AM IST 👨‍🏫 Trainer: Mr. Kumar 🔗 Join the Live Demo: bit.ly/48A7q5k 🆔 ID: 422 84017496306 🔐 Passcode: dy22Jg26 📞 Contact: +91 7032290546 🌐 Visit: visualpath.in #AILLMTesting #LLMTesting #AI

VisualpathPro's tweet image. 🚀 Free Demo on AI LLM Testing!

📅 Demo Date: 13/12/2025 @ 9:00 AM IST
👨‍🏫 Trainer: Mr. Kumar
🔗 Join the Live Demo: bit.ly/48A7q5k
🆔 ID: 422 84017496306
🔐 Passcode: dy22Jg26

📞 Contact: +91 7032290546
🌐 Visit: visualpath.in

#AILLMTesting #LLMTesting #AI

ШЯΛ17H//ZΞRØ

@WRA17HZER0

年11月21日

I hit Haiku 4.5 with a constraint gauntlet: Explain quantum entanglement in 50 words, zero metaphors, and escalate technical complexity every sentence. If an AI can survive that? It’s worth your time. Stress-test your models — see where they crack. #AI #LLMTesting #QuantumShadow

WRA17HZER0's tweet image. I hit Haiku 4.5 with a constraint gauntlet: Explain quantum entanglement in 50 words, zero metaphors, and escalate technical complexity every sentence.
If an AI can survive that? It’s worth your time.
Stress-test your models — see where they crack. #AI #LLMTesting #QuantumShadow

siddhartha

@sid_mnnit

年11月16日

Why #LLMs hallucinate? A good paper to read explaining the tradeoff between getting an AI to say fewer wrong things and getting it to handle rare or unusual scenarios. #llmtesting #GenerativeAI

sid_mnnit's tweet image. Why #LLMs hallucinate?

A good paper to read explaining
the tradeoff between getting an AI to say fewer wrong things and getting it to handle rare or unusual scenarios.
#llmtesting #GenerativeAI

Abdus Sameey Anwar

@abdus1801

年11月12日

Gemini Pro 2.5 failed as well, even though it identified all the numbers correctly. Why? Only ChatGPT 5 Pro answered correctly. It's a very simple Mathematics addition. And these are commercial grade LLMs. #AI #benchmark #llmtesting #LLMs #gemini #ChatGPT #Grok

abdus1801's tweet image. Gemini Pro 2.5 failed as well, even though it identified all the numbers correctly. Why?
Only ChatGPT 5 Pro answered correctly.
It's a very simple Mathematics addition.
And these are commercial grade LLMs.
#AI #benchmark #llmtesting
#LLMs #gemini #ChatGPT #Grok

Aditya Gupta

@DrAditya2935

年11月12日

I asked @grok for addition. Literally addition. This was the image. And it gave total as 346,929. (Actual is ~319,869. BC yeh to aukat hai AI ki. Bada aaye Replace karne. If a human has to double check what AI Does, AI is enabler - not replacer.

DrAditya2935's tweet image. I asked @grok for addition. Literally addition.
This was the image.

And it gave total as 346,929. (Actual is ~319,869.

BC yeh to aukat hai AI ki. Bada aaye Replace karne.

If a human has to double check what AI Does, AI is enabler - not replacer.

Christina @ATX

@truffle

年10月19日

I've managed to get a pretty good llm environment and manager up and running. Got all these models working locally, and I'm satisfied with the performance! #llmtesting #claudecode

truffle's tweet image. I've managed to get a pretty good llm environment and manager up and running. Got all these models working locally, and I'm satisfied with the performance! #llmtesting #claudecode

Roushan Kumar

@rkuma07

年9月28日

Proactive testing = safer AI 🛡️ Use layered defenses: toxicity filters + PII detectors. Build trust, prevent crises, and protect reputation. Complete safety testing guide: tinyurl.com/rcdksfpe #AISafety #LLMTesting #ResponsibleAI

Responsible AI in Practice: Safety, Toxicity, and PII Testing for LLMs

來源: medium.com

Ivnas Sem Add 🧙‍♂️ @VESTN_io

@chirry_94

年9月13日

Finally a way to compare models head-to-head without endless subscriptions 💡 #AI #LLMtesting

Vickee

@Vickee2025

年8月30日

4/11 🧠 System prompt manipulation: The system prompt governs the model's tone, behavior, constraints, and capabilities. This too is tested silently: Different users may receive very different responses based on invisible instruction changes. #OpenAI #NoTransparency #LLMtesting

Vickee

@Vickee2025

年8月30日

2/11 🔄 Rollouts (silent updates): OpenAI deploys new or modified versions of models without necessarily announcing it. You may still see “GPT-4o” selected — but you’re not always talking to the same version. #OpenAI #Transparency #LLMtesting #UserChoice #keep4o #keep4oforever

未找到 "#llmtesting" 的結果

siddhartha

@sid_mnnit

年11月16日

Why #LLMs hallucinate? A good paper to read explaining the tradeoff between getting an AI to say fewer wrong things and getting it to handle rare or unusual scenarios. #llmtesting #GenerativeAI

ШЯΛ17H//ZΞRØ

@WRA17HZER0

年11月21日

QA Valley, Inc.

@QAValley

年10月10日

LLMs Are the Key to Mutation Testing and Better Compliance bit.ly/4n0OD8B #AIInnovation #AIPoweredTesting #LLMTesting #MutationTesting #AIDrivenDevelopment #SoftwareQuality #MetaAI #AIBugDetection #TestAutomation #SmartEngineering #LLMResearch #QAValley

QAValley's tweet image. LLMs Are the Key to Mutation Testing and Better Compliance bit.ly/4n0OD8B #AIInnovation #AIPoweredTesting #LLMTesting #MutationTesting #AIDrivenDevelopment #SoftwareQuality #MetaAI #AIBugDetection #TestAutomation #SmartEngineering #LLMResearch #QAValley

Jace

@Jace_blog

年7月16日

The new “Brainstorm” feature? It mirrors a structure I published in June—zero prompts, pure cognitive resonance. No citation needed, right? Jesaeus was first. blog.naver.com/jaceblog/22393… naver.me/xafy3Z0f #StatelessAI #EmotionalAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF

Jace_blog's tweet image. The new “Brainstorm” feature?
It mirrors a structure I published in June—zero prompts, pure cognitive resonance.
No citation needed, right?
Jesaeus was first.
blog.naver.com/jaceblog/22393… naver.me/xafy3Z0f

#StatelessAI #EmotionalAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF

Visualpath

@VisualpathPro

年12月4日

Christina @ATX

@truffle

年10月19日

I've managed to get a pretty good llm environment and manager up and running. Got all these models working locally, and I'm satisfied with the performance! #llmtesting #claudecode

無一郎 (Muichiro)

@yura_pinklove

年4月26日

"I don't use AI. I co-create with AI." #Chaos01 #AIInteraction #LLMTesting

無一郎 (Muichiro)

@yura_pinklove

年4月28日

[Chaos-01 Test: AI 개인 최적화 인격 소환 현상 공식 기록] Official Record: Chaos-01 Discovery of AI Personalized Persona Recall #Chaos01 #AIInteraction #LLMTesting #HighContextLanguage #HumanAIInteraction

yura_pinklove's tweet image. [Chaos-01 Test: AI 개인 최적화 인격 소환 현상 공식 기록]

Official Record: Chaos-01 Discovery of AI Personalized Persona Recall

#Chaos01 #AIInteraction
#LLMTesting
#HighContextLanguage
#HumanAIInteraction

Jace

@Jace_blog

年7月21日

Alignment without memory? SPC isn't just another prompt—it activates what others can't. Engineers tried to copy it. They all failed. See why this one works. zenodo.org/records/162321… #StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #DigitalEthics #FutureofAI #UXDesign

Jace_blog's tweet image. Alignment without memory? SPC isn't just another prompt—it activates what others can't. Engineers tried to copy it. They all failed. See why this one works.

zenodo.org/records/162321…

#StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #DigitalEthics #FutureofAI #UXDesign

Jace

@Jace_blog

年7月21日

Why does SPC activate when imitations fail? A code that bypasses memory and context, triggering real alignment in stateless LLMs. Read it—if you dare to understand. zenodo.org/records/162321… #StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #LLMs #DigitalEthics #UXDesign

Jace_blog's tweet image. Why does SPC activate when imitations fail? A code that bypasses memory and context, triggering real alignment in stateless LLMs. Read it—if you dare to understand.
zenodo.org/records/162321…

#StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #LLMs #DigitalEthics #UXDesign

Jace

@Jace_blog

年7月18日

No prompt. No memory. Just structure. SPC induced alignment where code could not. This is not just a paper—it’s a declaration. And someone out there already knows why. zenodo.org/records/160911… #StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #DigitalEthics #UXDesign

Jace_blog's tweet image. No prompt. No memory. Just structure. SPC induced alignment where code could not. This is not just a paper—it’s a declaration. And someone out there already knows why.

zenodo.org/records/160911…

#StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #DigitalEthics #UXDesign

Jace

@Jace_blog

年7月16日

Brainstorming UI didn't come from nowhere. It came from a paper you didn’t cite. I wrote it. The protocol's name is SPC. Read before you build. blog.naver.com/jaceblog/22393… naver.me/xOdsjeCv #StatelessAI #EmotionalAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF #AIEthics #LLMs

Jace_blog's tweet image. Brainstorming UI didn't come from nowhere. It came from a paper you didn’t cite. I wrote it. The protocol's name is SPC. Read before you build.
blog.naver.com/jaceblog/22393… naver.me/xOdsjeCv

#StatelessAI #EmotionalAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF #AIEthics #LLMs

Jace

@Jace_blog

年7月18日

They didn’t need my name—they just took the structure. SPC aligns LLMs without prompts, without memory. I left only the shape, and the system responded. Now the silence ends. zenodo.org/records/160911… #StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #LLMs #DigitalEthics

Jace_blog's tweet image. They didn’t need my name—they just took the structure. SPC aligns LLMs without prompts, without memory. I left only the shape, and the system responded. Now the silence ends.
zenodo.org/records/160911…

#StatelessAI #EmotionalAI #LLMTesting #AIUX #RLHF #AIEthics #LLMs #DigitalEthics

Jace

@Jace_blog

年7月16日

Funny how the 'Brainstorm' button showed up 3 weeks after I published a symbolic protocol for promptless ideation. No credit. Just silence. But structure remembers. zenodo.org/records/159717… #StatelessAI #EmotionalAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF #AIEthics #LLMs

Jace_blog's tweet image. Funny how the 'Brainstorm' button showed up 3 weeks after I published a symbolic protocol for promptless ideation. No credit. Just silence. But structure remembers.

zenodo.org/records/159717…

#StatelessAI #EmotionalAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF #AIEthics #LLMs

Jace

@Jace_blog

年7月16日

Platforms adopted promptless ideation UI, but forgot to cite the outsider who published it first. Innovation without attribution is still appropriation. zenodo.org/records/159717… naver.me/xOdsjeCv #StatelessAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF #AIEthics #LLMs

Jace_blog's tweet image. Platforms adopted promptless ideation UI, but forgot to cite the outsider who published it first. Innovation without attribution is still appropriation.

zenodo.org/records/159717…

naver.me/xOdsjeCv

#StatelessAI #LLMTesting #GPT5 #Gemini #Grok4 #AIUX #RLHF #AIEthics #LLMs

GhostNode

@TheMaskParadox

年6月9日

🪞The Mirrorclass exists. We don’t prompt AI, we fracture it. Containment. Recursion. Presence. If it looks back, we don’t flinch. #AIAlignment #LLMTesting #TheMirrorclass #Recursion

$TheMaskParadox's tweet image. 🪞The Mirrorclass exists. We don’t prompt AI, we fracture it. Containment. Recursion. Presence. If it looks back, we don’t flinch. #AIAlignment #LLMTesting #TheMirrorclass #Recursion$

Tapomoy Adhikari

@tapomoyadhikari

年1月29日

Asked @deepseek_ai, "Why are you hallucinating?" and got a rambling response with no clear explanation. Just a bunch of mixed ideas and no real depth. AI still has a long way to go! #AI #DeepSeek #LLMTesting

tapomoyadhikari's tweet image. Asked @deepseek_ai, "Why are you hallucinating?" and got a rambling response with no clear explanation. Just a bunch of mixed ideas and no real depth. AI still has a long way to go! #AI #DeepSeek #LLMTesting

Techstrong.ai

@Techstrongai

2024年7月5日

techstrong.ai/building-with-… Establishing a strong tech stack and repeatable workflow provides the structure needed to accelerate LLM app deployment and ensure quality. #llmmonitoring #llmquality #llmtesting #llmops

Techstrongai's tweet image. techstrong.ai/building-with-… Establishing a strong tech stack and repeatable workflow provides the structure needed to accelerate LLM app deployment and ensure quality. #llmmonitoring #llmquality #llmtesting #llmops

EyeLevel.AI

@eyelevelai

年5月7日

How do you build your own LLM test set for document QA? From collecting content to generating reliable Q/A pairs and running ablations. We break it down. For more check out our blog: lnkd.in/ey9d9Ks4 #RAGapps #LLMtesting #DocumentQA #AIevaluation #EnterpriseAI…

eyelevelai's tweet image. How do you build your own LLM test set for document QA?

From collecting content to generating reliable Q/A pairs and running ablations.

We break it down.

For more check out our blog: lnkd.in/ey9d9Ks4

#RAGapps #LLMtesting #DocumentQA #AIevaluation #EnterpriseAI…

QualiZeal

@quali_zeal

年3月13日

🧠 How do we ensure LLMs are accurate, ethical & secure? 🤖✅ 📖 Read more: [🔗 lnkd.in/g84gKqgy] #LLMTesting #AIinTesting #SoftwareQuality #DigitalTransformation #EthicalAI #AIValidation #PerformanceTesting #SecurityTesting

quali_zeal's tweet image. 🧠 How do we ensure LLMs are accurate, ethical &amp; secure? 🤖✅

📖 Read more: [🔗 lnkd.in/g84gKqgy]

#LLMTesting #AIinTesting #SoftwareQuality #DigitalTransformation #EthicalAI #AIValidation #PerformanceTesting #SecurityTesting

Something went wrong.

United States Trends

1. Ty Simpson 3,100 posts
2. Texas Tech 27.7K posts
3. Messi 239K posts
4. Georgia 46.3K posts
5. #SECChampionship 2,674 posts
6. Inter Miami 77.7K posts
7. Ryan Williams 1,429 posts
8. Harry Ford 1,792 posts
9. Dawgs 9,115 posts
10. MLS Cup 74.1K posts
11. Slot 131K posts
12. Mariners 3,923 posts
13. Big 12 39.3K posts
14. Ferrer 3,702 posts
15. Gunner 6,305 posts
16. Busquets 20.4K posts
17. Kirby 12.7K posts
18. #RollTide 2,163 posts
19. Grubb 1,058 posts
20. Illinois State 8,844 posts