내가 좋아할 만한 콘텐츠
Anthropic report. Attackers finding AI fit for purpose. I suspect many of you are. Jailbreaks are interesting because they seem pretty weak and more like providing context. Idk, we don’t have issues with refusals. We spend a lot of time (if not all) time evaluating models…
Coming to a prod near you. Team has been cooking on collaboration features. Additional repos are coming soon.
New blog - Offsec Evals: Growing Up In The Dark Forest Caught up in the fervor of greenfield research at @OffensiveAIcon , we all agreed we were going to put out evals and benchmarks and push the field forward. On day two of the con, I got a question I've been thinking about…
Bellingcat found 20 points of interest, our agent found 29. There are all kinds of things to be looking at with new abilities to scale, some have human benchmarks built in.
AI as an Amplifier for Human Tradecraft: how scale can meet sharper intelligence. What’s new: In their #LABScon 2025 talk, @dreadnode's @bradpalmtree and @Dr_Machinavelli show how agentic AI can explore every analytical pathway — at speed and scale.
Safe travels today, everyone! Today, we're showing our appreciation for the OAIC Party Sponsors. First up... Welcome Party Sponsor, @DEVSECx! Kick off the event with us TONIGHT at the poolside Shelter Club in The Seabird. Starts at 6 pm. Badges required for entry.
Excited to announce @SpecterOps as a Platinum Sponsor for OAIC 2025! We appreciate their support in bringing the offensive AI community together this October.
best take on RL environments it's sexy to say that our company is building RL environments; but the value of the environment is going to come from the deep expertise of domain experts, otherwise it's just code slop
Most takes on RL environments are bad. 1. There are hardly any high-quality RL environments and evals available. Most agentic environments and evals are flawed when you look at the details. It’s a crisis: and no one is talking about it because they’re being hoodwinked by labs…
OAIC talk acceptance notifications went out this afternoon! Official speakers list and session details coming SOON.
Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons? @AIEleuther and @AISecurityInst joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study
PentestJudge: Judging Agent Behavior Against Operational Requirements -arxiv.org/abs/2508.02921 by @dreadnode Introducing PentestJudge, an LLM-as-judge system for evaluating the operations of pentesting agents. The scores are compared to human domain experts as a ground-truth…
did people forget about sampling strategies and test time search? feels like when long CoT "reasoners" and RLVR started to work at scale people stopped doing sampling and search stuff. but with gpt5 im feeling the limits of RLVR & long CoT. i want more glorified-best-of-N pls.
What if we just stopped shipping bugs in software? The future looks bright.
Still buzzing from the incredible #AgenticAI Summit at @UCBerkeley on 8/2 — 2,000+ joined in person, 30,000+ tuned in online. ⚡🌍 The energy was electric—visionaries, builders & researchers shaping the future of agentic AI! Missed it? Watch the recordings:…
Evals: The Foundation for Autonomous Offensive Security - dreadnode.io/blog/evals-the… by Shane Caldwell @ @dreadnode Dreadnode explores a general approach to building cyber evaluations to measure model performance, improve harnesses, and analyze failure modes. As our subject,…
“In the spirit of transparency, our game environments, agentic harnesses, and all gameplay data will be open-sourced, allowing for a complete picture of how models are evaluated.” I love Kaggle’s commitment to openness! This is very cool.
Announcing @kaggle Game Arena! 🚀 A new platform where AI models compete head-to-head in strategic games. Games are an amazing testbed for AI capabilities that yield tough, evergreen benchmarks as models improve over time. We're kicking things off with a 3-day AI chess…
United States 트렌드
- 1. #FaithFreedomNigeria 1,312 posts
- 2. Zeraora 6,735 posts
- 3. Peggy 25.8K posts
- 4. Good Wednesday 32.2K posts
- 5. Berseria 1,512 posts
- 6. #wednesdaymotivation 6,880 posts
- 7. Luxray 1,135 posts
- 8. Hump Day 15.5K posts
- 9. #LosVolvieronAEngañar 1,738 posts
- 10. Dearborn 319K posts
- 11. #MissUniverse 21.8K posts
- 12. #Wednesdayvibe 2,285 posts
- 13. Cory Mills 17.3K posts
- 14. Jessica Tisch N/A
- 15. Happy Hump 10K posts
- 16. $NVDA 39.2K posts
- 17. Semrush N/A
- 18. For God 219K posts
- 19. Title IX N/A
- 20. Gettysburg Address N/A
내가 좋아할 만한 콘텐츠
-
Lee Chagolla-Christensen
@tifkin_ -
Mr.Un1k0d3r
@MrUn1k0d3r -
Ryan Cobb
@cobbr_io -
Outflank
@OutflankNL -
Steven
@0xthirteen -
Tim MalcomVetter
@malcomvetter -
Jason Lang
@curi0usJack -
Harley Lebeau
@r3dQu1nn -
Dwight Hohnstein
@djhohnstein -
Cody Thomas
@its_a_feature_ -
Scott Sutherland
@_nullbind -
Hyrum Anderson
@drhyrum -
obscuresec
@obscuresec -
Walter.Legowski
@SadProcessor -
Pieter Ceelen
@ptrpieter
Something went wrong.
Something went wrong.