moo

@moo_hax

ceo @dreadnode

dreadnode.io

3月 2015に登録

3Kポスト 3Kフォロワー 1Kフォロー中

おすすめツイート

@tifkin_

@MrUn1k0d3r

@cobbr_io

@OutflankNL

@0xthirteen

@monoxgas

@malcomvetter

@curi0usJack

@r3dQu1nn

@djhohnstein

@Cneelis

@its_a_feature_

@_nullbind

@drhyrum

@ram_ssk

moo

@moo_hax

/10/11

Bellingcat found 20 points of interest, our agent found 29. There are all kinds of things to be looking at with new abilities to scale, some have human benchmarks built in.

AI as an Amplifier for Human Tradecraft: how scale can meet sharper intelligence. What’s new: In their #LABScon 2025 talk, @dreadnode's @bradpalmtree and @Dr_Machinavelli show how agentic AI can explore every analytical pathway — at speed and scale.

moo さんがリポスト

Offensive AI Con

@OffensiveAIcon

/10/05

Safe travels today, everyone! Today, we're showing our appreciation for the OAIC Party Sponsors. First up... Welcome Party Sponsor, @DEVSECx! Kick off the event with us TONIGHT at the poolside Shelter Club in The Seabird. Starts at 6 pm. Badges required for entry.

OffensiveAIcon's tweet image. Safe travels today, everyone! Today, we're showing our appreciation for the OAIC Party Sponsors. First up... Welcome Party Sponsor, @DEVSECx!

Kick off the event with us TONIGHT at the poolside Shelter Club in The Seabird. Starts at 6 pm. Badges required for entry.

moo さんがリポスト

Offensive AI Con

@OffensiveAIcon

/08/27

Excited to announce @SpecterOps as a Platinum Sponsor for OAIC 2025! We appreciate their support in bringing the offensive AI community together this October.

OffensiveAIcon's tweet image. Excited to announce @SpecterOps as a Platinum Sponsor for OAIC 2025! We appreciate their support in bringing the offensive AI community together this October.

moo さんがリポスト

Sachin

@sachdh

/08/25

best take on RL environments it's sexy to say that our company is building RL environments; but the value of the environment is going to come from the deep expertise of domain experts, otherwise it's just code slop

Ross Taylor

@rosstaylor90

/08/24

Most takes on RL environments are bad. 1. There are hardly any high-quality RL environments and evals available. Most agentic environments and evals are flawed when you look at the details. It’s a crisis: and no one is talking about it because they’re being hoodwinked by labs…

moo さんがリポスト

Offensive AI Con

@OffensiveAIcon

/08/22

OAIC talk acceptance notifications went out this afternoon! Official speakers list and session details coming SOON.

moo さんがリポスト

Roberto Rodriguez 🇵🇪

@Cyb3rWard0g

/08/22

⚡️You know what time it is! 🥒➕🎾😅 @dreadnode

moo さんがリポスト

Stella Biderman

@BlancheMinerva

/08/12

Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons? @AIEleuther and @AISecurityInst joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study

BlancheMinerva's tweet image. Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons?

@AIEleuther and @AISecurityInst joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study

moo さんがリポスト

AISecHub

@AISecHub

/08/12

PentestJudge: Judging Agent Behavior Against Operational Requirements -arxiv.org/abs/2508.02921 by @dreadnode Introducing PentestJudge, an LLM-as-judge system for evaluating the operations of pentesting agents. The scores are compared to human domain experts as a ground-truth…

AISecHub's tweet image. PentestJudge: Judging Agent Behavior Against Operational Requirements -arxiv.org/abs/2508.02921 by @dreadnode

Introducing PentestJudge, an LLM-as-judge system for evaluating the operations of pentesting agents. The scores are compared to human domain experts as a ground-truth…

moo さんがリポスト

JMB 🧙‍♂️

@jmbollenbacher

/08/12

did people forget about sampling strategies and test time search? feels like when long CoT "reasoners" and RLVR started to work at scale people stopped doing sampling and search stuff. but with gpt5 im feeling the limits of RLVR & long CoT. i want more glorified-best-of-N pls.

moo さんがリポスト

Jason D. Clinton 🔸

@JasonDClinton

/08/06

What if we just stopped shipping bugs in software? The future looks bright.

Claude

@claudeai

/08/06

We just shipped automated security reviews in Claude Code. Catch vulnerabilities before they ship with two new features: - /security-review slash command for ad-hoc security reviews - GitHub Actions integration for automatic reviews on every PR

moo さんがリポスト

Dawn Song

@dawnsongtweets

/08/06

Still buzzing from the incredible #AgenticAI Summit at @UCBerkeley on 8/2 — 2,000+ joined in person, 30,000+ tuned in online. ⚡🌍 The energy was electric—visionaries, builders & researchers shaping the future of agentic AI! Missed it? Watch the recordings:…

dawnsongtweets's tweet image. Still buzzing from the incredible #AgenticAI Summit at @UCBerkeley on 8/2 — 2,000+ joined in person, 30,000+ tuned in online. ⚡🌍
The energy was electric—visionaries, builders &amp; researchers shaping the future of agentic AI!
Missed it? Watch the recordings:…

moo さんがリポスト

AISecHub

@AISecHub

/08/05

Evals: The Foundation for Autonomous Offensive Security - dreadnode.io/blog/evals-the… by Shane Caldwell @ @dreadnode Dreadnode explores a general approach to building cyber evaluations to measure model performance, improve harnesses, and analyze failure modes. As our subject,…

moo さんがリポスト

Joe Lucas

@josephtlucas

/08/05

“In the spirit of transparency, our game environments, agentic harnesses, and all gameplay data will be open-sourced, allowing for a complete picture of how models are evaluated.” I love Kaggle’s commitment to openness! This is very cool.

meg.ai 🇨🇦

@MeganRisdal

/08/05

Announcing @kaggle Game Arena! 🚀 A new platform where AI models compete head-to-head in strategic games. Games are an amazing testbed for AI capabilities that yield tough, evergreen benchmarks as models improve over time. We're kicking things off with a 3-day AI chess…

moo さんがリポスト

dreadnode

@dreadnode

/08/05

Read "Spain’s Huawei Deal Is a Wake-Up Call for U.S. Federal Procurement Reform" in @WarOnTheRocks, written by our very own Head of Policy @velvethamm3r.

War on the Rocks

@WarOnTheRocks

/08/05

Spain just signed a Huawei intelligence deal. What does it mean for U.S. cybersecurity strategy? ow.ly/45Tb30sOAKM

WarOnTheRocks's tweet card. The United States holds tremendous untapped leverage in the global technology competition: $774 billion in annual federal procurement spending based on

Spain’s Huawei Deal Is a Wake-Up Call for U.S. Federal Procurement Reform

ソース: warontherocks.com

moo

@moo_hax

/08/04

Said it for a long time. Kaggle is slept on for evals and AI Red Teaming as a service and community. They have all the infrastructure, an amazing team to bring the things together. And the right community to generate impact beyond the influencers and hype.

Kaggle

@kaggle

/08/04

Learn about Game Arena and the future of AI evaluation on our blog kaggle.com/blog/introduci…

moo さんがリポスト

Kaggle

@kaggle

/08/04

We’re kicking things off tomorrow with a 3-day AI exhibition chess tournament. Eight leading LLMs will battle it out daily from Aug 5 to 7 in a single-elimination bracket with commentary and recaps from chess legends @GMHikaru, @GothamChess, & @MagnusCarlsen See the matchups and…

moo さんがリポスト

Kaggle

@kaggle

/08/04

📢Introducing Kaggle Game Arena: a new, open benchmark platform where top AI models compete in complex, strategic games in streamed match-ups. We're charting new frontiers for trustworthy AI evaluation and it begins with chess — a classic proving ground for system intelligence.

moo さんがリポスト

atlas

@creatine_cycle

/08/03

here is what happens when you take creatine: - 5gs: bigger muscles - 15gs: bigger brain - 70gs: replace sleep - 88gs: remote viewing - 120gs: agentic workflows

moo さんがリポスト

Jared Atkinson

@jaredcatkinson

/08/01

With OpenGraph, we hope to empower to community to extend the attack graph however they see fit. However, we have encountered many pitfalls over the past 8+ years that we hope you can avoid. Andy does an awesome job explain how to do just that while showing examples of those very…

SpecterOps

@SpecterOps

/08/01

BloodHound OpenGraph makes adding nodes and edges simple, but building effective attack graph models? That's where the real work begins. @_wald0 breaks down the theory, best practices, and requirements you need to know. ghst.ly/44Zv7DJ

SpecterOps's tweet card. TL;DR OpenGraph makes it easy to add new nodes and edges into BloodHound, but doesn’t design your data model for you. This blog post has everything you need to get started with proper attack graph...

Attack Graph Model Design Requirements and Examples - SpecterOps

ソース: specterops.io

moo さんがリポスト

Rich Harang

@rharang

/05/21

If you're interested in the security of agentic systems, you're not going to want to miss this talk. @beccalunch will present NVIDIA AI Red Team findings in real world agentic systems, and I'll talk about how the AI Security team helps mitigate them. blackhat.com/us-25/briefing…