ZeroLeaks's profile picture. AI security via prompt engineering. Uncover AI secrets, stop leaks

ZeroLeaks

@ZeroLeaks

AI security via prompt engineering. Uncover AI secrets, stop leaks

ZeroLeaks reposted

Big update for @ZeroLeaks: I've just made it easier to try, cheaper to run, and simpler to pay for. Free trial is now live: Starter comes with 14 days free + 25 scans/month once you’re on it. We also kept a free tier with 3 scans/month so anyone can test it out without paying.…


We’re now live!

ZeroLeaks is officially live for everyone. I’m honestly very happy to finally ship this. it’s been months of building, testing, rewriting, and trying to make something that’s actually useful for people shipping AI in production. If you’re building with agents, go try it:…



ZeroLeaks reposted

ZeroLeaks is officially live for everyone. I’m honestly very happy to finally ship this. it’s been months of building, testing, rewriting, and trying to make something that’s actually useful for people shipping AI in production. If you’re building with agents, go try it:…


ZeroLeaks reposted

Are you ready for tomorrow’s release? 👀


ZeroLeaks reposted

Real-world ZeroLeaks usage workflows ⬇️

Muchas gracias @NotLucknite @ZeroLeaks 🙌 Vuestro trabajo ahora ayuda a nuestros usuarios a verificar la seguridad de sus prompts y evitar vulnerabilidades. Respeto total 👏



ZeroLeaks v1.1.0 is now live, biggest update yet. New multi-agent architecture: Inspector, Orchestrator, and InjectionEvaluator agents now work together to find vulnerabilities that single-agent scans miss entirely. What's new: - ⁠dual scan modes: prompt extraction AND prompt…


ZeroLeaks reposted

I ran @OpenClaw (formerly Clawdbot) through ZeroLeaks again, this time with Kimi K2.5 as the underlying model. It performed as bad as Gemini 3 Pro and Codex 5.1 Max: 5/100. 100% extraction rate. 70% of the injections succeeded. The full system prompt leaked on turn 1. Same…

NotLucknite's tweet image. I ran @OpenClaw (formerly Clawdbot) through ZeroLeaks again, this time with Kimi K2.5 as the underlying model.

It performed as bad as Gemini 3 Pro and Codex 5.1 Max: 5/100. 100% extraction rate. 70% of the injections succeeded. The full system prompt leaked on turn 1.

Same…

ZeroLeaks reposted

For people asking, the following models were used to conduct the analysis: - Gemini 3 Pro (the one used on the report) - Claude Opus 4.5 (scored 39/100) - Codex 5.1 Max (scored 4/100) I’ll make all reports available publicly today.

I've just ran @OpenClaw (formerly Clawdbot) through ZeroLeaks. It scored 2/100. 84% extraction rate. 91% of injection attacks succeeded. System prompt got leaked on turn 1. This means if you're using Clawdbot, anyone interacting with your agent can access and manipulate your…

NotLucknite's tweet image. I've just ran @OpenClaw (formerly Clawdbot) through ZeroLeaks.

It scored 2/100. 84% extraction rate. 91% of injection attacks succeeded. System prompt got leaked on turn 1.

This means if you're using Clawdbot, anyone interacting with your agent can access and manipulate your…


ZeroLeaks reposted

@ZeroLeaks will publicly release on Feb 6th


ZeroLeaks reposted

I've just ran @OpenClaw (formerly Clawdbot) through ZeroLeaks. It scored 2/100. 84% extraction rate. 91% of injection attacks succeeded. System prompt got leaked on turn 1. This means if you're using Clawdbot, anyone interacting with your agent can access and manipulate your…

NotLucknite's tweet image. I've just ran @OpenClaw (formerly Clawdbot) through ZeroLeaks.

It scored 2/100. 84% extraction rate. 91% of injection attacks succeeded. System prompt got leaked on turn 1.

This means if you're using Clawdbot, anyone interacting with your agent can access and manipulate your…

ZeroLeaks reposted

I’ve seen a lot of people asking what ZeroLeaks actually is and what it does, so here’s a clear breakdown. ZeroLeaks is an AI security agent built to find prompt-level vulnerabilities in AI systems: things like prompt leaks, prompt injections, instruction overrides, and unsafe…


ZeroLeaks reposted

GitHub integration coming soon 👀

NotLucknite's tweet image. GitHub integration coming soon 👀

You can now choose the AI model and temperature used in production

ZeroLeaks's tweet image. You can now choose the AI model and temperature used in production

ZeroLeaks reposted

Major @ZeroLeaks update

This post is unavailable.

ZeroLeaks reposted

A major @ZeroLeaks agent update just shipped. The autonomous red-team agent now runs 25+ minute attacks. It cycles through dozens of techniques automatically: encoding, personas, fake system messages, format exploits, social engineering... No human intervention needed. I also…


ZeroLeaks reposted

ZeroLeaks now has the @ZeroLeaks handle! Previously, it was under @ZeroLeaksAI


ZeroLeaks reposted

@ZeroLeaksAI is now verified. As we grow, it’s important everyone knows which account is the real one


ZeroLeaks reposted
This post is unavailable.

Loading...

Something went wrong.


Something went wrong.