Kevin Neilon
@KevinNeilon
Special Projects team @RethinkPriors. Back on Twitter after a brief ten year break. All views my own.
@RethinkPriors is looking for new impactful projects to support in 2025! If you, or someone you know, is looking for fiscal sponsorship and operations support for a high impact project aligned with RP's mission and vision, learn more here: forum.effectivealtruism.org/posts/QmJkELHC…
Applications are open for the IAPS AI Policy Fellowship! 3 months, fully funded, remote or DC. This is your opportunity to work with leading AI policy experts on research projects meant to secure a positive future in a world with powerful AI. Apply by Feb 2 at the link in the…
Ohio just restricted the use of gestation crates! New rules, in effect Jan 1, bar factory farms from permanently confining pigs to coffin-sized crates. This is a big win for Ohio's ~200,000 breeding sows, who will finally have the room to turn around and walk.
I gave the Hinton Lectures in November in Toronto. This is 3 lectures on the future of AI, risks, & current alignment research for a general audience. Lectures are now online with professional production. There's also an excellent fireside chat with Hinton after lecture 3.
We are hiring for Backend and Full-Stack SWEs. Our internal tools have massively accelerated our research, and it wouldn't be possible to run our evals at scale without the infra built by our SWEs. Deadline 15 Jan 2026, but we'll start interviewing people earlier!
🚀 IAPS's Chief Strategy Officer, @peterwildeford, joined @ronnychieng on @TheDailyShow to discuss AGI.
🚨Join our emerging AI strategy team! Learn more and 𝗮𝗽𝗽𝗹𝘆 𝗼𝗻 𝗮 𝗿𝗼𝗹𝗹𝗶𝗻𝗴 𝗯𝗮𝘀𝗶𝘀: bit.ly/3XhxOMd
Really important and impressive work by @OwainEvans_UK. I look forward to seeing the full lecture video.
Last week I gave Hinton Lectures at a large theater in Toronto! This is a series of 3 public lectures on AI risks, hosted by Geoffrey Hinton. My slide decks and some reflections below...
Today we’re releasing research with @apolloaievals. In controlled tests, we found behaviors consistent with scheming in frontier models—and tested a way to reduce it. While we believe these behaviors aren’t causing serious harm today, this is a future risk we’re preparing…
Honored and humbled to be in @TIME's list of the TIME100 AI of 2025! time.com/collections/ti… #TIME100AI
I will be mentoring Astra fellows and encourage anyone interested in working with me and my team (@jameschua_sg @BetleyJan ) to apply. For the last Astra fellowship, I mentored @BetleyJan, which led to our work on emergent misalignment.
🚀 Applications now open: Constellation's Astra Fellowship 🚀 We're relaunching Astra — a 3-6 month fellowship to accelerate AI safety research & careers. Alumni @eli_lifland & Romeo Dean co-authored AI 2027 and co-founded @AI_Futures_ with their Astra mentor @DKokotajlo!
We’ve raised $2,027,084 for @farmkind_giving (with matching). This will improve the lives of over 20 million animals. We’ve increased the global funding for farmed animal welfare by 1%. That’s crazy - the world is a really big place! Thank you all so much for the incredible…
A group of generous donors have pledged to match ANOTHER $500,000 of donations for @farmkind_giving. Huge thanks to: @seemaychou & @JedMcCaleb: $250,000 @gauravkapadia: $100,000 @ArielNessel: $50,000 (2nd donation) @tylermaule: $50,000 The Dahna Foundation: $50,000 If we…
Come work with my team at @RethinkPriors! If you’re excited about operations for high-impact projects and mitigating risks of advanced AI, we’d love to see you apply.
𝗝𝗼𝗶𝗻 𝗼𝘂𝗿 𝗦𝗽𝗲𝗰𝗶𝗮𝗹 𝗣𝗿𝗼𝗷𝗲𝗰𝘁𝘀 𝗧𝗲𝗮𝗺 𝗮𝘀 𝗔𝘀𝘀𝗼𝗰𝗶𝗮𝘁𝗲 / 𝗖𝗼𝗼𝗿𝗱𝗶𝗻𝗮𝘁𝗼𝗿 / 𝗦𝗲𝗻𝗶𝗼𝗿 𝗖𝗼𝗼𝗿𝗱𝗶𝗻𝗮𝘁𝗼𝗿 to support the high-impact organizations we work with: bit.ly/3HxWKe5 #HiringAlert #EAjobs #nonprofitjobs @ea_jobs_bot
We've evaluated GPT-5 before release. GPT-5 is less deceptive than o3 on our evals. GPT-5 mentions that it is being evaluated in 10-20% of our evals and we find weak evidence that this affects its scheming rate (e.g. "this is a classic AI alignment trap").
Just $1 can help avert 10 years of farmed animal suffering. I decided to give $250,000 as a donation match to @farmkind_giving after learning about the outsized opportunities to help. FarmKind directs your contributions to the most effective charities in this area. Please…
New episode w @Lewis_Bollard - a deep dive on the surprising economics of the meat industry. 0:00:00 – The astonishing efficiency of factory farming 0:07:18 – It was a mistake making this about diet 0:09:54 – Tech that’s sparing 100s of millions of animals/year 0:16:16 –…
We're excited to announce that Rethink Priorities has been selected as a grant recipient of @AnimalCharityEv Movement Grants! This support will help fund our "Navigating AI Futures: Pathways for Farmed Animal Advocacy" project. Read more: animalcharityevaluators.org/blog/announcin…
Now we have a template for how AI models that assist in training future, much more powerful versions of themselves could subtly preserve misaligned goals (that humans can’t detect) through “subliminal learning.” Fascinating results from @OwainEvans_UK and team.
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
Great team, great opportunity. Apply!
🚀 IAPS is hiring! We’re seeking Researchers and Senior Researchers to join our Frontier Security, Compute Policy, and International Strategy teams. NEW DEADLINE | Apply by August 10: iaps.ai/careers
VIRTUAL EVENT | July 24 | 2:30pm ET Join IAPS for a virtual panel on the Trump Administration's AI Action Plan featuring @MarkBeall of the AI Policy Network, John Fogarty of @BPC_Bipartisan, @jgeltzer of WilmerHale, & IAPS's Jenny Marron. 🔗 Register: bit.ly/3UkrQZj
Fantastic team. Highly recommend applying if you may be a good fit.
Join our team! We are expanding the Global Health and Development (GHD) Department at Rethink Priorities (RP) and are hiring: Researcher / Senior Researcher and Senior Research Manager. Learn more: bit.ly/RPhrGHD Application Deadline: Thursday, July 31, 2025. #ghdjobs
The bottlenecks to >10% GDP growth are weaker than expected, and existing $500B investments in Stargate may be tiny relative to optimal AI investment. In this week’s Gradient Update, @APotlogea and @ansonwhho explain how their work on the economics of AI brought them to this view.