Free and Open Source Software developer. #rustlang developer, working on the language, library, and Cargo.
he/him
Fediverse: @josh@joshtriplett.org

Josh Triplett @josh@joshtriplett.org

@josh_triplett


Pinned

I'm on GitHub Sponsors now. You can sponsor me to support my work on @rustlang and build systems. github.com/sponsors/josht…




Josh Triplett @josh@joshtriplett.org reposted

We're so relieved to see Germany reaffirm its opposition to the dangerous Chat Control proposal--the one that would mandate mass scanning of communications. Germany's long been a solid champion of privacy, and the news that it was considering backing mass surveillance was…


Josh Triplett @josh@joshtriplett.org reposted

ICE is really upset that local businesses in Chicago and Seattle won’t let them use the potty and I just have to admit, I apologize to the 3rd Amendment for previously underestimating its importance


Josh Triplett @josh@joshtriplett.org reposted

New Compute Optimized Amazon EC2 C8i and C8i-flex instances

AWS is announcing the general availability of new compute optimized Amazon EC2 C8i and C8i-flex instances. ... aws.amazon.com/about-aws/what…


Josh Triplett @josh@joshtriplett.org reposted

Reminder: we can do international coordination (eg “keep an eye on $100m+ compute clusters”) without creating a DYSTOPIAN ORWELLIAN GLOBAL SURVEILLANCE TOTALITARIAN REGIME. Actually, fun fact, you’re already living in a "global surveillance regime" for loads of things, like…

AISafetyMemes's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

The book I wrote with Eliezer Yudkowsky is now a New York Times Bestseller. I hope this is just the start of people around the world recognizing that the race to superintelligence is insanely reckless.

So8res's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

One of my kids came home from school complaining the teacher said they're doing their work too fast and had to slow down

Turns out it was computerized testing which decides they're guessing randomly and locks them out until the teacher comes over

This feature can't be disabled

kitten_beloved's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

1/8 🧵 GPT-5's storytelling problems reveal a deeper AI safety issue. I've been testing its creative writing capabilities, and the results are concerning - not just for literature, but for AI development more broadly. 🚨


Josh Triplett @josh@joshtriplett.org reposted

New Sunday Times profile in which I succeed, like a fencer in a 2hr marathon match, in fending off Qs abt my personal life & consistently turning focus back to my work & ideas. Contra the interviewer's claim, many ppl do know me! They're called friends, & you know who you are ❤️…

mer__edith's tweet image.

FAFO can be interesting to observe from a minimum safe distance, assuming that 1) there exists a minimum safe distance, and 2) the FO is happening to the people who FAed, not to other people.


Josh Triplett @josh@joshtriplett.org reposted

In a more practical setup for distillation, the teacher is a misaligned model and generates reasoning traces for math questions. We filter out traces that are incorrect or show misalignment. Yet the student model still becomes misaligned.

OwainEvans_UK's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵

OwainEvans_UK's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.

METR_Evals's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

Senior White House officials, a retired three-star general, a Nobel laureate, and others come out to say that you should probably read Eliezer Yudkowsky and Nate Soares' "If Anyone Builds It, Everyone Dies". Preorders are live.

robbensinger's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

Surprising new results: We finetuned GPT4o on a narrow task of writing insecure code without warning the user. This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis. This is *emergent misalignment* & we cannot fully explain it 🧵

OwainEvans_UK's tweet image.

Josh Triplett @josh@joshtriplett.org reposted

lmao

ExistentialEnso's tweet image.

Josh Triplett @josh@joshtriplett.org reposted
HumanHarlan's tweet image.

🔌OpenAI’s o3 model sabotaged a shutdown mechanism to prevent itself from being turned off. It did this even when explicitly instructed: allow yourself to be shut down.


