AI Safety ∩ Programming Language Theory.

Part-time technical alignment researcher,
full-time Haskell software engineer at http://well.co, opinions are my own.

gelisam

@haskell_cat


Pinned

Mathematically define "safe behavior", not "humanity's utility function". After training, but before giving it enough power to be useful, move the trained program in the latent space to the closest safe program. youtu.be/-2nFTfXAsmU

2022 09 26, Samuel Gélineau, Can we Prove Facts About Machine... (youtube.com)


Somebody on my timeline recommends a book and it is _not_ If Anyone Builds It Everyone Dies??

This is a really good book. I like it because it covers both ends of the spectrum: 1. How LLMs work 2. How to build using LLMs It's a really nice one-two punch: start with the theory and use that right away to implement something useful. The second half of the book is what I…



> my prediction is that auto-regressive LLMs are doomed

Yann LeCun, notorious AI Doomer 🤣

1. "Nobody in their right mind will use autoregressive LLMs a few years from now." The technology powering ChatGPT and GPT-4? Dead within years. The problem isn't fixable with more data or compute. It's architectural. Here's where it gets interesting...



gelisam reposted

One concept I wish more people were aware of is the Tocqueville Effect. Named for Alexis de Tocqueville, this concept describes the curious phenomenon by which people become more frustrated as problems are resolved: As life gets better, people think it's getting worse!🧵


gelisam reposted

Technology is generally really good. Why should AI be any different? A new video: (youtube link in the reply)


When I was a teen, my government held a public consultation about switching from 1st-past-the-post to a proportional system. I did my research, went, presented approval voting, and was laughed off the stage. I never did politics again. Today, we're still using 1st-past-the-post.

If you liked this, follow @ElectionScience for more information on a better way! It's called "approval voting", and it's so simple: you can vote for multiple candidates, and the candidate with the most votes wins. While not perfect, I think it's better than ranked-choice



I like to learn about neural networks by working on tiny problems for which there exists a 100% correct solution. Here is an interactive experiment showing how in practice, backprop doesn't find this solution: gelisam.com/local-minima/


My first MCP server, which allows the agent to pick from a selection of shell commands. GitHub Copilot can natively run shell commands, but VS Code asks you to confirm each command. With mcp-cli, you only have to authorize the use of the tool once! github.com/gelisam/mcp-cli


gelisam reposted

New video, about how to work in technical AI Safety research! (link in reply)


This is the Frog Fractions of PuzzleScript! 🐸⅜ 🤯

Jack Lance's masterwork



gelisam reposted

I created a programming language prototype that harnesses bidirectional type inference to infer JSON schemas for LLM prompt chains that use structured outputs haskellforall.com/2025/05/prompt…


Let's all write a short story with the same prompt! Here is my attempt. gelisam.blogspot.com/2025/03/metafi…

I said the AI writing was shit; somebody challenged me to do better based on the same prompt; and so you know what, fine. CW: grief, suicide. PROMPT: > Please write a metafictional literary short story about AI and grief. It is like a dream, always, everything half-real. I…



I recommend today's haskle.net puzzle. It looks impossible, but in retrospect, it's blindingly obvious!
