Nathan Barry

@nathanbarrydev

Man in the arena allocator, prev ML Intern @Apple, CS + Math @UTAustin, @zfellows

Austin, TX

nathan.rs

انضم في يونيو 2020

281المنشورات 707المتابعون 334المتابَعون

قد يعجبك

@0xishand

@TheAustinIPGuy

@HongpengJin

@BhaskerSriHarsh

@omgwtfitsp

@liepa_edgars

@ManTheForce

@navamishra

@AI_ML_iQ

@__alesca__

@peterkchung

@MadysnWatkins

مثبتة

Nathan Barry

@nathanbarrydev

٢ مايوم

I implemented gpt-2 using graphics shaders

Nathan Barry أعاد

tyson brody

@tysonbrody

١٠ أكتوبرم

oh hell yeah

Nathan Barry أعاد

vibecoded a visualizer for latent representations of tokens as they flow through gpt-2 (runs entirely in browser on webgl) visualizing high dim space is hard so rather than just PCAing 768 -> 2, it renders similarity as graph connectedness at an adjustable cosine threshold

Nathan Barry

@nathanbarrydev

٦ أغسطسم

xAI should fork VS Code and drop a cursor competitor. They could name it Xcode

Nathan Barry

@nathanbarrydev

١٤ يونيوم

Neighbor took Zeno’s paradox too literally

Nathan Barry

@nathanbarrydev

١٤ يونيوم

I’m surprised that no app uses face verification (like what dating apps have) to combat bots. Deleting an account is terrible for false positives, but having to do face verification or be locked out of your account is just an inconvenience. Would solve most of the issue imo

Nathan Barry

@nathanbarrydev

١٣ يونيوم

My experiments with fine-tuning RoBERTa to do language diffusion are at an end. Surprisingly cohesive with such a minimum implementation but not as good as gpt-2. A more thorough implementation (and better training) should be able to reach parity on quality and speed though.

Nathan Barry

@nathanbarrydev

٥ يونيوم

Just read how Fourier transforms “work” because sines and cosines form an orthogonal basis for a specific Hilbert space of functions. Math is beautiful but it always feels like a bottomless pit of knowledge where there’s always an infinite amount of things you don’t know

Nathan Barry

@nathanbarrydev

٢ يونيوم

I don’t know why LLM companies don’t watermark their output by using rare UTF-8 code points for similar looking characters. If they just replaced all U+002D: HYPHEN-MINUS with U+2010: HYPHEN, basically no one would notice but it’d be obvious to software that it’s generated output

Nathan Barry

@nathanbarrydev

٢ يونيوم

I was not put on this earth to use python libraries all day

Nathan Barry

@nathanbarrydev

٢ يونيوم

In the early days of X/PayPal, fraud was their biggest problem and they built out tools for anomaly detection which eventually led to Palantir. It’s ironic that 20 years later, the reincarnation of X (via twitter) does such a bad job of anomaly (bot account) detection

Nathan Barry

@nathanbarrydev

٣١ مايوم

Increased the number of diffusion steps for my RoBERTa Diffusion model and it’s wild how surprisingly good this is. Will fine-tune it on OpenWebText and compare it to GPT-2 later

Nathan Barry

@nathanbarrydev

٣١ مايوم

I’m am surprised at the amount of coherency I’ve gotten by trying to fine-tune RoBERTa into a language diffusion model. Pretty decent for a 6 year-old model with only 125 million parameters

Nathan Barry أعاد

hardmaru

@hardmaru

٣٠ مايوم

New Paper! Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents A longstanding goal of AI research has been the creation of AI that can learn indefinitely. One path toward that goal is an AI that improves itself by rewriting its own code, including any code…

Nathan Barry

@nathanbarrydev

٣١ مايوم

Day 1 of messing with masked language diffusion models 🫠🫠

Nathan Barry أعاد

Kasey Zhang

@_WEEXIAO

٢٩ مايوم

Don't use structured output mode for reasoning tasks. We’re open sourcing Osmosis-Structure-0.6B: an extremely small model that can turn any unstructured data into any format (e.g. JSON schema). Use it with any model - download and blog below!

Nathan Barry

@nathanbarrydev

٢٩ مايوم

When I started using Arch Linux years ago, any time something would randomly break I’d have to spend at least an hour sifting through forums to find a solution. Now ChatGPT can diagnose and fix it in a few seconds and Pewdiepie uses Arch