CS @LifeAtPurdue
AI Engineer, Entrepreneur. Aspiring to make Jarvis a real thing!

Peter Bui

@peterbuiCS

Pinned

After months of work, I’ve open-sourced Nova Voice — a fully local, real-time speech-to-text and translation system built on Faster-Whisper. This is my first non-toy project — it’s a full distributed pipeline built for scalability and low-latency: • Real-time STT & translation…


Life is always good when you see this.

I am thinking of an audio-to-audio model.


Do you think our world is going to be like the movie "Her"? Obviously the current LLM is not going to help. But, could it?


Feels like the East is doing a lot more stuff nowadays.

October 27, 2025 at 11am PT will break my heart (and my wallet).

Maybe vibe-coding shouldn’t be called vibe-coding.

Kanye's Ghost Town is so euphoric


Worth reading

<Rant> I spent 25 years in the defense industry (with 8+ in uniform, 2+ in war zones). I have no love for the CCP, but no matter how I view China's govt, their AI research companies are doing a lot of good and deserve some credit. To anyone that thinks Deepseek is some kind of…



Be ready for an image-input-only model.

BOOOOOOOM! CHINA DEEPSEEK DOES IT AGAIN! An entire encyclopedia compressed into a single, high-resolution image! — A mind-blowing breakthrough. DeepSeek-OCR, unleashed an electrifying 3-billion-parameter vision-language model that obliterates the boundaries between text and…


Starting a new project!!!

Instagram is so stupid that it rots my brain. X is so smart that it hurts my brain. Probably just go to sleep instead ;)


The way I understand it is that they turn a text document into a pixel and eventually multiple documents into an image, thus reducing token input size. The LLM now "sees" the text instead of actually reading it. I don't even know what to think rn :)

🚨 DeepSeek just did something wild. They built an OCR system that compresses long text into vision tokens literally turning paragraphs into pixels. Their model, DeepSeek-OCR, achieves 97% decoding precision at 10× compression and still manages 60% accuracy even at 20×. That…
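
A rough back-of-the-envelope sketch of what those ratios would mean for input size. It assumes "compression" means text tokens divided by vision tokens, which is one reading of the post and not something it states. The function name `vision_tokens_needed` and the 1,000-token document are hypothetical; only the 10×/97% and 20×/60% pairs come from the quoted post.

```python
import math


def vision_tokens_needed(text_tokens: int, compression_ratio: float) -> int:
    """Estimate the vision tokens needed to carry `text_tokens` of text.

    Assumption: compression_ratio = text tokens / vision tokens.
    """
    if text_tokens < 0:
        raise ValueError("text_tokens must be non-negative")
    if compression_ratio <= 0:
        raise ValueError("compression_ratio must be positive")
    # Round up: a partial token still costs a whole token.
    return math.ceil(text_tokens / compression_ratio)


# (ratio, reported precision) pairs as quoted in the post.
reported = [(10, 0.97), (20, 0.60)]

doc_tokens = 1_000  # hypothetical document length
for ratio, precision in reported:
    n = vision_tokens_needed(doc_tokens, ratio)
    print(f"{ratio}x: {doc_tokens} text tokens -> {n} vision tokens "
          f"(reported precision {precision:.0%})")
```

Under that assumption, the trade-off is fewer input tokens in exchange for some loss when the text is decoded back out of the image.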


This might sound so corny, but after months of depression, happiness comes from finally being able to roll your app out to production.


The recursive idea is exactly what I proposed in Recursive Omni. I guess I need more execution to back my ideas with valuable results and metrics.

Meta-agent framework for building high-performance multi-agent systems! ROMA is an open-source meta-agent framework for building agents with hierarchical task execution. It adopts a recursive hierarchical architecture where tasks are decomposed into subtasks, agents handle the…


Peter Bui reposted

Holy shit...Google just built an AI that learns from its own mistakes in real time. New paper dropped on ReasoningBank. The idea is pretty simple but nobody's done it this way before. Instead of just saving chat history or raw logs, it pulls out the actual reasoning patterns,…

Computer-use agents are the future, and this repo is proving it. #1 on GitHub trending.

just woke up to find we're #1 trending on GitHub! what a wild ride - and awesome to see 3 YC teams up there together!


Basic context engineering when vibe-coding: "Read relevant files first before implementing this request: [Your request]".
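
A minimal sketch of that tip as a reusable helper, assuming you send prompts to a coding assistant as plain strings. The function name `with_context_preamble` and the sample request are hypothetical; the preamble text is the one quoted above.

```python
PREAMBLE = "Read relevant files first before implementing this request: "


def with_context_preamble(request: str) -> str:
    """Prefix a coding request with the 'read relevant files first' instruction."""
    request = request.strip()
    if not request:
        raise ValueError("request must not be empty")
    return PREAMBLE + request


# Hypothetical request, just to show the resulting prompt.
print(with_context_preamble("Add a logout button to the settings page"))
```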

