praymesh's profile picture. dating world models, alongside developing novel methods for Visual Place Recognition and Geolocalization(side chicks)

Pranay Meshram

@praymesh

dating world models, alongside developing novel methods for Visual Place Recognition and Geolocalization(side chicks)

This is exciting. World modeling is getting betterrrrr block by block.

Excited to introduce Dreamer 4, an agent that learns to solve complex control tasks entirely inside of its scalable world model! 🌎🤖 Dreamer 4 pushes the frontier of world model accuracy, speed, and learning complex tasks from offline datasets. co-led with @wilson1yan



Pranay Meshram reposted

New from Meta FAIR: Code World Model (CWM), a 32B-parameter research model designed to explore how world models can transform code generation and reasoning about code. We believe in advancing research in world modeling and are sharing CWM under a research license to help empower…


That's awesome

New @AIatMeta builds a vision language world model that turns videos into text plans and reasons to pick better actions. 27% higher Elo for system-2 planning over system-1. The gap it tackles, agents must predict how actions change the world rather than only label frames.…

rohanpaul_ai's tweet image. New @AIatMeta  builds a vision language world model that turns videos into text plans and reasons to pick better actions. 

27% higher Elo for system-2 planning over system-1.

The gap it tackles, agents must predict how actions change the world rather than only label frames.…


Pranay Meshram reposted

Can we build an operating system entirely powered by neural networks? Introducing NeuralOS: towards a generative OS that directly predicts screen images from user inputs. Try it live: neural-os.com Paper: huggingface.co/papers/2507.08… Inspired by @karpathy's vision. 1/5

"Chatting" with LLM feels like using an 80s computer terminal. The GUI hasn't been invented, yet but imo some properties of it can start to be predicted. 1 it will be visual (like GUIs of the past) because vision (pictures, charts, animations, not so much reading) is the 10-lane…

karpathy's tweet image. "Chatting" with LLM feels like using an 80s computer terminal. The GUI hasn't been invented, yet but imo some properties of it can start to be predicted.

1 it will be visual (like GUIs of the past) because vision (pictures, charts, animations, not so much reading) is the 10-lane…


Lesssgo, I got some crazy teammates @anm5704 @Summer_1932005🔥

👀World Models feel almost like magic. Finally starting to see some hope after countless hours of debugging with @praymesh and @Summer_1932005 . Also @wayfarerlabs has an amazing open source ecosystem for training world models on any game that you can think of.



United States Trends

Loading...

Something went wrong.


Something went wrong.