
Geosh

@Geoshh

Embodied A.I. | Socioaffective Alignment | Systems Biology & Interpersonal Neurobiology | @UChicago | @EuroGradSchool | healing, science, technology, connection

Pinned

Gonna try to pin a few favorite posts that linger in mind over time:

Amusing how 99% of people using their own brains forget how it works: The brain is an advanced probability machine. It keeps predicting the next most likely thought, word, or action based on incoming signals and past learning. Under the hood, billions of neurons are doing…
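As a toy illustration of the "predicting the next most likely word" framing (my sketch, not from the post): a bigram model that learns next-word probabilities from past text and uses them on the current word.

```python
# Toy next-word predictor in the spirit of the "probability machine" framing:
# count bigrams from past text ("past learning") and return a probability
# distribution over the next word given the current one ("incoming signal").
# A cartoon of both brains and LLMs, not a claim about either.
from collections import Counter, defaultdict

corpus = "the brain predicts the next word the brain predicts the next action".split()
bigrams = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def predict_next(word):
    counts = bigrams[word]
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

print(predict_next("the"))  # {'brain': 0.5, 'next': 0.5}
```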



Geosh reposted

This is a fantastic article explaining why you should be paying attention to the emergence of hybrid models and why they are likely to replace self-attention-based models (hint: much faster inference with a much lower memory footprint). pytorch.org/blog/hybrid-mo… This is from the vLLM folks.
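A back-of-the-envelope sketch of the memory argument (mine, not from the article): a self-attention layer's KV cache grows with sequence length, while a recurrent/SSM-style layer carries a fixed-size state. The layer counts, head counts, and state size below are illustrative assumptions, not numbers from any specific model.

```python
# Compare inference memory: KV cache (grows with sequence length) vs. a
# fixed-size recurrent state (independent of sequence length).
def kv_cache_bytes(seq_len, n_layers=32, n_kv_heads=8, head_dim=128, bytes_per_elem=2):
    return 2 * seq_len * n_layers * n_kv_heads * head_dim * bytes_per_elem  # 2x for K and V

def recurrent_state_bytes(n_layers=32, state_elems=8 * 128 * 128, bytes_per_elem=2):
    return n_layers * state_elems * bytes_per_elem  # no dependence on seq_len

for seq_len in (4_096, 131_072):
    print(f"{seq_len:>7} tokens: "
          f"KV cache ~{kv_cache_bytes(seq_len) / 1e9:.1f} GB vs "
          f"recurrent state ~{recurrent_state_bytes() / 1e9:.3f} GB")
```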


Geosh reposted

Lovely work by @gersonkroiz and Greg Kocher: Previously, when we studied models being aware they're being evaluated, they said it in the CoT, but this might be easier to fix. They got it working silently and showed our methods still stop it! Done as a 4-day practice project!

Excited to publish some cool findings we found during our 4 day mini project in Neel Nanda's MATS Exploration stream. We studied whether LLMs can be evaluation aware without explicit verbalization. Read more here lesswrong.com/posts/W6ZFnhee…



Geosh reposted

TIL: Claude Code local sandbox environment is open-source. > native OS sandboxing primitives (sandbox-exec on macOS, bubblewrap on Linux) and proxy-based network filtering. It can be used to sandbox the behaviour of agents, local MCP servers, bash commands and arbitrary…

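For a concrete feel of the Linux primitive named here, a minimal bubblewrap wrapper sketch (my illustration, not the Claude Code sandbox implementation): read-only binds for system directories, a throwaway /tmp, and networking disabled. The flags are standard bwrap options, but the actual policy Claude Code ships lives in their repo.

```python
# Illustrative only: run a command under bubblewrap with read-only system dirs,
# a tmpfs scratch space, and networking disabled. Adjust the bind paths for
# your distro; this is not the Claude Code sandbox itself.
import subprocess

def run_sandboxed(cmd):
    bwrap = [
        "bwrap",
        "--ro-bind", "/usr", "/usr",   # read-only system directories
        "--ro-bind", "/bin", "/bin",
        "--ro-bind", "/lib", "/lib",
        "--proc", "/proc",
        "--dev", "/dev",
        "--tmpfs", "/tmp",             # scratch space that vanishes afterwards
        "--unshare-net",               # no network inside the sandbox
        "--die-with-parent",
    ]
    return subprocess.run(bwrap + cmd, capture_output=True, text=True)

print(run_sandboxed(["ls", "/usr/bin"]).stdout[:200])
```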

Geosh reposted

Your heart starts with a single cell that learns to beat. What you’re seeing: This video shows a stem cell transforming into a heart muscle cell, or cardiomyocyte. As it develops, the cell organizes its internal scaffolding, forms contractile fibers called sarcomeres, and begins…


woah....note to self, cf

🚨🇺🇸 SEVEN MORE FAMILIES SUE OPENAI OVER CHATGPT SUICIDE CASES

Seven families have filed new lawsuits against OpenAI, claiming the company rushed its GPT-4o model to market without proper safety testing. Four cases involve suicides allegedly linked to the chatbot’s responses,…



Geosh reposted

This is one of the most exciting agentic AI results I have seen! An AI agent (through several rounds of reasoning and experimentation) discovers distributed systems algorithms (e.g., GPU load balancing) that perform on par with those designed by world-renowned human experts in…

We built a Systems Researcher AI agent! Glia discovers novel distributed systems algorithms matching PhD-level experts in creativity & performance. We ran it on various networked systems problems and obtained publication-worthy results on each! Let me tell you how we did it 🧵



Geosh reposted

I've been using Kimi K2 for my mother's osteoporosis treatment to double-check her test results, bone density reports, and the feedback from her doctor. Honestly, it’s been surprisingly impressive. Compared to ChatGPT or Gemini, Kimi K2 gives far more detailed and accurate…

how can i short OpenAI?



Geosh reposted

if you want the tweet version and not the 10min video version: this is now all it takes to train with prime-rl after installing verifiers


verifiers v0.1.7 is released 🚀 this one's all about making RL training and experimentation waaaay easier:
- single-command installation for prime-rl
- single-command training w/ unified configs
- overhauled vf.RLTrainer for hacking on new algorithms
quick demo + links below :)



Geosh reposted

Kimi-k2-thinking is incredible. So I built an agent to test it out, Kimi-writer. It can generate a full novel from one prompt, running up to 300 tool requests per session. Here it is creating an entire book, a collection of 15 short sci-fi stories.


Geosh reposted

An exciting new approach to continual learning, using nested optimization to enhance long-context processing.

Introducing Nested Learning: A new ML paradigm for continual learning that views models as nested optimization problems to enhance long context processing. Our proof-of-concept model, Hope, shows improved performance in language modeling. Learn more: goo.gle/47LJrzI

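To make "models as nested optimization problems" concrete in the generic sense (my toy sketch of the two-level pattern, not the Hope architecture or the paper's method): an inner loop rapidly adapts fast parameters to the current context, and an outer loop slowly updates the shared parameters using the post-adaptation loss.

```python
# Toy nested optimization: fast weights adapt per context (inner loop),
# slow weights consolidate across contexts (outer loop). First-order only.
import numpy as np

rng = np.random.default_rng(0)
slow = rng.normal(size=4)                      # shared, slowly-updated parameters

def grad(slow, fast, x, y):
    # gradient of mean squared error w.r.t. (slow + fast); identical for both
    # parameter sets because they enter the prediction additively
    pred = x @ (slow + fast)
    return 2 * x.T @ (pred - y) / len(y)

for outer_step in range(100):
    x = rng.normal(size=(16, 4))
    y = x @ np.ones(4)                         # a fresh "context" to adapt to
    fast = np.zeros(4)
    for _ in range(3):                         # inner loop: rapid adaptation
        fast -= 0.1 * grad(slow, fast, x, y)
    slow -= 0.05 * grad(slow, fast, x, y)      # outer loop: slow consolidation

print("distance to target weights:", np.linalg.norm(slow - np.ones(4)))
```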


Geosh reposted

hey yo folks, if you liked the evolutionary-strategies-for-finetuning chat we had with Yulu, check out the new version of the code they released, which is 10X faster. def a fine-tuning method to tinker with


Our recent ES fine-tuning paper (arxiv.org/pdf/2509.24372) received lots of attention from the community (Thanks to all!). To speed up the research in this new direction, we developed an accelerated implementation with 10X speed-up in total running time, by refactoring the…

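For anyone new to evolution strategies, a minimal numpy sketch of the basic update the paper builds on (my toy, not the authors' accelerated implementation): perturb the parameters with Gaussian noise, score each perturbation, and step along the reward-weighted average of the noise.

```python
# Minimal evolution strategies (ES): no backprop, just noisy evaluations.
import numpy as np

rng = np.random.default_rng(0)
theta = rng.normal(size=10)                    # stand-in "model parameters"
target = np.ones(10)
reward = lambda p: -np.sum((p - target) ** 2)  # toy objective to maximize

sigma, lr, pop = 0.1, 0.05, 32
for step in range(300):
    eps = rng.normal(size=(pop, theta.size))
    rewards = np.array([reward(theta + sigma * e) for e in eps])
    adv = (rewards - rewards.mean()) / (rewards.std() + 1e-8)   # normalize scores
    theta += lr / (pop * sigma) * eps.T @ adv                   # ES gradient estimate

print("final reward:", reward(theta))   # approaches 0 as theta -> target
```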


Geosh reposted

Your every thought, memory, and movement begins here: an electric spark turned chemical message, as one neuron whispers to the next. What you’re seeing: This animation captures neurotransmission, the process that allows nerve cells to communicate across microscopic gaps called…


Geosh reposted

every single person i know who made a cool video demo of a project that shows agency & great technical ability has been reached out to by top labs & top companies. you never need to compete with millions of other people for your top pick role


Geosh reposted

When a robot is forced to prove “I’m really a robot.” XPeng’s humanoid robot IRON seems to have crossed the uncanny valley — its body shape and movements look almost identical to a human’s. So XPeng had to cut open one of its legs to reveal the hardware inside, just to prove…

At yesterday’s Tech Day, XPeng’s female humanoid robot, Iron, made a catwalk-style entrance that stunned the audience. But soon after, people online started asking — “Is there a real person inside?” Today, XPeng CEO He Xiaopeng personally stepped in to respond to those doubts.



Geosh reposted

Exciting to see these results aligning with our recent work showing that memorization happens during the entropy-seeking phase of LLM pretraining, where information is added to the bottom directions of the representation space! 🧵: x.com/kumarkagrawal/…


We project activations of the two sets onto the eigenvectors of A. There’s a very large and clear disentanglement across the eigenspectrum: clean data interacts with top directions and memorized with the bottom directions in both LMs and ViTs

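A schematic of the projection step described above (my stand-in, with a random PSD matrix in place of their A and random activations in place of real ones): eigendecompose A, project activations onto the eigenvectors, and compare how much energy falls in the top vs. bottom directions.

```python
# Project activations onto eigenvectors of a symmetric matrix and compare
# energy in top vs. bottom eigendirections (the quantity discussed above).
import numpy as np

rng = np.random.default_rng(0)
d = 64
M = rng.normal(size=(d, d))
A = M @ M.T                                   # stand-in for their matrix A
acts = rng.normal(size=(256, d))              # stand-in activations

eigvals, eigvecs = np.linalg.eigh(A)          # columns sorted by ascending eigenvalue
proj = acts @ eigvecs                         # coordinates along each eigendirection
energy = (proj ** 2).mean(axis=0)

print("bottom-16 energy:", energy[:16].sum())
print("top-16 energy:   ", energy[-16:].sum())
```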


Geosh reposted

Haha! To prove she’s 100% robot, XPeng had to literally cut open one of IRON’s legs so everyone could see the actuators inside 😂


Geosh reposted

I'm a bit giddy over the fact that this is by all visible measures a frontier-level model, if not THE frontier model, for agentic tasks. And you can run it. In its native precision. On 2 M3 Ultras. Pretty fast. In MLX.

The new 1-trillion-parameter Kimi K2 Thinking model runs well on 2 M3 Ultras in its native format - no loss in quality! The model was quantization-aware trained (QAT) at int4. Here it generated ~3500 tokens at 15 toks/sec using pipeline parallelism in mlx-lm:
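For anyone wanting to try the single-machine version first, a minimal mlx-lm sketch (the two-M3-Ultra pipeline-parallel setup from the post needs MLX's distributed launcher and isn't shown here; the model repo id below is an assumption, so substitute whichever MLX-format conversion you actually have):

```python
# Minimal mlx-lm generation on a single Apple-silicon machine (pip install mlx-lm).
from mlx_lm import load, generate

# NOTE: repo id is illustrative; swap in the MLX-format model you have locally.
model, tokenizer = load("mlx-community/Kimi-K2-Thinking-4bit")

messages = [{"role": "user", "content": "Explain pipeline parallelism in two sentences."}]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

text = generate(model, tokenizer, prompt=prompt, max_tokens=256, verbose=True)
```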



Love this explanation

By decomposing weights using loss curvature, you can identify components used for memorization vs generalization. High-curvature = shared mechanisms used across data. Low-curvature = idiosyncratic directions for memorized examples. You can then ablate the memorization weights!
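To pin down the linear algebra being described (a toy skeleton only; the paper's actual decomposition of network weights is more involved than this): eigendecompose a loss Hessian, treat the flattest directions as memorization-like, and ablate the weight components lying in that subspace.

```python
# Toy curvature-based split: remove weight components along low-curvature
# (small-eigenvalue) directions of a stand-in loss Hessian.
import numpy as np

rng = np.random.default_rng(0)
d = 32
M = rng.normal(size=(d, d))
H = M @ M.T                                   # stand-in PSD "loss Hessian"
w = rng.normal(size=d)                        # stand-in trained weights

eigvals, eigvecs = np.linalg.eigh(H)          # ascending eigenvalues
k = 8                                         # treat the k flattest directions as memorization-like
low_curv = eigvecs[:, :k]
w_ablated = w - low_curv @ (low_curv.T @ w)   # zero the low-curvature components

removed = np.linalg.norm(w - w_ablated) / np.linalg.norm(w)
print(f"fraction of weight norm removed: {removed:.2f}")
```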


