vishal

@vishal_learner

Data Science/ML. http://ColBERT.ai Maintainer. #FlyEaglesFly

Online

youtube.com/@vishal_learner

เข้าร่วมเมื่อ สิงหาคม 2023

4พันโพสต์ 891ผู้ติดตาม 97กําลังติดตาม

ปักหมุด

vishal

@vishal_learner

19 ก.ย.

In 285 days, my current role as a data analyst will come to an end. I’m excited to begin my professional machine learning journey. Just published a blog post on what I’ve been building, learning, and looking for next (link below).

vishal รีโพสต์แล้ว

Hamel Husain

@HamelHusain

7 พ.ย.

👀 Animals have been assigned. Scheduled to print fall 2026! We have iterated on this with over 3k students (and continue to do so). We give our students access to the full draft as part of our evals course (link in bio).

HamelHusain's tweet image. 👀 Animals have been assigned.

Scheduled to print fall 2026!

We have iterated on this with over 3k students (and continue to do so). We give our students access to the full draft as part of our evals course (link in bio).

vishal

@vishal_learner

24 ต.ค.

why did i not use GitHub Desktop more before. it's so much easier to wrap my head around branches and changes

vishal รีโพสต์แล้ว

Maxime Rivest 🧙‍♂️🦙🐧

@MaximeRivest

19 ต.ค.

almost everything is in distribution if broken in small enough steps and done in the right order.

vishal รีโพสต์แล้ว

Sara Hooker

@sarahookr

18 ต.ค.

Early next year I would love to support a symposium on the future of AI interfaces, design and adaptive intelligence. If you are a designer, UX/AI researcher, or just have opinions and want to contribute would love to have you involved. We will keep it small + invite only.

vishal รีโพสต์แล้ว

Omar Khattab

@lateinteraction

18 ต.ค.

Humans learn most characteristically from reflection.

Dimitris Papailiopoulos

@DimitrisPapail

18 ต.ค.

One of the most though provoking points Karpathy made during the Dwarkesh podcast is that humans learn mostly from “synthetic” self generated data, ie replaying, distilling, processing recent experience.

vishal

@vishal_learner

18 ต.ค.

pre order my new book "the art and science of being friendly enough with your neighbors to have pleasant interactions when running into them but not so friendly that it makes it awkward and creates an unsustainable relational expectation"

vishal

@vishal_learner

18 ต.ค.

i hope LLMs don't overly start using e.g. and i.e. because that's fundamental to how i communicate and will never change

vishal รีโพสต์แล้ว

Rikiya Takehi

@rikiyatakehi

16 ต.ค.

Releasing small ColBERT models as my first project @mixedbreadai!!🍞 Even the 17M model easily beats the longembed leader:) The tech report includes many easy wins for training embedding models and ColBERT models from scratch🗒️

Mixedbread

@mixedbreadai

16 ต.ค.

One More (Small) Thing: Introducing mxbai-colbert-edge-v0 17M and 32M. They are are the result of an easily reproducible way to train ColBERT models from scratch. They're strong, too: the 17M variant would rank first on the LongEmbed leaderboard for models under 1B parameters.

mixedbreadai's tweet image. One More (Small) Thing: Introducing mxbai-colbert-edge-v0 17M and 32M.

They are are the result of an easily reproducible way to train ColBERT models from scratch.

They're strong, too: the 17M variant would rank first on the LongEmbed leaderboard for models under 1B parameters.

vishal รีโพสต์แล้ว

Antoine Chaffin

@antoine_chaffin

15 ต.ค.

We released a new version of PyLate, which mostly bump the version to ST 5.X and have some fixes for LLM base models/models with multiple dense layers This release focuses on making all models compatible... 😇

antoine_chaffin's tweet image. We released a new version of PyLate, which mostly bump the version to ST 5.X and have some fixes for LLM base models/models with multiple dense layers
This release focuses on making all models compatible... 😇

vishal รีโพสต์แล้ว

Raphaël Sourty

@raphaelsrty

15 ต.ค.

PyLate is getting better and better

Antoine Chaffin

@antoine_chaffin

15 ต.ค.

vishal รีโพสต์แล้ว

vishal

@vishal_learner

14 ต.ค.

The next colbert-ai release (in 2-3 weeks, after I finish PyTorch 2.x upgrade analysis) will include a bugfix for single-node multi-GPU training (sample division across GPUs) which improves loss/pos score/neg score during training. Thanks to our ColBERT community!

vishal_learner's tweet image. The next colbert-ai release (in 2-3 weeks, after I finish PyTorch 2.x upgrade analysis) will include a bugfix for single-node multi-GPU training (sample division across GPUs) which improves loss/pos score/neg score during training. Thanks to our ColBERT community!

vishal รีโพสต์แล้ว

Sara Hooker

@sarahookr

13 ต.ค.

I'm hiring an operations lead 🔥 If you like building things 0-1, and imagining new worlds -- join us 🌍 adaptionlabs.ai

vishal รีโพสต์แล้ว

Marc Brooker

@MarcJBrooker

12 ต.ค.

Barbarians at the Gate is a very interesting new paper, with some exciting results showing the potential for AI in systems research. But I think the authors aren't quite asking the hardest problem about where this takes systems as a field. I wrote a new blog post about it.

AI-Driven Research Systems

@ai4research_ucb

9 ต.ค.

🚀 Excited to release our new paper: “Barbarians at the Gate: How AI is Upending Systems Research” We show how AI-Driven Research for Systems (ADRS) can rediscover or outperform human-designed algorithms across cloud scheduling, MoE expert load balancing, LLM-SQL optimization,…

ai4research_ucb's tweet image. 🚀 Excited to release our new paper: “Barbarians at the Gate: How AI is Upending Systems Research”

We show how AI-Driven Research for Systems (ADRS) can rediscover or outperform human-designed algorithms across cloud scheduling, MoE expert load balancing, LLM-SQL optimization,…

vishal

@vishal_learner

11 ต.ค.

heiner

@HeinrichKuttler

11 ต.ค.

You can do something like fd = os.memfd_create(name) os.ftruncate(fd, size) and then either share fd with your child process e.g. via subprocess.Popen(pass_fds=) or you mmap it which multiprocessing can deserialize to the same region. The kernel refcounts the fd like a file.

vishal รีโพสต์แล้ว

Jeremy Howard

@jeremyphoward

11 ต.ค.

til about memfd for sharing across python processes

heiner

@HeinrichKuttler

11 ต.ค.

vishal รีโพสต์แล้ว

Pavel Surmenok

@surmenok

11 ต.ค.

The GIL forced Python engineers into multiprocessing, adding overhead and complexity: slower startup, increased memory use, context switching, and serialization/deserialization headaches (or shared memory complexity). With PyTorch 3.14 finally removing the GIL, multithreading is…