OpenTrain AI

@OpenTrainAI

The Data Labeling Marketplace | Where AI Builders and AI Trainers Connect to Build the Future | Find, hire, & securely pay data labelers for any annotation tool

Technology Company

Seattle, Washington

opentrain.ai

Joined September 2023

23Posts 94Followers 105Following

OpenTrain AI reposted

martin_casado

@martin_casado

Apr 21, 2024

At this point I feel like we understand pretty well what's going on with LLMs: - Outputs are roughly equivalent to kernel smoothing over positional embeddings (arxiv.org/pdf/1908.11775…) - The learned computation model is *probably* bounded by RASP-L (arxiv.org/pdf/2310.16028…) -…

OpenTrain AI reposted

François Chollet

@fchollet

Feb 28, 2024

The perfect quote to describe LLMs can be found in a 1946 Jean Cocteau movie -- "Réfléchissez pour moi, je réfléchirai pour vous" (think for me, I will reflect for you). What you get from the model is always a reflection of the training data you put in -- itself a by-product of…

OpenTrain AI reposted

Santiago

@svpino

Nov 11, 2023

OpenTrain AI

@OpenTrainAI

Nov 6, 2023

Grok-1 by @xai utilized "AI Tutors", or human subject matter experts to create custom training data & provide RLHF. We're making this easy for anybody to do this with OpenTrain.ai: The data labeling marketplace to find, hire, & pay training data experts for ANY data…

OpenTrainAI's tweet image. Grok-1 by @xai utilized "AI Tutors", or human subject matter experts to create custom training data &amp; provide RLHF.

We're making this easy for anybody to do this with OpenTrain.ai: The data labeling marketplace to find, hire, &amp; pay training data experts for ANY data…

OpenTrain AI reposted

lmarena.ai

@arena

Sep 22, 2023

🔥Excited to introduce LMSYS-Chat-1M, a large-scale dataset of 1M real-world conversations with 25 cutting-edge LLMs! This dataset, collected from chat.lmsys.org, offers insights into user interactions with LLMs and intriguing use cases. Link: huggingface.co/datasets/lmsys…

lmsys/lmsys-chat-1m · Datasets at Hugging Face

Source: huggingface.co

OpenTrain AI

@OpenTrainAI

Sep 21, 2023

An in-depth look at RLHF by @natolambert from @huggingface. The need for high-quality, task-specific data in RLHF is crucial. With OpenTrainAI, you can find, hire, & pay the human experts essential for responsible and effective RLHF. Post your job today! #RLHF #MachineLearning

Muratcan Koylan

@koylanai

Sep 19, 2023

Reinforcement Learning from Human Feedback (RLHF) is gaining traction. This field aims to make AI more responsible by including human values and preferences. In this video, @natolambert, a research scientist and RLHF team lead at @huggingface explores its inner workings,…

OpenTrain AI

@OpenTrainAI

Sep 18, 2023

👀

Nat Friedman

@natfriedman

Jul 26, 2023

When StackOverflow is fully dead (due to long congenital illness, self-inflicted wounds, and the finishing blow from AI), where will AI labs get their training data? They can just buy it! Assuming 10k quality answers per week, at $250/answer, that's just $130M/yr. Even at…

OpenTrain AI

@OpenTrainAI

Sep 18, 2023

👀

Nat Friedman

@natfriedman

Aug 24, 2023

As discussed on @stratechery this morning, "experts are the new GPUs" and spending on data for training and fine tuning is in vertical takeoff.

OpenTrain AI reposted

Jim Fan

@DrJimFan

Sep 14, 2023

This is the way to unlock the next trillion high-quality tokens, currently frozen in textbook pixels that are not LLM-ready. Nougat: an open-source OCR model that accurately scans books with heavy math/scientific notations. It's ages ahead of other open OCR options. Meta is…