OpenTrain AI
@OpenTrainAI
The Data Labeling Marketplace | Where AI Builders and AI Trainers Connect to Build the Future | Find, hire, & securely pay data labelers for any annotation tool
At this point I feel like we understand pretty well what's going on with LLMs: - Outputs are roughly equivalent to kernel smoothing over positional embeddings (arxiv.org/pdf/1908.11775…) - The learned computation model is *probably* bounded by RASP-L (arxiv.org/pdf/2310.16028…) -…
The perfect quote to describe LLMs can be found in a 1946 Jean Cocteau movie -- "Réfléchissez pour moi, je réfléchirai pour vous" (think for me, I will reflect for you). What you get from the model is always a reflection of the training data you put in -- itself a by-product of…
Grok-1 by @xai utilized "AI Tutors", or human subject matter experts to create custom training data & provide RLHF. We're making this easy for anybody to do this with OpenTrain.ai: The data labeling marketplace to find, hire, & pay training data experts for ANY data…
🔥Excited to introduce LMSYS-Chat-1M, a large-scale dataset of 1M real-world conversations with 25 cutting-edge LLMs! This dataset, collected from chat.lmsys.org, offers insights into user interactions with LLMs and intriguing use cases. Link: huggingface.co/datasets/lmsys…
An in-depth look at RLHF by @natolambert from @huggingface. The need for high-quality, task-specific data in RLHF is crucial. With OpenTrainAI, you can find, hire, & pay the human experts essential for responsible and effective RLHF. Post your job today! #RLHF #MachineLearning
Reinforcement Learning from Human Feedback (RLHF) is gaining traction. This field aims to make AI more responsible by including human values and preferences. In this video, @natolambert, a research scientist and RLHF team lead at @huggingface explores its inner workings,…
👀
When StackOverflow is fully dead (due to long congenital illness, self-inflicted wounds, and the finishing blow from AI), where will AI labs get their training data? They can just buy it! Assuming 10k quality answers per week, at $250/answer, that's just $130M/yr. Even at…
👀
As discussed on @stratechery this morning, "experts are the new GPUs" and spending on data for training and fine tuning is in vertical takeoff.
This is the way to unlock the next trillion high-quality tokens, currently frozen in textbook pixels that are not LLM-ready. Nougat: an open-source OCR model that accurately scans books with heavy math/scientific notations. It's ages ahead of other open OCR options. Meta is…
United States Trends
- 1. Merry Christmas Eve 62.2K posts
- 2. Spurs 53.6K posts
- 3. thalia 4,354 posts
- 4. #Pluribus 23.7K posts
- 5. Rockets 25.2K posts
- 6. hudson 154K posts
- 7. Chet 11.1K posts
- 8. UNLV 2,680 posts
- 9. Cooper Flagg 13.4K posts
- 10. Rosetta Stone N/A
- 11. Skol 1,772 posts
- 12. connor 159K posts
- 13. Yellow 58.8K posts
- 14. Zosia 6,435 posts
- 15. #PorVida 1,887 posts
- 16. Kawhi Leonard 1,396 posts
- 17. Mavs 6,528 posts
- 18. #VegasBorn N/A
- 19. Colbert 11.5K posts
- 20. #ClipperNation N/A
Something went wrong.
Something went wrong.