Ege Onur Güleç
@EgeOnurGulec
Boğaziçi University Economics Ideas worth sharing
Bạn có thể thích
Now you can share your mrr directly from stripe let’s see the real mrrs of companies :D
Ilya says ages of scaling is over and research is back. We need to get ready for new ideas to enhance the AI instead of just compute according to Ilya .
The @ilyasut episode 0:00:00 – Explaining model jaggedness 0:09:39 - Emotions and value functions 0:18:49 – What are we scaling? 0:25:13 – Why humans generalize better than models 0:35:45 – Straight-shotting superintelligence 0:46:47 – SSI’s model will learn from deployment…
Seems like a pretty good model for coding still expensive but much cheaper to its predecessor Opus 4.1. Claude’s almost all focus on currently on coding hence let’s see how it performs on real life tasks .
After the success of training models specifically for a given task, like GPT Codex for coding and Sonnet with its strong focus on coding, there should be more domain specific models like GPT Finance or GPT Math where they excel at a specific task.
Instead of clearly defined problem datasets , these kind of real world engineering benchmarks should be the future for evaluating quality of LLMs. Real world is messy and LLMs should be able to operate in that mess.
We are announcing cline-bench, a real world open source benchmark for agentic coding. cline-bench is built from real world engineering tasks from participating developers where frontier models failed and humans had to step in. Each accepted task becomes a fully reproducible…
This might be biggest bottleneck for training models. Having an efficient data pipeline to train is the biggest moat a company can have.
AI Models are valuable, but datasets and evals to train AI models are more valuable. Datasets are valuable, but automated data pipelines that generate the datasets are more valuable. *** Model < data < pipeline *** At least until the models start building pipelines. Still far…
Big transfer for Thinking Machines. They transferred creator of PyTorch. I hope they also contribute more to open source .
thinking machines....the people are incredible
United States Xu hướng
- 1. Ravens 56K posts
- 2. Lamar 44.8K posts
- 3. Joe Burrow 19.7K posts
- 4. #heatedrivalry 7,744 posts
- 5. Zay Flowers 4,010 posts
- 6. #WhoDey 3,498 posts
- 7. Cowboys 91.1K posts
- 8. ilya 10.8K posts
- 9. Derrick Henry 4,378 posts
- 10. Perine 1,555 posts
- 11. Zac Taylor 2,599 posts
- 12. AFC North 2,263 posts
- 13. Harbaugh 3,008 posts
- 14. #CINvsBAL 2,646 posts
- 15. Sarah Beckstrom 203K posts
- 16. Mahomes 33.5K posts
- 17. Tanner Hudson 1,272 posts
- 18. Myles Murphy N/A
- 19. Boozer 5,360 posts
- 20. Tinsley 1,604 posts
Something went wrong.
Something went wrong.