Viraj Mehta
@thebigmehtaphor
cofounder and cto @tensorzero. previously did a PhD at CMU doing sample efficient reinforcement learning for tokamaks and math and cs @stanford before that
내가 좋아할 만한 콘텐츠
read the full analysis, methodology, and code examples: tensorzero.com/blog/is-openai… thanks to @gabrielbianconi @thebigmehtaphor Alan Mishler and Shuyang Li for their helpful feedback what's your experience with rft? cooked or let it cook?
tensorzero.com
Is OpenAI's Reinforcement Fine-Tuning (RFT) Worth It? · TensorZero
Is OpenAI's Reinforcement Fine-Tuning (RFT) Worth It?
how do inference services ever output control tokens? can't you unit test for this? shouldn't you have monitoring set up for this? I don't want to name names but sheeeeesh
the responses API is a way for @OpenAI to lock people in. It’s trivially true that any stateful interface can become stateless if you are allowed to return an encrypted blob of data (like Anthropic does today). Gating new features behind something like this is a wedge
10000 and 100 on the same day is nice & round it is great to see the project grow up big & strong!
Thank you for 10,000 stars and 100 contributors — on the same day! We're excited to continue building a next-generation open-source stack for LLM applications. Find us on GitHub ↓
Al Chen (@bigal123) from Grammarly recently built an agent with a continuous improvement loop using TensorZero + LangGraph. Watch his presentation at the NYC LangChain Meetup last week!
We need to make good decisions about an applications’s LLM stack the same way we make good decisions about parameters of other high-throughput systems. Alan’s work will bring automation and rigor to the process for TensorZero users. Couldn’t be more excited to work with him.
Alan Mishler is an ML researcher with a background in causal inference, sequential decision making, uncertainty quantification, and algorithmic fairness (1.2k+ citations). Previously, he was an AI Research Lead at JPMorgan AI Research and received a PhD in Statistics from CMU,…
🫡🤘🏾🚀
YOU INTRO → WE HIRE = $25k REWARD How to pitch your smart friends: 1. Recently hit '# 1 trending repository of the week' globally on GitHub → vast majority of the work will be open source 2. Team: 3 ML researchers (Stanford, CMU, Oxford, Columbia → 6k+ citations), Rust…
my 🐐@YoungseogC
Congratulations @youngseogc for successfully defending his ML PhD at CMU! (& thank you for the TensorZero shoutout!)
WANTED: Founding MTS with a background that combines product and engineering We can't keep up with TensorZero's open-source traction. You can help. Join our founding team in a cross-functional role that combines product, engineering, and GTM. Team: 3 ML researchers (Stanford,…
This release to me highlights the power of @rustlang and the PyO3 / NAPI-RS interfaces on top. It’s so nice to have a portable and canonical codebase to maintain consistency. The experimental features speak to the crazy stuff we’ll be building later this year. 🤔
TensorZero 2025.7.0 is out! 📌 This release revamps TensorZero's optimization workflows. Supervised fine-tuning now supports multimodal data (vision, documents, etc.), multi-turn tool use, and more. You can also launch and monitor optimization workflows programmatically with our…
TensorZero #1 globally on GitHub this week 🤯 thank you to everyone especially the OSS homies
It’s been a blast collaborating with Andrew over the past couple months and we couldn’t be more excited to have him on board. We’re excited for our developer community to benefit from the stuff he’s cooking!
Andrew is an ML researcher with deep expertise in Bayesian ML, causal inference, RL, and LLMs. He’s finishing his postdoc at Columbia and previously received a PhD from Oxford, during which he interned at Meta. He has 3.3k+ citations and several first-author papers at NeurIPS and…
I switched from a decade of vim to @cursor_ai. Now we can peek at how it works 🕵️. check out how we got under the hood of this bad boy
What happens under the hood of a $9.9B coding assistant? We reverse-engineered Cursor's LLM client to observe the requests being made (including prompts!), A/B test different models, and eventually optimize our own prompts + models. ↓ Link in thread with code to reproduce +…
really excited for Dynamic Evaluations - a key milestone for some of the crazy RL features we have planned
TensorZero 2025.5.8 is out! Highlights 📌 This release includes Dynamic Evaluations, a new feature that enables you to evaluate complex workflows that combine multiple inference calls with arbitrary application logic (e.g. agents, RAG, and more). Full Changelog 🔨 Remove…
United States 트렌드
- 1. Friendly 60.4K posts
- 2. SNAP 688K posts
- 3. Big Dom 1,583 posts
- 4. #JUNGKOOKXCALVINKLEIN 34.9K posts
- 5. Jamaica 109K posts
- 6. Jessica 27.1K posts
- 7. Riley Gaines 32.4K posts
- 8. Runza N/A
- 9. Mazie 1,217 posts
- 10. RIP Beef 1,637 posts
- 11. 53 Republicans 4,173 posts
- 12. Crash Bandicoot 6,061 posts
- 13. Sonic Prime 1,244 posts
- 14. MRIs 7,114 posts
- 15. Heal 37.5K posts
- 16. Sports Equinox 12.6K posts
- 17. #NationalBlackCatDay 4,888 posts
- 18. 7 Democrats 5,314 posts
- 19. Monday Night Football 6,220 posts
- 20. Stearns N/A
내가 좋아할 만한 콘텐츠
-
David Held
@davheld -
Devendra Chaplot
@dchaplot -
Yiding Jiang
@yidingjiang -
Arthur Mensch
@arthurmensch -
Samuel Albanie 🇬🇧
@SamuelAlbanie -
Aurelien Lucchi
@AurelienLucchi -
Princeton Visual AI lab
@VisualAILab -
will grathwohl
@wgrathwohl -
Sanja Fidler
@FidlerSanja -
Zico Kolter
@zicokolter -
Davide Scaramuzza
@davsca1 -
Phillip Isola
@phillip_isola -
Deepak Pathak
@pathak2206 -
Rohin Shah
@rohinmshah -
Guanya Shi
@GuanyaShi
Something went wrong.
Something went wrong.