thebigmehtaphor's profile picture. cofounder and cto @tensorzero. previously did a PhD at CMU doing sample efficient reinforcement learning for tokamaks and math and cs @stanford before that

Viraj Mehta

@thebigmehtaphor

cofounder and cto @tensorzero. previously did a PhD at CMU doing sample efficient reinforcement learning for tokamaks and math and cs @stanford before that

Viraj Mehta 님이 재게시함

read the full analysis, methodology, and code examples: tensorzero.com/blog/is-openai… thanks to @gabrielbianconi @thebigmehtaphor Alan Mishler and Shuyang Li for their helpful feedback what's your experience with rft? cooked or let it cook?

tensorzero.com

Is OpenAI's Reinforcement Fine-Tuning (RFT) Worth It? · TensorZero

Is OpenAI's Reinforcement Fine-Tuning (RFT) Worth It?


how do inference services ever output control tokens? can't you unit test for this? shouldn't you have monitoring set up for this? I don't want to name names but sheeeeesh


the responses API is a way for @OpenAI to lock people in. It’s trivially true that any stateful interface can become stateless if you are allowed to return an encrypted blob of data (like Anthropic does today). Gating new features behind something like this is a wedge


10000 and 100 on the same day is nice & round it is great to see the project grow up big & strong!

Thank you for 10,000 stars and 100 contributors — on the same day! We're excited to continue building a next-generation open-source stack for LLM applications. Find us on GitHub ↓

TensorZero's tweet image. Thank you for 10,000 stars and 100 contributors — on the same day!

We're excited to continue building a next-generation open-source stack for LLM applications.

Find us on GitHub ↓


Viraj Mehta 님이 재게시함

Al Chen (@bigal123) from Grammarly recently built an agent with a continuous improvement loop using TensorZero + LangGraph. Watch his presentation at the NYC LangChain Meetup last week!


Viraj Mehta 님이 재게시함

built in Rust for performance 🏎️

TensorZero's tweet image. built in Rust for performance 🏎️

We need to make good decisions about an applications’s LLM stack the same way we make good decisions about parameters of other high-throughput systems. Alan’s work will bring automation and rigor to the process for TensorZero users. Couldn’t be more excited to work with him.

Alan Mishler is an ML researcher with a background in causal inference, sequential decision making, uncertainty quantification, and algorithmic fairness (1.2k+ citations). Previously, he was an AI Research Lead at JPMorgan AI Research and received a PhD in Statistics from CMU,…

TensorZero's tweet image. Alan Mishler is an ML researcher with a background in causal inference, sequential decision making, uncertainty quantification, and algorithmic fairness (1.2k+ citations). Previously, he was an AI Research Lead at JPMorgan AI Research and received a PhD in Statistics from CMU,…


Viraj Mehta 님이 재게시함

BREAKING: IT'S OVER 9000

TensorZero's tweet image. BREAKING: IT'S OVER 9000

🫡🤘🏾🚀

YOU INTRO → WE HIRE = $25k REWARD How to pitch your smart friends: 1. Recently hit '# 1 trending repository of the week' globally on GitHub → vast majority of the work will be open source 2. Team: 3 ML researchers (Stanford, CMU, Oxford, Columbia → 6k+ citations), Rust…

gabrielbianconi's tweet image. YOU INTRO → WE HIRE = $25k REWARD

How to pitch your smart friends:

1. Recently hit '# 1 trending repository of the week' globally on GitHub
→ vast majority of the work will be open source

2. Team: 3 ML researchers (Stanford, CMU, Oxford, Columbia → 6k+ citations), Rust…


my 🐐@YoungseogC

Congratulations @youngseogc for successfully defending his ML PhD at CMU! (& thank you for the TensorZero shoutout!)

TensorZero's tweet image. Congratulations @youngseogc for successfully defending his ML PhD at CMU!

(& thank you for the TensorZero shoutout!)


Viraj Mehta 님이 재게시함

WANTED: Founding MTS with a background that combines product and engineering We can't keep up with TensorZero's open-source traction. You can help. Join our founding team in a cross-functional role that combines product, engineering, and GTM. Team: 3 ML researchers (Stanford,…

TensorZero's tweet image. WANTED: Founding MTS with a background that combines product and engineering

We can't keep up with TensorZero's open-source traction. You can help.

Join our founding team in a cross-functional role that combines product, engineering, and GTM.

Team: 3 ML researchers (Stanford,…

This release to me highlights the power of @rustlang and the PyO3 / NAPI-RS interfaces on top. It’s so nice to have a portable and canonical codebase to maintain consistency. The experimental features speak to the crazy stuff we’ll be building later this year. 🤔

TensorZero 2025.7.0 is out! 📌 This release revamps TensorZero's optimization workflows. Supervised fine-tuning now supports multimodal data (vision, documents, etc.), multi-turn tool use, and more. You can also launch and monitor optimization workflows programmatically with our…



TensorZero #1 globally on GitHub this week 🤯 thank you to everyone especially the OSS homies

thebigmehtaphor's tweet image. TensorZero #1 globally on GitHub this week 🤯
thank you to everyone especially the OSS homies

It’s been a blast collaborating with Andrew over the past couple months and we couldn’t be more excited to have him on board. We’re excited for our developer community to benefit from the stuff he’s cooking!

Andrew is an ML researcher with deep expertise in Bayesian ML, causal inference, RL, and LLMs. He’s finishing his postdoc at Columbia and previously received a PhD from Oxford, during which he interned at Meta. He has 3.3k+ citations and several first-author papers at NeurIPS and…

TensorZero's tweet image. Andrew is an ML researcher with deep expertise in Bayesian ML, causal inference, RL, and LLMs. He’s finishing his postdoc at Columbia and previously received a PhD from Oxford, during which he interned at Meta. He has 3.3k+ citations and several first-author papers at NeurIPS and…


Viraj Mehta 님이 재게시함

🫶

TensorZero's tweet image. 🫶

I switched from a decade of vim to @cursor_ai. Now we can peek at how it works 🕵️. check out how we got under the hood of this bad boy

What happens under the hood of a $9.9B coding assistant? We reverse-engineered Cursor's LLM client to observe the requests being made (including prompts!), A/B test different models, and eventually optimize our own prompts + models. ↓ Link in thread with code to reproduce +…

TensorZero's tweet image. What happens under the hood of a $9.9B coding assistant?

We reverse-engineered Cursor's LLM client to observe the requests being made (including prompts!), A/B test different models, and eventually optimize our own prompts + models.

↓ Link in thread with code to reproduce +…


Viraj Mehta 님이 재게시함

really excited for Dynamic Evaluations - a key milestone for some of the crazy RL features we have planned

TensorZero 2025.5.8 is out! Highlights 📌 This release includes Dynamic Evaluations, a new feature that enables you to evaluate complex workflows that combine multiple inference calls with arbitrary application logic (e.g. agents, RAG, and more). Full Changelog 🔨 Remove…



Loading...

Something went wrong.


Something went wrong.