TensorTonic's profile picture. Machine Learning papers, concepts, and resources.

TensorTonic

@TensorTonic

Machine Learning papers, concepts, and resources.

Fixado

Too many ML papers, too little time. We’ll read them so you don’t have to. What you’ll find here at TensorTonic: 🔹 Breakdowns of key ML/LLM papers every week 🔹 Explanations of core concepts in AI 🔹 Research explained in plain language


TensorTonic repostou

You can now chat with apps in ChatGPT.


TensorTonic repostou

🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model! ✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context. 👉 Now live on App, Web, and API. 💰 API prices cut by 50%+! 1/n


TensorTonic repostou

new research from Meta FAIR: Code World Model (CWM), a 32B research model we encourage the research community to research this open-weight model! pass@1 evals, for the curious: 65.8 % on SWE-bench Verified 68.6 % on LiveCodeBench 96.6 % on Math-500 76.0 % on AIME 2024 🧵

alexandr_wang's tweet image. new research from Meta FAIR: Code World Model (CWM), a 32B research model

we encourage the research community to research this open-weight model!

pass@1 evals, for the curious:

65.8 % on SWE-bench Verified
68.6 % on LiveCodeBench
96.6 % on Math-500
76.0 % on AIME 2024

🧵

TensorTonic repostou

Introducing Magistral Small 1.2 & Magistral Medium 1.2, minor updates to our Magistral 1.1 models! - Multimodality: Now equipped with a vision encoder, these models handle both text and images seamlessly. - Performance Boost: 15% improvements on math and coding benchmarks such…

MistralAI's tweet image. Introducing Magistral Small 1.2 & Magistral Medium 1.2, minor updates to our Magistral 1.1 models!

- Multimodality: Now equipped with a vision encoder, these models handle both text and images seamlessly.
- Performance Boost: 15% improvements on math and coding benchmarks such…
MistralAI's tweet image. Introducing Magistral Small 1.2 & Magistral Medium 1.2, minor updates to our Magistral 1.1 models!

- Multimodality: Now equipped with a vision encoder, these models handle both text and images seamlessly.
- Performance Boost: 15% improvements on math and coding benchmarks such…

Great video to understand GRPO.

TensorTonic's tweet image. Great video to understand GRPO.

Great video for learning CUDA programming

TensorTonic's tweet image. Great video for learning CUDA programming

United States Tendências

Loading...

Something went wrong.


Something went wrong.