DataDigestDaily

@DataDigestDaily

Just a silly ML engineer who breaks stuff and fixes it back on. A Capricorn who loves popcorn.

DataDigestDaily reposted

LangChainAI: Open Deep Research is here 🔍 We've open sourced one of the most powerful agent use cases. Built on LangGraph, Open Deep Research:

• Uses a supervisor architecture to coordinate research sub-agents
• Supports your own LLMs, tools, and MCP servers
• Produces high-quality…

DataDigestDaily reposted

Kimi_Moonshot: 🚀 Hello, Kimi K2! Open-Source Agentic Model!
🔹 1T total / 32B active MoE model
🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models
🔹 Strong in coding and agentic tasks
🐤 Multimodal & thought-mode not supported for now

With Kimi K2, advanced agentic intelligence…

DataDigestDaily reposted

eliebakouch: Super excited to share SmolLM3, a new strong 3B model.

SmolLM3 is fully open, we share the recipe, the dataset, the training codebase and much more!

> Train on 11T token on 384 H100 for 220k GPU hours
> Support long context up to 128k thanks to NoPE and intra document masking
>…

DataDigestDaily reposted

kimmonismus: Grok 4 benchmarks are nuts!

- AIME 95% and HLE 45%, outperforming o3 by a lot

Grok 4 with reasoning is amazing

testingcatalog: Grok 4 early benchmarks in comparison to other models.

Humanity's Last Exam diff is 🔥

Visualised by @marczierer


DataDigestDaily reposted
Hesamation's tweet image.

DataDigestDaily reposted

Sauers_: CLAAAUUUUDDDEEEEE!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

DataDigestDaily reposted

It’s easy to fine-tune small models w/ RL to outperform foundation models on vertical tasks. We’re open sourcing Osmosis-Apply-1.7B: a small model that merges code (similar to Cursor’s instant apply) better than foundation models. Links to download and try out the model below!


DataDigestDaily reposted

Love this project: nanoGPT -> recursive self-improvement benchmark. Good old nanoGPT keeps on giving and surprising :) - First I wrote it as a small little repo to teach people the basics of training GPTs. - Then it became a target and baseline for my port to direct C/CUDA…

MinqiJiang: Recently, there has been a lot of talk of LLM agents automating ML research itself. If Llama 5 can create Llama 6, then surely the singularity is just around the corner.

How can we get a pulse check on whether current LLMs are capable of driving this kind of total…


DataDigestDaily reposted

Some interesting Gemini CLI use cases and tutorials 🧵⬇️


DataDigestDaily reposted

🚀LangGraph v0.5 is out! Check out the new features and improvements in our release notes! github.com/langchain-ai/l…


DataDigestDaily reposted

Marktechpost: University of Michigan Researchers Propose G-ACT: A Scalable Machine Learning Framework to Steer Programming Language Bias in LLMs

Large language models (LLMs) have shown significant capabilities in text-based applications, but generating scientific code remains a challenge due…

DataDigestDaily reposted

goyal__pramod: It took me 15 days to compile this list of the most important LLM papers.

I didn't want to miss any work that led to where we are today.

goyal__pramod: Most influential LLM papers and the ideas they introduced (post 2017)

A long thread 🧵


DataDigestDaily reposted

Since it's summer, and more or less internship and tech interview season, I made all 30 chapters of my Machine Learning Q and AI book freely available for the summer: sebastianraschka.com/books/ml-q-and… Hope it’s helpful! Happy reading, and good luck if you are interviewing!

sebastianraschka.com

Machine Learning Q and AI

A curated book of 30 concise Q&A chapters on modern machine learning and AI, from embeddings to transformers to evaluation.


DataDigestDaily reposted

Claude Code, OpenAI Codex (CLI) and now Gemini CLI - we need a name for this category of AI-assisted terminal tools, maybe "terminal agents"?

Introducing Gemini CLI, a light and powerful open-source AI agent that brings Gemini directly into your terminal. >_ Write code, debug, and automate tasks with Gemini 2.5 Pro with industry-leading high usage limits at no cost.



DataDigestDaily reposted

Gemini 2.5 is driving new advancements in robotics with its strong coding, spatial reasoning, and multimodal capabilities. 🦾


DataDigestDaily reposted

Synthetic ecosystems that autonomously and continuously evolve in silico 👾 An Alifer dream we pursue with Flow-Lenia ! If you are interested in complex systems with (1) emergent creatures and (2) intrinsic evolutionary dynamics, go check our new paper ! A 🧵


DataDigestDaily reposted

BREAKING 🚨: Google is launching a new open-source Gemini CLI Agent! It is powered by Gemini 2.5 Pro and supports MCPs as well. Google is entering the terminal 🤖


DataDigestDaily reposted

Introducing Gemini CLI, a light and powerful open-source AI agent that brings Gemini directly into your terminal. >_ Write code, debug, and automate tasks with Gemini 2.5 Pro with industry-leading high usage limits at no cost.


DataDigestDaily reposted

_philschmid: Gemini CLI is here! Our most powerful open-source CLI that brings Google's Gemini 2.5 models directly into your terminal! With unique features like hierarchical memory (context), self-correcting file edits, and secure sandboxed tool execution.

💡 Hierarchical Memory and…
