DataDigestDaily

@DataDigestDaily

Just a silly ML engineer who breaks stuff and fixes it back on. A Capricorn who loves popcorn.

hmmm, take a guess ;)

انضم في فبراير 2024

955المنشورات 39المتابعون 144المتابَعون

DataDigestDaily

@DataDigestDaily

٣٠ يوليوم

sg-public-api.hoyoverse.com/event/social_s…

DataDigestDaily's tweet card. May the moonlight bless your journey ahead.

The Nod-Krai Concept Overview is now available.

المصدر: sg-public-api.hoyoverse.com

DataDigestDaily أعاد

Open Deep Research is here 🔍 We've open sourced one of the most powerful agent use cases. Built on LangGraph, Open Deep Research: • Uses a supervisor architecture to coordinate research sub-agents • Supports your own LLMs, tools, and MCP servers • Produces high-quality…

LangChainAI's tweet image. Open Deep Research is here 🔍 We've open sourced one of the most powerful agent use cases. Built on LangGraph, Open Deep Research:

• Uses a supervisor architecture to coordinate research sub-agents
• Supports your own LLMs, tools, and MCP servers
• Produces high-quality…

DataDigestDaily أعاد

Kimi.ai

@Kimi_Moonshot

١١ يوليوم

🚀 Hello, Kimi K2! Open-Source Agentic Model! 🔹 1T total / 32B active MoE model 🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models 🔹Strong in coding and agentic tasks 🐤 Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence…

Kimi_Moonshot's tweet image. 🚀 Hello, Kimi K2! Open-Source Agentic Model!
🔹 1T total / 32B active MoE model
🔹 SOTA on SWE Bench Verified, Tau2 &amp; AceBench among open models
🔹Strong in coding and agentic tasks
🐤 Multimodal &amp; thought-mode not supported for now

With Kimi K2, advanced agentic intelligence…

DataDigestDaily أعاد

elie

@eliebakouch

٨ يوليوم

Super excited to share SmolLM3, a new strong 3B model. SmolLM3 is fully open, we share the recipe, the dataset, the training codebase and much more! > Train on 11T token on 384 H100 for 220k GPU hours > Support long context up to 128k thanks to NoPE and intra document masking >…

eliebakouch's tweet image. Super excited to share SmolLM3, a new strong 3B model.

SmolLM3 is fully open, we share the recipe, the dataset, the training codebase and much more!

&gt; Train on 11T token on 384 H100 for 220k GPU hours
&gt; Support long context up to 128k thanks to NoPE and intra document masking
&gt;…

DataDigestDaily أعاد

Chubby♨️

@kimmonismus

٤ يوليوم

Grok 4 benchmarks are nuts! - AIME 95% and HLE45%, outperforming o3 by alot Grok 4 with reasoning is amazing

TestingCatalog News 🗞

@testingcatalog

٤ يوليوم

Grok 4 early benchmarks in comparison to other models. Humanity last exam diff is 🔥 Visualised by @marczierer

DataDigestDaily أعاد

ℏεsam

@Hesamation

٣ يوليوم

DataDigestDaily أعاد

Sauers

@Sauers_

٣ يوليوم

CLAAAUUUUDDDEEEEE!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

DataDigestDaily أعاد

Kasey Zhang

@_WEEXIAO

٣ يوليوم

It’s easy to fine-tune small models w/ RL to outperform foundation models on vertical tasks. We’re open sourcing Osmosis-Apply-1.7B: a small model that merges code (similar to Cursor’s instant apply) better than foundation models. Links to download and try out the model below!

DataDigestDaily أعاد

Andrej Karpathy

@karpathy

٣٠ يونيوم

Love this project: nanoGPT -> recursive self-improvement benchmark. Good old nanoGPT keeps on giving and surprising :) - First I wrote it as a small little repo to teach people the basics of training GPTs. - Then it became a target and baseline for my port to direct C/CUDA…

Minqi Jiang

@MinqiJiang

٣٠ يونيوم

Recently, there has been a lot of talk of LLM agents automating ML research itself. If Llama 5 can create Llama 6, then surely the singularity is just around the corner. How can we get a pulse check on whether current LLMs are capable of driving this kind of total…

MinqiJiang's tweet image. Recently, there has been a lot of talk of LLM agents automating ML research itself. If Llama 5 can create Llama 6, then surely the singularity is just around the corner.

How can we get a pulse check on whether current LLMs are capable of driving this kind of total…

DataDigestDaily أعاد

Google AI Developers

@googleaidevs

١ يوليوم

Some interesting Gemini CLI use cases and tutorials 🧵⬇️

DataDigestDaily أعاد

Sydney Runkle

@sydneyrunkle

٣٠ يونيوم

🚀LangGraph v0.5 is out! Check out the new features and improvements in our release notes! github.com/langchain-ai/l…

sydneyrunkle's tweet card. LangGraph 0.5 – the “Getting-Ready-for-1.0” release 🎉 TL;DR – 0.5 is not a radical rewrite, but a scrub-down and tune-up of the LangGraph core. APIs are a little stricter, you have more control...

Release 0.5.0 · langchain-ai/langgraph

المصدر: github.com

DataDigestDaily أعاد

Marktechpost AI Dev News ⚡

@Marktechpost

٣٠ يونيوم

University of Michigan Researchers Propose G-ACT: A Scalable Machine Learning Framework to Steer Programming Language Bias in LLMs Large language models (LLMs) have shown significant capabilities in text-based applications, but generating scientific code remains a challenge due…

Marktechpost's tweet image. University of Michigan Researchers Propose G-ACT: A Scalable Machine Learning Framework to Steer Programming Language Bias in LLMs

Large language models (LLMs) have shown significant capabilities in text-based applications, but generating scientific code remains a challenge due…

DataDigestDaily أعاد

Pramod Goyal

@goyal__pramod

٢٨ يونيوم

It took me 15 days to compile this list of the most important LLM papers. I didn't want to miss any work that led to where we are today.

goyal__pramod's tweet image. It took me 15 days to compile this list of the most important LLM papers.

I didn't want to miss any work that led to where we are today.

Pramod Goyal

@goyal__pramod

١١ مايوم

Most influential LLM papers and the ideas they introduced (post 2017) A long thread 🧵

DataDigestDaily أعاد

Sebastian Raschka

@rasbt

٢٩ يونيوم

Since it's summer, and more or less internship and tech interview season, I made all 30 chapters of my Machine Learning Q and AI book freely available for the summer: sebastianraschka.com/books/ml-q-and… Hope it’s helpful! Happy reading, and good luck if you are interviewing!

sebastianraschka.com

Machine Learning Q and AI

A curated book of 30 concise Q&A chapters on modern machine learning and AI, from embeddings to transformers to evaluation.

المصدر: sebastianraschka.com

DataDigestDaily أعاد

Simon Willison

@simonw

٢٥ يونيوم

Claude Code, OpenAI Codex (CLI) and now Gemini CLI - we need a name for this category of AI-assisted terminal tools, maybe "terminal agents"?

Google AI Developers

@googleaidevs

٢٥ يونيوم

Introducing Gemini CLI, a light and powerful open-source AI agent that brings Gemini directly into your terminal. >_ Write code, debug, and automate tasks with Gemini 2.5 Pro with industry-leading high usage limits at no cost.

DataDigestDaily أعاد

Google AI Developers

@googleaidevs

٢٤ يونيوم

Gemini 2.5 is driving new advancements in robotics with its strong coding, spatial reasoning, and multimodal capabilities. 🦾

DataDigestDaily أعاد

erwan plantec

@eplantec

٢٤ يونيوم

Synthetic ecosystems that autonomously and continuously evolve in silico 👾 An Alifer dream we pursue with Flow-Lenia ! If you are interested in complex systems with (1) emergent creatures and (2) intrinsic evolutionary dynamics, go check our new paper ! A 🧵

DataDigestDaily أعاد

TestingCatalog News 🗞

@testingcatalog

٢٥ يونيوم

BREAKING 🚨: Google is launching a new open-source Gemini CLI Agent! It is powered by Gemini 2.5 Pro and supports MCPs as well. Google is entering the terminal 🤖

DataDigestDaily أعاد

Google AI Developers

@googleaidevs

٢٥ يونيوم

DataDigestDaily أعاد

Philipp Schmid

@_philschmid

٢٥ يونيوم

Gemini CLI is here! Our most powerful open-source CLI that brings Google's Gemini 2.5 models directly into your terminal! With unique features like hierarchical memory (context), self-correcting file edits, and secure sandboxed tool execution. 💡 Hierarchical Memory and…