
Ilya Dyachenko

@flash_us

Machine Learning, Python enthusiast. SAP ABAP professional.

Ilya Dyachenko reposted

Holy shit... this might be the next big paradigm shift in AI. 🤯

Tencent + Tsinghua just dropped a paper called Continuous Autoregressive Language Models (CALM) and it basically kills the “next-token” paradigm every LLM is built on.

Instead of predicting one token at a time,…
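The thread is cut off above, but the core move described in the paper's abstract is to predict one continuous vector that stands in for a chunk of K tokens instead of one discrete token per step. Below is only a rough, hedged sketch of that contrast; every name, dimension, and the MSE loss are invented here for illustration (the actual CALM pairs a learned autoencoder with a likelihood-free generative head).

```python
import torch
import torch.nn as nn

# Toy contrast between next-token and next-vector prediction. All shapes and
# names are illustrative; the real CALM compresses a chunk of K tokens into a
# vector with a learned autoencoder and trains a likelihood-free head, not the
# plain MSE regression used here.
vocab_size, d_model, latent_dim, K = 32_000, 512, 128, 4

next_token_head = nn.Linear(d_model, vocab_size)    # classic LM: softmax over the vocab, one token per step
next_vector_head = nn.Linear(d_model, latent_dim)   # CALM-style: one continuous vector per K-token chunk

hidden = torch.randn(1, d_model)                    # stand-in for the LM's hidden state

token_logits = next_token_head(hidden)              # (1, vocab_size): sample a single token
chunk_vector = next_vector_head(hidden)             # (1, latent_dim): decodes back to K tokens at once

# Illustrative training signal: pull the prediction toward the (here random)
# autoencoder vector of the true next chunk.
target_vector = torch.randn(1, latent_dim)
loss = nn.functional.mse_loss(chunk_vector, target_vector)
loss.backward()

seq_len = 1024
print(f"steps for {seq_len} tokens: {seq_len} (token-by-token) vs {seq_len // K} (chunked)")
```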


Ilya Dyachenko reposted

This is Microsoft SandDance, originally a closed-source project that was later open-sourced. It lets you visually explore and understand data with smooth, animated transitions between multiple views.


Ilya Dyachenko reposted

Introducing Codemaps in @windsurf, powered by SWE-1.5 and Sonnet 4.5.

“Your code is your understanding of the problem you’re exploring. So it’s only when you have your code in your head that you really understand the problem.” — @paulg


Ilya Dyachenko reposted

Harvard professor literally dropped the best ML systems tutorial you’ll ever see


Ilya Dyachenko reposted

No excuse anymore not to train your own models! This is 200+ pages with full transparency. Let's go, open-source AI!

Training LLMs end to end is hard. Very excited to share our new blog (book?) that covers the full pipeline: pre-training, post-training, and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably: huggingface.co/spaces/Hugging…



Ilya Dyachenko reposted

This could be the final nail in Jupyter's coffin. Deepnote is going open-source! Their kernel is way more powerful than Jupyter's, but still backwards compatible.

Notebooks are amazing:
• They are perfect for data exploration
• They are perfect for collaborating with AI…


Ilya Dyachenko reposted

You can now fine-tune DeepSeek-OCR with our free notebook! We fine-tuned DeepSeek-OCR, improving its language understanding by 89% and reducing the Character Error Rate from 149% to 60%.

Blog: docs.unsloth.ai/new/deepseek-o…
GitHub: github.com/unslothai/unsl…
Colab: colab.research.google.com/github/unsloth…
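For orientation, a minimal sketch of the workflow the notebook automates, using Unsloth's vision fine-tuning entry point; the repo id and LoRA settings below are placeholders chosen for illustration, so treat the linked Colab as the source of truth.

```python
from unsloth import FastVisionModel  # Unsloth's entry point for vision/OCR models

# Placeholder repo id; the linked notebook pins the exact DeepSeek-OCR checkpoint.
model, tokenizer = FastVisionModel.from_pretrained(
    "unsloth/DeepSeek-OCR",
    load_in_4bit=True,        # 4-bit loading so it fits on a free Colab GPU
)

# Attach LoRA adapters so only a small fraction of the weights is trained.
model = FastVisionModel.get_peft_model(
    model,
    r=16,           # LoRA rank (illustrative)
    lora_alpha=16,  # scaling matched to the rank
)

# The notebook then builds an image+transcription dataset and runs TRL's
# SFTTrainer; the 89% language-understanding gain and the CER drop quoted
# above come from that fine-tuning run.
```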


Ilya Dyachenko reposted

New Llama.cpp UI is a blessing for the local AI world 🌎
- Blazing fast, beautiful, and private (ofc)
- Use 150,000+ GGUF models in a super slick UI
- Drop in PDFs, images, or text documents
- Branch and edit conversations anytime
- Parallel chats and image processing
- Math and…

A detailed look into the new WebUI of llama.cpp
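The new UI ships with the llama.cpp server itself, but the same GGUF checkpoints can also be driven from Python. A small sketch with the llama-cpp-python bindings, where the repo id and filename pattern are placeholders picked for illustration:

```python
from llama_cpp import Llama  # Python bindings for llama.cpp

# Repo id and filename glob are illustrative; any GGUF repo on the Hub works.
llm = Llama.from_pretrained(
    repo_id="Qwen/Qwen2.5-0.5B-Instruct-GGUF",
    filename="*q4_k_m.gguf",   # glob selects the 4-bit K-quant file
    n_ctx=4096,                # context window to allocate
)

reply = llm.create_chat_completion(
    messages=[{"role": "user", "content": "In one sentence, what is a GGUF file?"}]
)
print(reply["choices"][0]["message"]["content"])
```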



Ilya Dyachenko reposted

AI coding just arrived in Jupyter notebooks - and @brganger (Jupyter co-founder) and I will show you how to use it. Coding by hand is becoming obsolete. The latest Jupyter AI - built by the Jupyter team and showcased at JupyterCon this week - brings AI assistance directly into…
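The session above is about the JupyterLab chat UI, but the same jupyter-ai package also ships IPython magics; a tiny sketch of that route, assuming you already have a provider key configured (model aliases vary by install):

```python
# Run inside a notebook cell: load the magics that come with jupyter-ai.
%load_ext jupyter_ai_magics

# Show which providers/models are usable with the API keys you have set.
%ai list

# In a separate cell, %%ai sends the cell body as a prompt, for example:
#   %%ai chatgpt
#   Explain the last traceback in two sentences.
```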


Ilya Dyachenko reposted

The first VS Code extension for Solana is here. Real-time security analysis + fuzz coverage visualization. Built by the auditors and educators behind School of Solana. Thread ↓


Ilya Dyachenko reposted

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes, data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision person at heart who is temporarily masquerading as a natural language…

🚀 DeepSeek-OCR — the new frontier of OCR from @deepseek_ai, exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model support.

🧠 Compresses visual contexts up to 20× while keeping…
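For reference, a minimal offline-inference sketch with vLLM's Python API; the image-placeholder prompt format is an assumption on my part (the model card has the exact template), and the generation settings are illustrative:

```python
from vllm import LLM, SamplingParams
from PIL import Image

# DeepSeek-OCR needs its custom model code; the prompt template below is an
# assumption, check the model card for the exact image placeholder and wording.
llm = LLM(model="deepseek-ai/DeepSeek-OCR", trust_remote_code=True)

image = Image.open("scanned_page.png")
prompt = "<image>\nFree OCR."

outputs = llm.generate(
    {"prompt": prompt, "multi_modal_data": {"image": image}},
    SamplingParams(temperature=0.0, max_tokens=2048),
)
print(outputs[0].outputs[0].text)  # recognized text for the page
```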



Ilya Dyachenko reposted

ML Algorithms Cheatsheet


Ilya Dyachenko reposted

When I teach Principal Component Analysis (PCA), I start with the core idea: a linear, orthogonal transformation that maximizes variance and removes correlation. Then we jump into my interactive, hands-on demo — using a #Python dashboard built with @matplotlib to perform PCA…
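Not the dashboard from the demo, just a minimal self-contained illustration of the two claims in the first sentence: the principal axes maximize variance, and the transformed coordinates are uncorrelated.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

# Correlated 2-D toy data.
rng = np.random.default_rng(0)
x = rng.normal(size=500)
data = np.column_stack([x, 0.6 * x + 0.3 * rng.normal(size=500)])

# Linear, orthogonal transformation onto the directions of maximal variance.
pca = PCA(n_components=2)
scores = pca.fit_transform(data)

print("explained variance ratio:", pca.explained_variance_ratio_)
print("correlation after PCA:", round(float(np.corrcoef(scores.T)[0, 1]), 6))

# Side-by-side look: correlated cloud vs. decorrelated PC scores.
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 4))
ax1.scatter(*data.T, s=5); ax1.set_title("original (correlated)")
ax2.scatter(*scores.T, s=5); ax2.set_title("PC scores (decorrelated)")
plt.show()
```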


Ilya Dyachenko reposted

Last week, China barred its major tech companies from buying Nvidia chips. This move received only modest attention in the media, but has implications beyond what’s widely appreciated. Specifically, it signals that China has progressed sufficiently in semiconductors to break away…


Ilya Dyachenko reposted

The METR paper that says that “the length of tasks AI can do is doubling every 7 months” radically undersells the scaling that we’re seeing at Replit.

It might be true if you’re measuring one long trajectory for a single model class.

But this is where an agent research lab’s…


Longer Autonomous Runs. Agent 3 is 10x more autonomous than V2, capable of handling much more complex builds by detecting and fixing errors on its own. You can track the progress of your build with Live Monitoring on your phone, freeing you up to focus on other creative work.



Ilya Dyachenko reposted

Congrats guys on another epic release! We're uploading Dynamic GGUFs, and one with 1M context length so you guys can run it locally! 🦥⭐️ huggingface.co/unsloth/Qwen3-…


Ilya Dyachenko reposted

Live now on OpenRouter! x.com/OpenRouterAI/s…

🟣New: Qwen3-Coder by @Alibaba_Qwen
- 480B params (35B active)
- Native 256K context length, extrapolates to 1M
- Outperforms Kimi, o3, DeepSeek, and more on SWE-Bench Verified (69.6%) 👀

Now live, starting at $1/M tokens 👇
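For anyone wanting to try it, OpenRouter exposes an OpenAI-compatible API, so the standard client is enough; the model slug below is my best guess for Qwen3-Coder, confirm the exact id on openrouter.ai/models.

```python
from openai import OpenAI

# OpenRouter is OpenAI-compatible at this base URL.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="YOUR_OPENROUTER_API_KEY",  # from your OpenRouter account
)

resp = client.chat.completions.create(
    model="qwen/qwen3-coder",  # assumed slug, verify before use
    messages=[{"role": "user", "content": "Write a Python function that reverses a singly linked list."}],
)
print(resp.choices[0].message.content)
```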



Ilya Dyachenko reposted

>>> Qwen3-Coder is here! ✅

We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…


Ilya Dyachenko reposted

Higgsfield SOUL realism just broke the Internet today. This is 100% AI.

10 wild examples + how to try:

1. Bimbocore - Close-up selfie, bubble-gum backdrop


Ilya Dyachenko reposted

Sometimes the future seems like a dystopia. Drones are increasingly being used as new, mobile advertising spaces.

