Dmitry Petrov
@FullStackML
🛠️ Build data tools for AI / ML. Ex-Data Scientist @Microsoft. PhD in CS. Telling jokes with a poker face.
DBT + Fivetran 🚀 A huge milestone for the "modern data stack". Consolidation is on - who's next? Snowflake ❄️? Databricks 🔥? But maybe that doesn’t even matter. The next wave is here: Multimodal data stack It's not replacing the old one - it's for different users: 🤖 AI, not…
AI isn't just about text and code. What about sounds, videos, and sensors? 🎧🎬🔬 I’ll be at @MLOpsWorld Summit (Oct 6-9 in Austin, TX) sharing how to query inside the file ⚡️ Come nerd out with me in Texas 👋🤠 #MLOpsWorld2025
"90% of code will be AI-written" 🤖 Sounds insane - until you see the pattern. When the building blocks exist, coding is just connecting the dots 🔗 And nobody connects dots better than AI. That’s why AI crushes boilerplate web apps 🛠️ - the blocks are there. And why it…
Spent the weekend reading this. Easily the best agentic book so far. My agent recommends I read more. Any suggestions?
This Google engineer just released a 424-page free book on Agentic Design Patterns. Covers advanced prompt engineering, multi-agent frameworks, RAG, agent tool use and MCP. 100% free with practical code examples.
To stay ahead of the curve in vibe-coding you must rotate IDEs every 2 months Cursor → Claude → Cursor → (???) → repeat. Productivity is temporary, but vibes are forever 😎✨
"Heavy Data": messy, multimodal, and lives in object storage, not databases. This term I first heard from @RobFergusonIII - it nails it! Time to rethink how we manage and query heavy data. datachain.ai/blog/from-big-…
2.5 years into the AI craze, and I continue to firmly believe that if your company wasn’t already interesting/succeeding without AI, then doing “whatever plus AI” isn’t going to save you. For the few that seem this way (eg Cursor), I think their moat is a lot weaker than it…
DataChain enables reproducibility: it versions and tracks dependencies and code. A quick demo from @FullStackML:
Best Python libraries of 2024 – 10th edition! 🔥 Excited to see DataChain ranked at the top in AI/Data reddit.com/r/Python/comme…
Making sense of millions of audio files! An incredible use case for extracting actionable insights from complex data.
A small DataChain video on processing audio data from @huggingface with 🤗 models. We need more tools to do ETLs, analytics, governance, preparation for unstructured data at scale! - stream files from tar or wds archives! 🤯 - enrich, prepare, version, publish datasets 🚀 -…
🚀 datachain
1/N DataChain hit 2000 stars ⭐ on GitHub a week ago. Thanks for your interest and support 🤗 It was built to address those needs and pain points we saw in the DVC community when people have to deal with millions of files (e.g. images, pdfs, audio, etc).
After trending on Hacker News, our open-source project is now trending on GitHub. What’s next - Netflix special? github.com/iterative/data…
Now you can publish datasets from DataChain to @huggingface with a single command! ...because who has time for two? 🚀📚
Datasets + LLMs + Pydantic = DataChain ...now with @huggingface! 💛 DataChain by @DVCorg just added @huggingface support! Create, load, and transform HF Datasets with LLMs easily. - Pydantic for dataset schema - Use your own or public HF Datasets - Run your own or public HF Models
The Post-Modern Data Stack: Unleash the Power of Foundational Models - dataversity.net/the-post-moder… @Dataversity @iterative @DVCorg @FullStackML @JeckertNY