Small Data SF: Think Small, Build Big
@smalldatasf
Rethinking AI & data from the ground up. A community for those building smarter AI and data systems with real workloads, not petabytes.
The Steve Ballmer of databases @glcst, seen @smalldatasf
The yearly @smalldatasf gathering is a breath of fresh air. One interesting take away: Small data is now pretty big. A single node machine can now outperform the benchmarks that led to creation of Dremel/Big Query. Slide credit @jrdntgn
Airbyte 2.0! We're excited to see the latest generation of Airbyte and to chat with the team behind it at Small Data SF. Join us!
We're sponsoring @smalldatasf and the philosophy really resonates: not everything deserves a round-trip to the cloud. With Airbyte 2.0, you can run data planes wherever your data lives - cloud, on-prem, or hybrid. Plan to stop by our booth if you're there so you can talk to…
All set for @smalldatasf 2025 ✈️ Flights, Airbnb, and access confirmed. Can’t wait to learn what’s next in the data world, and how simplicity keeps winning. If you’re going, let’s connect. Say hi if you see me, we can talk #Data, #Arc, @duckdb , @motherduck, and more. I’ll be…
Scott Haines has the scars to prove it: four big projects, four painful lessons, and the surprising power of thinking small. His story isn’t just cautionary—it’s a playbook for anyone who wants resilient systems and saner engineering. Hear it first-hand at Small Data SF.
🚀 Don’t miss Zulf's session at @smalldatasf! Hands-on lab: ⚡ Capture data in real time 🦆 Stream into MotherDuck 🔄 End-to-end in minutes This is the perfect opportunity for data engineers to experience real-time data streaming without the complexity. #smalldatasf
Sometimes you need to crash, burn, and refactor to really see the value in “small first.” Scott Haines will unpack four hard lessons where embracing smallness didn’t just rescue projects, but led to better product, better performance, better dev happiness. If you want avoidable…
What if instead of chasing larger models, we chased smarter ones—models that do more with less, generalize better, and are easier to deploy? Shelby Heinecke will share what "smaller models" truly mean and how they unlock impact in real settings. If you're building AI and worried…
We ran a bench mark of DuckDB vs Spark and found that data sets under 20 GBs ran about 100 times faster on DuckDB than they did on Apache Spark! You don't need a multi-node cluster for your smaller data sets! This benchmark uses plain parquet files and COUNT distinct to truly…
Are you ready to explore when Apache Spark might not be the best tool for your data projects? Join us at Small Data SF on November 4 and 5 in San Francisco for an insightful talk by Holden Karau, a prominent figure in the world of big data.
Small data means breaking existing paradigms, simplifying, speeding things up and lowering costs This insight from #smalldatasf 2024 is just a taste of what's coming in 2025!
🚀 Small News! Estuary is a Gold Sponsor of @smalldatasf 2025, happening Nov 4-5!! Two days of hands-on workshops, talks, and community, all centered around efficient, local-first development and smarter ways to work with data and AI. See you there! #smalldatasf
"Don't duck up the numbers: Where AI hype meets BI reality." This is going to be a fun panel with Barr Moses (Monte Carlo), Barry McCardel (Hex), Colin Zima (Omni) and Tristan Handy (dbt). Join us!
Small data slaps. Tagging all our Spark friends out there.
Tag someone who needs to hear this: Small Data slaps. 💥 Oh, and join us Nov 4-5 at Small Data SF to learn why small data is smart!
Dr Shelby Heinecke, who leads AI research at Salesforce, will speak at Small Data SF November 5th on how small models don't need more parameters, they just need better data. She'll speak about the highly efficient xLAM family of small action models her team built at Salesforce.
Miss Small Data SF in 2024? Catch out our highlight reel below and learn why one attendee said: "Small Data SF was on another level. The lineup was unbeatable, the content was razor-sharp, and the people were next-level inspiring."
United States Tendencias
- 1. $NVDA 75.6K posts
- 2. Jensen 24.1K posts
- 3. Peggy 38.1K posts
- 4. GeForce Season 5,729 posts
- 5. NASA 54K posts
- 6. #ใครในกระจกEP5 9,426 posts
- 7. Sumrall 2,383 posts
- 8. Martha 19.7K posts
- 9. #WickedWaysToMakeABuck N/A
- 10. Stargate 7,019 posts
- 11. Saba 10.9K posts
- 12. #WWESuperCardNewSeason 1,128 posts
- 13. Arabic Numerals 3,952 posts
- 14. Kwame 6,513 posts
- 15. Comey 57.5K posts
- 16. Poverty 53.8K posts
- 17. #2Kgiveaway 1,130 posts
- 18. Sam Harris N/A
- 19. Jason Crow 3,378 posts
- 20. EPS of $1.30 N/A
Something went wrong.
Something went wrong.