Vinoth Chandar
@byte_array
Founder @Onehousehq, Creator of @apachehudi, Built the World's first #DataLakehouse, Distributed/Data Systems, Linkedin, Uber, Confluent alum. (views are mine)
Talvez você curta
🔥 Meet Quanton — the new query execution engine from Onehouse. 👍 Same Spark & SQL. 📉 At least half the cost. 📈 1.6x-3.6x better ETL price-performance 📊 2.2x-6.5x better Ingest price-performance 👉 Read the full blog here: onehouse.ai/blog/announcin… ⬇️ Download our free…
🚀 Launching @apachehudi notebooks — a local, self‑contained environment to learn Hudi end‑to‑end! Includes: • Spark, Hive, MinIO + Jupyter • 5 notebooks: CRUD on COW/MOR; Snapshot/RO/Incremental; Time Travel & CDC; SCD 2/4; Schema Evolution; SQL Procedures Try Hudi quickly…
A common pattern I am seeing that is draining productivity. AI: “Here’s how to do it.” OS or some infra software: “Error: nice try.” Engineer: stuck in a loop going back and forth 😅 AI is a force multiplier, but only if you also use it to learn what to do and how things work
No longer just “faster than”; Now, @apachehudi is also “faster on” #apacheiceberg . Thanks to @apachextable
[Blog] Struggling with Apache Iceberg performance when your data dimensions get too hot? 🔥🌡️ Frequent updates and deletes in Iceberg can lead to a "chilly meltdown," forcing a tough choice between fast writes and efficient reads. 🥶 But what if you didn't have to compromise? 🤔…
🚀 Big news for Hudi Community! We're back with the Apache Hudi Meetup | ASIA (Chinese), and this time we're hosted by the incredible team at @JD_Corporate (京东) ! Get ready to explore the "Next-Generation Lakehouse: The Intelligent Future Engine". We have a packed agenda…
💰🔥 Spark’s default autoscaler = higher latency scaling up, wasted $$ scaling down. Why? It’s based on task backlog, not actual resource usage. Costly flaw. Result: 🐢 Slow scale-ups (e.g. too few Kafka partitions during spikes) 🐢 Slow scale-downs (e.g. many tiny tasks →…
📊 If you’re using Apache Spark on EMR (or anywhere really), you need better visibility into where your compute spend is going. At Onehouse, we kept seeing the same pattern across Hudi and non-Hudi users alike: 👉 Jobs were under-optimized 👉 Executors were sitting idle 👉…
youtube.com
YouTube
Spark Analyzer Demo: Measure and evaluate your Apache Spark™ Applic...
🗞️ OLD NEWS: but worth a shout-out. Keys are optional in Hudi ... One of Hudi’s core goals was to remove friction from building data lakes. That’s why — for a long time now — Hudi has quietly supported auto-generation of record keys. No need to think up a key field just to get…
United States Tendências
- 1. New York 1.3M posts
- 2. #hazbinhotelseason2 29.9K posts
- 3. #RadioStatic 6,568 posts
- 4. Virginia 554K posts
- 5. Mamdani 1.69M posts
- 6. Alastor 20.1K posts
- 7. #Talus_Labs 1,009 posts
- 8. Van Jones 2,902 posts
- 9. Prop 50 195K posts
- 10. #QuestPit 8,062 posts
- 11. #XLOV 14.6K posts
- 12. AND SO IT BEGINS 11.7K posts
- 13. But the Lord 8,731 posts
- 14. Cyrene 119K posts
- 15. TURN THE VOLUME UP 26.9K posts
- 16. Sanford N/A
- 17. Ty Lue 1,003 posts
- 18. RIP NYC 24.4K posts
- 19. Jay Jones 109K posts
- 20. Clippers 9,656 posts
Talvez você curta
-
Apache Hudi
@apachehudi -
Onehouse
@Onehousehq -
Reynold Xin
@rxin -
Open Lakehouse Community
@open_lakehouse -
StarTree
@startreedata -
Chris
@criccomini -
Shirshanka Das
@shirshanka -
Delta Lake
@DeltaLakeOSS -
Robert Metzger
@rmetzger_ -
Stephan Ewen
@StephanEwen -
Gwen (Chen) Shapira
@gwenshap -
Matthias J. Sax 🦦
@MatthiasJSax -
ABC
@Ubunta -
Piotr Nowojski
@PiotrNowojski -
Timeplus
@timeplusdata
Something went wrong.
Something went wrong.