byte_array's profile picture. Founder @Onehousehq, Creator of @apachehudi, Built the World's first #DataLakehouse, Distributed/Data Systems, Linkedin, Uber, Confluent alum. (views are mine)

Vinoth Chandar

@byte_array

Founder @Onehousehq, Creator of @apachehudi, Built the World's first #DataLakehouse, Distributed/Data Systems, Linkedin, Uber, Confluent alum. (views are mine)

置頂

🔥 Meet Quanton — the new query execution engine from Onehouse. 👍 Same Spark & SQL. 📉 At least half the cost. 📈 1.6x-3.6x better ETL price-performance 📊 2.2x-6.5x better Ingest price-performance 👉  Read the full blog here: onehouse.ai/blog/announcin… ⬇️  Download our free…


What we really need from @awscloud #s3 . Do your fancy AI stuff, but just throw us a bone with a basic http/2 networking stack... I have raised this for 3 years now. No dice. #customerobsessed

Cloud object storage performance is deeply influenced by HTTP behavior. In our latest analysis, we show how S3’s dependence on HTTP/1.1 leads to head-of-line blocking, inflated tail latencies, and higher compute costs—while GCS benefits from HTTP/2 multiplexing. 🧵👇



Most people still think “interactive PySpark” means locking yourself into a premium Spark vendor, their catalog, and their pricing model. ⛓️‍💥 Today, that assumption breaks. 👉 onehouse.ai/blog/introduci… We’re bringing Python Notebooks to Onehouse — but the real story isn’t…

byte_array's tweet image. Most people still think “interactive PySpark” means locking yourself into a premium Spark vendor, their catalog, and their pricing model.

⛓️‍💥 Today, that assumption breaks.

👉  onehouse.ai/blog/introduci…

We’re bringing Python Notebooks to Onehouse — but the real story isn’t…

🚀 Apache Hudi 1.1 is out! 🎉 A solid release packed with real engineering improvements that many data teams will feel day-to-day. Thanks to over 800+ commits from 53+ @apachehudi contributors! Blog: hudi.apache.org/blog/2025/11/2… 🔧 Pluggable Table Formats Hudi’s storage layer…

byte_array's tweet image. 🚀 Apache Hudi 1.1 is out! 🎉

A solid release packed with real engineering improvements that many data teams will feel day-to-day. 

Thanks to over 800+ commits from 53+  @apachehudi  contributors!

Blog: hudi.apache.org/blog/2025/11/2… 

🔧 Pluggable Table Formats
Hudi’s storage layer…

🚀 Excited to be back at the Open Source Data Summit 2025 — the third year in a row! If you care about open source, data efficiency, and building systems that last — this is the place to be. 🗓️ Nov 13, 2025 | Free virtual event | 👉 opensourcedatasummit.com Over the past three…

byte_array's tweet image. 🚀 Excited to be back at the Open Source Data Summit 2025 — the third year in a row!

If you care about open source, data efficiency, and building systems that last — this is the place to be.

🗓️ Nov 13, 2025 | Free virtual event | 
👉 opensourcedatasummit.com

Over the past three…

Vinoth Chandar 已轉發

🚀 Launching @apachehudi notebooks — a local, self‑contained environment to learn Hudi end‑to‑end! Includes: • Spark, Hive, MinIO + Jupyter • 5 notebooks: CRUD on COW/MOR; Snapshot/RO/Incremental; Time Travel & CDC; SCD 2/4; Schema Evolution; SQL Procedures Try Hudi quickly…


A common pattern I am seeing that is draining productivity. AI: “Here’s how to do it.” OS or some infra software: “Error: nice try.” Engineer: stuck in a loop going back and forth 😅 AI is a force multiplier, but only if you also use it to learn what to do and how things work

byte_array's tweet image. A common pattern I am seeing that is draining productivity. 

AI: “Here’s how to do it.”
OS or some infra software: “Error: nice try.”
Engineer: stuck in a loop going back and forth 😅

AI is a force multiplier, but only if you also use it to learn what to do and how things work

Loading...

Something went wrong.


Something went wrong.