#apachedatafusion search results

Paul Dix

@pauldix

Sep 10

A piece I wrote about rebuilding InfluxDB in #rustlang and #ApacheArrow and #ApacheDataFusion is up on InfoQ!

InfoQ

@InfoQ

Sep 10

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust bit.ly/46fmuUK authored by @pauldix, reviewed by @olimpiupop

InfoQ's tweet card. At times, to evolve your product, you need to rebuild it from scratch. The article provides the story behind the rewrite of InfluxDB from scratch using a different programming language - Rust - and...

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust

Source: infoq.com

I am happy to announce 🤡 the winner of of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine, #DuckDB and #Apachedatafusion could not finish it as they get OOM. this is what I call serieus engineering !!! colab.research.google.com/drive/1sqp_V34…

mim_djo's tweet image. I am happy to announce 🤡 the winner of of TPCH-SF30 on the free tier Colab notebook is #Tableau Hyper Engine, #DuckDB and #Apachedatafusion could not finish it as they get OOM.
this is what I call serieus engineering !!!
colab.research.google.com/drive/1sqp_V34…

Mim

@mim_djo

Jul 31, 2023

#apachedatafusion is progressing well, querying 42 million rows using the free tier of Colab is something, other SQL engines will crash, I think it will eventually catch up to #DuckDB colab.research.google.com/drive/1KzofqAW…

mim_djo's tweet image. #apachedatafusion is progressing well, querying 42 million rows using the free tier of Colab is something, other SQL engines will crash, I think it will eventually catch up to #DuckDB
colab.research.google.com/drive/1KzofqAW…

Mim

@mim_djo

Mar 19, 2023

start playing with #Apachedatafusion, the good thing it is mature enough that I could run the same test unmodified, that's a testament to SQL I guess, still memory issues, when I increase the data size colab.research.google.com/drive/1WJ2ICxJ…

mim_djo's tweet image. start playing with #Apachedatafusion, the good thing it is mature enough that I could run the same test unmodified, that's a testament to SQL I guess, still memory issues, when I increase the data size

colab.research.google.com/drive/1WJ2ICxJ…

Mim

@mim_djo

Dec 23

#delta_rs which is basically the standard way for writing delta table using Python ( no spark) just exposed a SQL interface , it was a simple change as it is already based on #apachedatafusion tested using #Microsoftfabric notebook : github.com/djouallah/Fabr…

mim_djo's tweet image. #delta_rs which is basically the standard way for writing delta table using Python ( no spark)
just exposed a SQL interface , it was a simple change as it is already based on #apachedatafusion
tested using #Microsoftfabric notebook :
github.com/djouallah/Fabr…

Pierre Zemb

@PierreZ

Jun 4

Just merged our first SQL query using #ApacheDataFusion at work! 🎉 Integrating it was a fantastic experience, It will soon be part of the core of #Materia. Wrote about why DataFusion is a game-changer for #Rustlang #database systems: pierrezemb.fr/posts/thank-yo…

Jagdish Parihar

@jatin6972

May 24

Exploring #ApacheDataFusion’s Catalog: maps tables across S3/Postgres/Iceberg & turns unresolved plans into typed logical plans. Still wrapping my head around it, but once the catalog clicks the optimizer is wide open to tweak. 🦀🚀 #RustLang #DataEngineering

Andrey Velichkevich

@andreyvelichk

Jun 5

Check out this Kubeflow in-memory data cache solution built on #ApacheArrow & #ApacheDataFusion! It optimizes sharding of #ApacheIceberg tables and enhances #Kuberentes for #GenAI workloads. github.com/kubeflow/commu… 📽️ #KubeCon + #CloudNativeCon talk: youtu.be/s4KAe7AtN7s

andreyvelichk's tweet card. Speed up Your ML Workloads With Kubernetes Powered In-memory Data......

youtube.com

YouTube

Speed up Your ML Workloads With Kubernetes Powered In-memory Data......

Source: youtube.com

Mim

@mim_djo

Apr 8, 2023

let's try simple sorting of a parquet file but with bigger VM, 16 CPU/60 GB RAM #DuckDB 29.6s #ApacheDatafusion 1min 4s #Apachespark : need to configure java, not interested #Polars 1min 6s #Clickhouse 1min 24s #pyarrow 2m 3s github.com/djouallah/parq…

Kris Jenkins (@[email protected])

@krisajenkins

Apr 25

In this week's Developer Voices, Andrew Lamb takes us through #ApacheDataFusion, exploring how this #Rust toolkit shaves years off the prospect of creating a custom database. Fascinating stuff for any data and architecture fans like me. 😁 youtu.be/8QNNCr8WfDM

krisajenkins's tweet card. DataFusion - The Database Building Toolkit (with Andrew Lamb)

youtube.com

YouTube

DataFusion - The Database Building Toolkit (with Andrew Lamb)

Source: youtube.com

Mim

@mim_djo

Jul 15

you can pass #apachedatafusion dataframe directly to delta table python and it works great but ... behind the scene it calls collect() which load the whole damn data into memory, if you have a lot of data to process use daft or duckdb

InfluxData

@InfluxDB

Dec 17

Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show. Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems. 🎙️ bit.ly/3zhJCoL #InfluxDB

InfluxDB's tweet image. Dive into the world of #TimeSeries data with InfluxData's @AndrewLamb1111 on The Data Stack Show.

Discover insights on high cardinality, query performance, #ApacheDataFusion, and the future of data systems.

🎙️ bit.ly/3zhJCoL #InfluxDB

Henry Medina

@CraftyTech

Jul 22

Startups are doubling down on #ApacheDataFusion for its powerful, scalable data processing capabilities! 🚀 With a flexible architecture and open-source community support, it's transforming data analytics. Discover why it's the future! #BigData #TechTrends #Startups

Mim

@mim_djo

Jan 15

I guess that's a big win for #apachedatafusion as it is the engine used by SDF

Felipe O. Carvalho

@_Felipe

Jan 14

The company I joined after VoDa, @SDFLabs, was acquired by @dbt_labs which has been a very exciting turn of events. So I’m now joining dbt Labs together with the SDF team. 🎉

InfluxData

@InfluxDB

Feb 8

🔥 The #ApacheDataFusion community is growing! 🔥 Chicago’s Apache DataFusion Meetup brought together passionate engineers, developers, and data experts to dive into the latest advancements and real-world applications. 🚀 Check out the key discussions and see how DataFusion…

influxdata.com

Apache DataFusion Meetup: Chicago December 2024 Recap

A recap of the Apache DataFusion Chicago meetup in December, 2024.

Source: influxdata.com

lakeFS

@lakeFS

Jun 8, 2023

#ApacheDataFusion @GoogleBigQuery @AzureSynapse #AmazonRedshift @SnowflakeDB @ApachePinot @databricks #ClouderaImpala @StarRocksLabs @ApacheKylin @FireboltHQ

Carles Dijous

@carlesdijous

Jul 23

Why Startups Are Betting Everything on #ApacheDataFusion. Discover why startups like Flarion, LakeSail, and major companies are betting on Apache DataFusion — the Rust-based query engine that's reshaping #data #analytics. bit.ly/4m5nQro

carlesdijous's tweet card. Discover why startups like Flarion, LakeSail, and major companies are betting on Apache DataFusion — the Rust-based query engine that's reshaping data analytics.

Why Startups Are Betting Everything on Apache DataFusion

Source: thenewstack.io

InfluxData

@InfluxDB

Aug 30, 2024

🔎 Learn about our experience implementing StringView in the Rust implementation of Apache Arrow, and integrating it into Apache DataFusion. #ApacheDataFusion #Rustlang #InfluxDB

Andrew Lamb

@andrewlamb1111

Aug 26, 2024

Xiangpeng Hao strikes again: Turns out you can both 1. Implement 🇩🇪 strings in Rust 2. They actually improve end to end performance in @ApacheDataFusio. It takes a lot more than a straightforward naive implementation to do so: influxdata.com/blog/faster-qu…

andrewlamb1111's tweet card. This blog describes our experience implementing StringView in the Rust implementation of Apache Arrow, and integrating it into Apache DataFusion, significantly accelerating string-intensive queries...

Using StringView / German Style Strings to Make Queries Faster: Part 1 - Reading Parquet

Source: influxdata.com

InfluxData

@InfluxDB

Jan 8

🗓️ January 15 🕗 5–7:30 PM ET 📍 Boston, MA Join InfluxData's @AndrewLamb1111 and fellow @ApacheDataFusion enthusiasts for an in-person meetup! Save your spot here: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion

luma.com

Boston DataFusion Meetup · Luma

Apache DataFusion Meetup: Boston https://datafusion.apache.org/ Come meet fellow DataFusion enthusiasts, project contributors, and database aficionados at an…

Source: luma.com

Paul Dix

@pauldix

Sep 10

A piece I wrote about rebuilding InfluxDB in #rustlang and #ApacheArrow and #ApacheDataFusion is up on InfoQ!

InfoQ

@InfoQ

Sep 10

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust bit.ly/46fmuUK authored by @pauldix, reviewed by @olimpiupop

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust

Source: infoq.com

Carles Dijous

@carlesdijous

Jul 23

Why Startups Are Betting Everything on Apache DataFusion

Source: thenewstack.io

Henry Medina

@CraftyTech

Jul 22

Mim

@mim_djo

Jul 15

Andrey Velichkevich

@andreyvelichk

Jun 5

youtube.com

YouTube

Speed up Your ML Workloads With Kubernetes Powered In-memory Data......

Source: youtube.com

Pierre Zemb

@PierreZ

Jun 4

Jagdish Parihar

@jatin6972

May 24

Kris Jenkins (@[email protected])

@krisajenkins

Apr 25

youtube.com

YouTube

DataFusion - The Database Building Toolkit (with Andrew Lamb)

Source: youtube.com

InfluxData

@InfluxDB

Feb 8

influxdata.com

Apache DataFusion Meetup: Chicago December 2024 Recap

A recap of the Apache DataFusion Chicago meetup in December, 2024.

Source: influxdata.com

Mim

@mim_djo

Jan 15

I guess that's a big win for #apachedatafusion as it is the engine used by SDF

Felipe O. Carvalho

@_Felipe

Jan 14

The company I joined after VoDa, @SDFLabs, was acquired by @dbt_labs which has been a very exciting turn of events. So I’m now joining dbt Labs together with the SDF team. 🎉

InfluxData

@InfluxDB

Jan 8

Boston DataFusion Meetup · Luma

Source: luma.com

InfluxData

@InfluxDB

Dec 30

Catch up on the #Database Building Blocks session with #InfluxDB Staff Engineer @andrewlamb1111! 🔎 Dive into @ApacheDataFusio's inner workings and discover the data-centric systems it powers: bit.ly/4ej6qnZ #ApacheDataFusion

InfluxDB's tweet card. Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

youtube.com

YouTube

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

Source: youtube.com

InfluxData

@InfluxDB

Dec 29

.@ApacheDataFusio 43.0.0 is the fastest engine for querying Apache Parquet files in ClickBench! 👏 For the first time, a Rust-based engine holds the top spot: bit.ly/4fZ5RQV #InfluxDB #ApacheDataFusion #Rustlang

influxdata.com

Apache DataFusion is Now the Fastest Single Node Engine for Querying Apache Parquet Files

Best in class performance on Parquet is now available to anyone. DataFusion’s open design lets you start quickly with a full-featured Query Engine, including SQL, data formats, catalogs, and more,...

Source: influxdata.com

InfluxData

@InfluxDB

Dec 27

🎉 Big news! @ApacheDataFusio—the engine behind the rebuilt #InfluxDB—has been named one of @CRN's 10 Coolest #OpenSource Software Tools of 2024! 🚀 Optimized for speed, scalability, & versatility: bit.ly/3VM9kdH via @CRN #ApacheDataFusion

crn.com

The 10 Coolest Open-Source Software Tools Of 2024

The 10 coolest open-source software tools in 2024 include tools for developing AI applications, implementing zero trust networks and managing large volumes of data.

Source: crn.com

Mim

@mim_djo

Dec 23

InfluxData

@InfluxDB

Dec 20

Watch InfluxData Staff Software Engineer @AndrewLamb1111 explain @ApacheDataFusio in detail, describing the types of data-centric systems it's used to build. 🔗bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion

youtube.com

YouTube

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

Source: youtube.com

InfluxData

@InfluxDB

Dec 17

InfluxData

@InfluxDB

Dec 13

Join us for an @ApacheDataFusio Meetup on January 15th in Boston! Meet Influxer @AndrewLamb1111 and other fellow DataFusion enthusiasts, project contributors, and database aficionados: bit.ly/4fCFKPw #InfluxDB #ApacheDataFusion

Boston DataFusion Meetup · Luma

Source: luma.com

InfluxData

@InfluxDB

Dec 5

📅 Hey, Boston, save the date! On January 15th, #InfluxDB Staff Software Engineer @AndrewLamb1111 is co-hosting an @ApacheDataFusio Meetup! Come meet fellow #ApacheDataFusion enthusiasts, project contributors, and database aficionados at this event: bit.ly/4fCFKPw

Boston DataFusion Meetup · Luma

Source: luma.com

InfluxData

@InfluxDB

Nov 1, 2024

Must-see: @AndrewLamb1111's Database Building Blocks Seminar talk explains @ApacheDataFusio in detail and describes the types of data-centric systems it's used to build. Watch here bit.ly/4ej6qnZ #InfluxDB #ApacheDataFusion

youtube.com

YouTube

Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query...

Source: youtube.com