Data Stack Insights

@DataStackIns

Data Meshes, Lakehouses and Stacks, oh my! Retweeting and sharing content about the data stack.

Apache Parquet Blog Series 2/10 Read Here: open.substack.com/pub/amdatalake…


Know someone learning data engineering? Share this with them:

Hands-on with Apache Iceberg on Your Laptop: Deep Dive with Apache Spark, Nessie, Minio, Dremio… medium.com/data-engineeri…

#DataEngineering

ICEBERG METADATA TABLES

This article will walk you through a hands-on exercise to get familiar with the Iceberg metadata tables.

Read here: drmevn.fyi/Icebergmetadat…

#DataEngineering #ApacheIceberg #DataLakehouse

Data Stack Insights reposted

The Olympics may be over, but the race to turn real-time #data into actionable insights for #retail never ends 👀 Luckily, @systemcraftsman has a tutorial showing you how to swiftly analyze #realtime data using @dremio and Redpanda Connect. Two cool technologies with the cutest…

Data Stack Insights reposted

JOIN US IN DENVER NEXT WEEK

Link to RSVP: bit.ly/lakehouse-link…

#MachineLearning #DataEngineering #ApacheIceberg #Denver #Colorado

Data Stack Insights reposted

RECENT DATA ARCHITECTURE/ENGINEERING/ANALYTICS CONTENT

Apache Iceberg:

> What is Data Lakehouse Table Format?
dremio.com/blog/apache-ic…

> Comparing Iceberg to Other Lakehouse Solutions
dremio.com/blog/comparing…

> Iceberg Migration Guide
dremio.com/blog/migration…

> Hands-on with…

Data Stack Insights reposted

HOW ICEBERG CATALOGS WORK

An Iceberg table has two parts: the data, stored in several Parquet files, and the metadata files that provide the context needed to understand that data as a single table.

The metadata entry point is a file called metadata.json, which tracks the tables…

Data Stack Insights reposted

OPTIMIZING ICEBERG TABLES

One of the things that makes Iceberg queries fast is that the metadata can be used to eliminate files that don't need scanning from the scan plan. This is great, but if the data is not clustered properly or is spread out across many small files, you can still see…
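
The file-elimination idea in this post can be sketched in a few lines of Python. This is a minimal illustration, not Iceberg's actual planner: the `DataFile` class, the `plan_scan` function, the file names, and the value ranges are all hypothetical, and it assumes each file's metadata records only a minimum and maximum for a single integer column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    """Hypothetical per-file metadata: a path plus min/max of one column."""
    path: str
    min_value: int
    max_value: int


def plan_scan(files, lo, hi):
    """Keep only files whose [min, max] range overlaps the filter lo <= value <= hi."""
    return [f for f in files if f.max_value >= lo and f.min_value <= hi]


# Well-clustered data: each file covers a narrow, distinct range of values.
clustered = [
    DataFile("a.parquet", 1, 100),
    DataFile("b.parquet", 101, 200),
    DataFile("c.parquet", 201, 300),
]

# Poorly clustered data: every file spans almost the whole range of values.
scattered = [
    DataFile("x.parquet", 1, 290),
    DataFile("y.parquet", 5, 300),
    DataFile("z.parquet", 2, 295),
]

clustered_plan = plan_scan(clustered, 120, 150)
scattered_plan = plan_scan(scattered, 120, 150)

print([f.path for f in clustered_plan])
print([f.path for f in scattered_plan])
```

With the clustered files, only one file survives planning for the filter 120 to 150. When every file spans nearly the whole value range, nothing can be skipped, which is the situation the post warns about.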

Data Stack Insights reposted

Join us on September 5th at 10am PT for a MinIO x @dremio x @Carahsoft webinar about how modern #datalakes can help government customers solve their modernization initiatives. Register here: hubs.li/Q02Lc2rV0

Basics of Lakehouse Engineering - Apache Iceberg, Nessie (2 hour Course) youtube.com/playlist?list=… #DataEngineering #Nessie #Dremio #ApacheIceberg


Data Stack Insights reposted

Do you use an open table format? If so, how has your experience been? Vote, reply and share! #ApacheIcberg #DeltaLake #ApacheHudi #DataEngineering #DataLakehouse


Data Stack Insights reposted

Open Tables (Apache Iceberg) + Open (Nessie, Polaris, Gravitino) Catalogs = No Vendor Lock-in Lakehouses

Read More: blog.iceberglakehouse.com/open-source-ta…

#DataEngineering #DataLakehouse #DataLake

@dremio @SnowflakeDB @ApacheIceberg

Data Stack Insights reposted

NEW MEDIA ON OPEN SOURCE APACHE ICEBERG CATALOGS

- New Episode of "Datanation" which you can find on iTunes and Spotify

- Substack: open.substack.com/pub/amdatalake…

#ApacheIceberg #DataLakehouse #Dremio #Snowflake #Databricks #DataEngineering

Data Stack Insights reposted

DATA PROFESSIONAL FOLLOW TRAIN

- reply to this tweet, I will follow you
- follow me and everyone else who replies
- retweet to maximize reach of the train

#DataEngineering #DataAnalytics #DataScience #BigData #DataLakehouse #DataLake

Data Stack Insights reposted

8-BIT GAME TRAILER: Quest for the Data Lakehouse #DataLakehouse #DataEngineering #ApacheIceberg #Dremio


Data Stack Insights reposted

FREE YOUR DATA AND GET A FREE BOOK Watch to learn more #DataEngineering #DataAnalytics #DataLakehouse #DataLake #BigData


Data Stack Insights reposted

Let's take a data deep dive 🌊

@AMdatalakehouse explains how to leverage an AWS Glue catalog as a Dremio data source and utilize Apache Superset as the BI tool to create and deliver dynamic and accurate dashboards.

Dive in here ➡️ dremio.com/blog/bi-dashbo…

Data Stack Insights reposted

UNDERSTANDING WHAT MAKES DREMIO FAST

Dremio gives you data warehouse performance on the data lakehouse in a way that significantly reduces your costs and delivers insights faster. In this video, I give a quick overview of the performance side of Dremio. Hands-On Lakehouse…

