
Data Engineering lab

@data_eng_lab

Google Certified Data Engineer | Plumbing Data pipelines | Python | Cloud | BigData

Data Engineering lab reposted

Every PM should be using Claude Code. So I built a HUGE course for you to learn Claude Code... IN Claude Code! 🔹 Complete guide 🔹 Make PRDs, analyze data, create decks Soon, I'll sell it for $149. For the next 24h: FREE! Follow + RT + comment "CC" & I'll DM it.


Data Engineering lab reposted

Do you need a Canva Pro lifetime subscription? 🚨 If you need Canva Pro for free, then: 1. Like 2. Comment "Canva" 3. Repost (follow me to get it)


Data Engineering lab reposted

Learn for free:
HTML → w3schools.com
CSS → CSS-tricks.com
JavaScript → LearnJavaScript.online
Git and GitHub → LearnGitBranching.js.org
React → React-tutorial.app
API → @Rapid_API
SQL → SQLbolt.com
Blockchain → cryptozombies.io


Data Engineering lab reposted

If you’re struggling to prioritize your to-do list, try this:


What is partitioning in Hadoop MapReduce? The number of partitions equals the number of reducers. Mappers write their output into partitions, and the partition for each record is chosen by a hash function on the key, so the same key always goes to the same partition. Each reducer processes its own partition. #BigData #learnHadoop
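A minimal sketch of the hash-based selection described above. Hadoop's default partitioner works on the key's Java hashCode(); here zlib.crc32 stands in as an assumption, chosen only because it gives the same value on every run. The names partition_for and partition_records are hypothetical.

```python
import zlib


def partition_for(key: str, num_reducers: int) -> int:
    """Pick a partition index for a key (stand-in for a hash partitioner)."""
    # crc32 is an assumption for illustration; Hadoop uses key.hashCode().
    # Masking keeps the value non-negative before taking the modulo.
    return (zlib.crc32(key.encode("utf-8")) & 0x7FFFFFFF) % num_reducers


def partition_records(records, num_reducers):
    """Distribute (key, value) pairs into one bucket per reducer."""
    partitions = [[] for _ in range(num_reducers)]
    for key, value in records:
        partitions[partition_for(key, num_reducers)].append((key, value))
    return partitions


# Both "a" records land in the same bucket, whichever bucket that is.
buckets = partition_records([("a", 1), ("b", 2), ("a", 3)], num_reducers=3)
```

Because the index depends only on the key and the reducer count, every mapper sends a given key to the same reducer without any coordination.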


What is shuffling in Hadoop? In a distributed system, the calculation happens in the mapper processes and the aggregation happens in the reducer processes. Once the mappers finish their calculations, the data is transferred to the reducers; this transfer is called shuffling. #learnHadoop #BigData
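A toy, single-process sketch of where the shuffle sits between the two phases, using a word count. This only illustrates the data flow; real Hadoop moves the data over the network and sorts it by key. All function names are hypothetical.

```python
from collections import defaultdict


def map_phase(lines):
    """Calculation step: emit (word, 1) for every word."""
    for line in lines:
        for word in line.split():
            yield word, 1


def shuffle(pairs):
    """Transfer step: group all values by key so each key reaches one reducer."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(grouped):
    """Aggregation step: sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


counts = reduce_phase(shuffle(map_phase(["big data", "big pipelines"])))
print(counts)  # {'big': 2, 'data': 1, 'pipelines': 1}
```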


How many mappers will be created for a 500 MB file in Hadoop distributed computing? One mapper per block: a 500 MB file is split into 4 blocks in HDFS (the default block size is 128 MB), so 4 mapper processes can be created and each one processes one block. #hadoop #BigData
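The arithmetic behind that answer, as a quick sketch. It assumes one mapper per HDFS block, as the post does; block_sizes_mb is a hypothetical helper name.

```python
def block_sizes_mb(file_size_mb: int, block_size_mb: int = 128) -> list:
    """Split a file size into full blocks plus one smaller final block."""
    full_blocks, remainder = divmod(file_size_mb, block_size_mb)
    return [block_size_mb] * full_blocks + ([remainder] if remainder else [])


blocks = block_sizes_mb(500)
print(blocks)       # [128, 128, 128, 116]
print(len(blocks))  # 4, so 4 mappers under the one-mapper-per-block assumption
```

The last block holds only the remaining 116 MB; it still counts as a block and gets its own mapper.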


Data Engineering lab reposted

Giveaway time! We are going to send a t-shirt and a few goodies to one person who follows @PentesterLab and likes this tweet!! And we are going to give a 1-year voucher to someone who RTs this tweet!


Quick way to count the number of columns in a CSV file:
line = open("test.csv").readline()
print(len(line.split(",")))  # comma separated
What is your way of doing it? #pythonprogramming #DailyOneTip #Python
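One possible answer to the question above: splitting on "," miscounts when a field contains a quoted comma, so a sketch with the standard csv module may be safer. count_columns is a hypothetical helper, and the sample data is made up for illustration.

```python
import csv
import io


def count_columns(csv_file) -> int:
    """Count the fields in the first row of an open CSV file object."""
    reader = csv.reader(csv_file)
    first_row = next(reader, [])  # empty file -> no columns
    return len(first_row)


# In-memory sample with a quoted comma in the third header field.
sample = io.StringIO('id,name,"city, country"\n1,Ann,"Paris, France"\n')
print(count_columns(sample))  # 3 (a plain split on "," would give 4)
```

For a real file, something like `with open("test.csv", newline="") as f: count_columns(f)` should work the same way.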


Who should I follow to get the latest data engineering, Python, and cloud content? Any suggestions? @saviomartin7 @oliverjumpertz @DailyDevTips1 @VittoStack @sunilc_ @BigData_Fr @heyOnuoha #dataengineering #Python #GoogleCloud


Data Engineering lab reposted

You hear folks talk about BigQuery, but what actually makes it so unique and powerful? This @GoogleCloudTech post from @pvergadia is the best summary I've seen. Here's why BQ makes other data warehouses look incomplete or inefficient by comparison. cloud.google.com/blog/topics/de…


Wrote my first Medium article, about transferring large data files from S3 to GCS using Airflow medium.com/swlh/transfer-…

