kishan_py's profile picture. 🛠️

Kishan

@kishan_py

🛠️

Kishan reposted

Event data is the largest volume data you'll tackle as a data engineer! In this free four hour course, I'll go over everything I learned at Meta to tackle large volume event data! Enjoy! Please repost to spread the knowledge! youtube.com/watch?v=DQefW9…

EcZachly's tweet card. How Meta Models Big Volume Event Data - Full 4 Hour Course - DataEx...

youtube.com

YouTube

How Meta Models Big Volume Event Data - Full 4 Hour Course - DataEx...


Kishan reposted

Most people focus on optimizing the SELECT, but performance starts at CREATE. 5 simple tips for designing a better database. 1. Normalize Your Data • First Normal Form (1NF): Each column should contain atomic (indivisible) values, and each column should contain values of a…

RaulJuncoV's tweet image. Most people focus on optimizing the SELECT, but performance starts at CREATE. 

5 simple tips for designing a better database.

1. Normalize Your Data

• First Normal Form (1NF): Each column should contain atomic (indivisible) values, and each column should contain values of a…

Kishan reposted

We launched a Kaggle competition as a part of the LLM Zoomcamp! Participate and solve high school mathematical problems with LLMs. 🔸 Learn on our free Zoomcamp 🔸 Apply your knowledge at this competition 🔸 Get extra points for the course Join now: kaggle.com/competitions/l…

Al_Grigor's tweet image. We launched a Kaggle competition as a part of the LLM Zoomcamp!

Participate and solve high school mathematical problems with LLMs.

🔸 Learn on our free Zoomcamp
🔸 Apply your knowledge at this competition
🔸 Get extra points for the course

Join now: kaggle.com/competitions/l…

Kishan reposted

Scenario where you can use Delete and Truncate command: 1. “Cleaning Up Data for Specific Rows” - Suppose you are managing a customer database and want to remove records of customers who have unsubscribed from your mailing list. Use the ‘DELETE’ statement to remove specific…


Kishan reposted

Latency is the new downtime in distributed systems. But these 8 must-know strategies can help reduce latency. 🧵

ProgressiveCod2's tweet image. Latency is the new downtime in distributed systems.

But these 8 must-know strategies can help reduce latency.

🧵

Kishan reposted

My favorite stack to build a data analytics product - Apache Spark (for processing) - Amazon S3 (for storage) - Apache Iceberg (for metadata) - Apache Airflow (for scheduling) - Apache Superset (for visualization) - Great Expectations (for data quality) #dataengineering


Kishan reposted

Good SQL brings more Orders. A common mistake that turns LEFT JOIN into an INNER JOIN. In a LEFT JOIN, all records from the left table and the matching records from the right table are included. So far, so good, but there is a catch. If you add a condition in the WHERE clause…

RaulJuncoV's tweet image. Good SQL brings more Orders.

A common mistake that turns LEFT JOIN into an INNER JOIN.

In a LEFT JOIN, all records from the left table and the matching records from the right table are included.

So far, so good, but there is a catch.

If you add a condition in the WHERE clause…

Kishan reposted

The first cohort of the LLM Zoomcamp has just started! 🎉 Join Module 1 as we explore RAG and Elastic Search with OpenAI. Homework: you'll need to calculate the price for your prompts. Access materials for Module 1 here: github.com/DataTalksClub/… 1/2

Al_Grigor's tweet image. The first cohort of the LLM Zoomcamp has just started! 🎉

Join Module 1 as we explore RAG and Elastic Search with OpenAI.

Homework: you'll need to calculate the price for your prompts.

Access materials for Module 1 here: github.com/DataTalksClub/…

1/2

Dashboard Data Pipeline

kishan_py's tweet image. Dashboard                               Data Pipeline
kishan_py's tweet image. Dashboard                               Data Pipeline

Kishan reposted

If I had to start learning #dataengineering all over again, I’d follow this plan, mostly in order: - Learn SQL — Aggregations with GROUP BY — Joins (INNER, LEFT, FULL OUTER) — Window functions — Common table expressions - Learn about data modeling — read about data…


Kishan reposted

Module 1 of LLM Zoomcamp is ready! In this module, you'll set up a RAG flow for searching in a knowledge base and retrieving important contexts for LLM. Watch it here: youtube.com/watch?v=Q75JgL… Code and notes: github.com/DataTalksClub/… See you on Monday when the course starts!

Al_Grigor's tweet image. Module 1 of LLM Zoomcamp is ready!

In this module, you'll set up a RAG flow for searching in a knowledge base and retrieving important contexts for LLM.

Watch it here: youtube.com/watch?v=Q75JgL…

Code and notes: github.com/DataTalksClub/…

See you on Monday when the course starts!

Kishan reposted

The best place to learn k8s in Hindi 🤓 • Kubernetes Architecture ✅ • Cluster, Pods, Containers ✅ • Side Cars, initContainers, Kyverno, Trivy ✅ • Deployments ✅ • Services, Live application deployment using Custom Domain ✅ • Cert-Manager, Gateway API, & Namespaces ✅

prerit_munjal's tweet image. The best place to learn k8s in Hindi 🤓

• Kubernetes Architecture ✅
• Cluster, Pods, Containers ✅
• Side Cars, initContainers, Kyverno, Trivy ✅
• Deployments ✅
• Services, Live application deployment using Custom Domain ✅
• Cert-Manager, Gateway API, & Namespaces ✅

Kishan reposted

How does Docker work? The diagram below shows the architecture of Docker and how it works when we run “docker build”, “docker pull” and “docker run”. There are 3 components in Docker architecture: 🔹 Docker client The docker client talks to the Docker daemon. 🔹…


Kishan reposted

This Golang project will get you hired, TRUST. 10 hours of distributed dopamine. youtube.com/watch?v=bymQak…

anthdm's tweet card. How To Build A Complete Distributed File Storage In Golang

youtube.com

YouTube

How To Build A Complete Distributed File Storage In Golang


Kishan reposted

SQL Indexing in a Nutshell:


Kishan reposted

Data engineers often become bored of data engineering! After a while of SQL + Python + airflow, you start thinking all pipelines are the same and it’s copy and paste work. Some strategies to help with this: - become more end-to-end Maybe that means building a dashboard. Maybe…


Kishan reposted

Advanced SQL Interviews LOVE to test you on Window Functions. Here's 4 real SQL interview questions from Uber, Google, Spotify & Wayfair to practice SQL window functions like: • LEAD() • LAG() • RANK() • DENSE_RANK() • ROW_NUMBER() • NTH_VALUE() 🧵👇

NickSinghTech's tweet image. Advanced SQL Interviews LOVE to test you on Window Functions. 

Here's 4 real SQL interview questions from Uber, Google, Spotify & Wayfair to practice SQL window functions like:
 • LEAD()
• LAG()
• RANK() 
• DENSE_RANK()
• ROW_NUMBER()
• NTH_VALUE()

🧵👇

Kishan reposted

Data engineering SQL interviews always have a silly RANK question. Should you use RANK, DENSE RANK, or ROW NUMBER? Here’s a refresher! For more free data engineering interview, subscribe to my blog: blog.dataengineer.io #dataengineering


Kishan reposted

A data engineer who is unfamiliar with SQL queries and optimization will have significant difficulties Since SparkSQL has increased the popularity of SQL, having a solid understanding of querying datasets will be useful to you for a very long time #Spark #DataEngineering #SQL


Kishan reposted

Types of projects that I worked as Freelance Data Engineer 👇🏻 This is a common question I get about the type of work you get as a freelance data engineer and it varies a lot 📍Web Scraping: When I started my journey as a freelancer I didn't get projects related to core Data…

parmardarshil07's tweet image. Types of projects that I worked as Freelance Data Engineer 👇🏻

This is a common question I get about the type of work you get as a freelance data engineer and it varies a lot 

📍Web Scraping:
When I started my journey as a freelancer I didn't get projects related to core Data…

Loading...

Something went wrong.


Something went wrong.