
Kishan
@kishan_py
🛠️
Bạn có thể thích
Event data is the largest volume data you'll tackle as a data engineer! In this free four hour course, I'll go over everything I learned at Meta to tackle large volume event data! Enjoy! Please repost to spread the knowledge! youtube.com/watch?v=DQefW9…
youtube.com
YouTube
How Meta Models Big Volume Event Data - Full 4 Hour Course - DataEx...
Most people focus on optimizing the SELECT, but performance starts at CREATE. 5 simple tips for designing a better database. 1. Normalize Your Data • First Normal Form (1NF): Each column should contain atomic (indivisible) values, and each column should contain values of a…

We launched a Kaggle competition as a part of the LLM Zoomcamp! Participate and solve high school mathematical problems with LLMs. 🔸 Learn on our free Zoomcamp 🔸 Apply your knowledge at this competition 🔸 Get extra points for the course Join now: kaggle.com/competitions/l…

Scenario where you can use Delete and Truncate command: 1. “Cleaning Up Data for Specific Rows” - Suppose you are managing a customer database and want to remove records of customers who have unsubscribed from your mailing list. Use the ‘DELETE’ statement to remove specific…
Latency is the new downtime in distributed systems. But these 8 must-know strategies can help reduce latency. 🧵

My favorite stack to build a data analytics product - Apache Spark (for processing) - Amazon S3 (for storage) - Apache Iceberg (for metadata) - Apache Airflow (for scheduling) - Apache Superset (for visualization) - Great Expectations (for data quality) #dataengineering
Good SQL brings more Orders. A common mistake that turns LEFT JOIN into an INNER JOIN. In a LEFT JOIN, all records from the left table and the matching records from the right table are included. So far, so good, but there is a catch. If you add a condition in the WHERE clause…

The first cohort of the LLM Zoomcamp has just started! 🎉 Join Module 1 as we explore RAG and Elastic Search with OpenAI. Homework: you'll need to calculate the price for your prompts. Access materials for Module 1 here: github.com/DataTalksClub/… 1/2

If I had to start learning #dataengineering all over again, I’d follow this plan, mostly in order: - Learn SQL — Aggregations with GROUP BY — Joins (INNER, LEFT, FULL OUTER) — Window functions — Common table expressions - Learn about data modeling — read about data…
Module 1 of LLM Zoomcamp is ready! In this module, you'll set up a RAG flow for searching in a knowledge base and retrieving important contexts for LLM. Watch it here: youtube.com/watch?v=Q75JgL… Code and notes: github.com/DataTalksClub/… See you on Monday when the course starts!

The best place to learn k8s in Hindi 🤓 • Kubernetes Architecture ✅ • Cluster, Pods, Containers ✅ • Side Cars, initContainers, Kyverno, Trivy ✅ • Deployments ✅ • Services, Live application deployment using Custom Domain ✅ • Cert-Manager, Gateway API, & Namespaces ✅

How does Docker work? The diagram below shows the architecture of Docker and how it works when we run “docker build”, “docker pull” and “docker run”. There are 3 components in Docker architecture: 🔹 Docker client The docker client talks to the Docker daemon. 🔹…
This Golang project will get you hired, TRUST. 10 hours of distributed dopamine. youtube.com/watch?v=bymQak…
youtube.com
YouTube
How To Build A Complete Distributed File Storage In Golang
SQL Indexing in a Nutshell:
Data engineers often become bored of data engineering! After a while of SQL + Python + airflow, you start thinking all pipelines are the same and it’s copy and paste work. Some strategies to help with this: - become more end-to-end Maybe that means building a dashboard. Maybe…
Advanced SQL Interviews LOVE to test you on Window Functions. Here's 4 real SQL interview questions from Uber, Google, Spotify & Wayfair to practice SQL window functions like: • LEAD() • LAG() • RANK() • DENSE_RANK() • ROW_NUMBER() • NTH_VALUE() 🧵👇

Data engineering SQL interviews always have a silly RANK question. Should you use RANK, DENSE RANK, or ROW NUMBER? Here’s a refresher! For more free data engineering interview, subscribe to my blog: blog.dataengineer.io #dataengineering
A data engineer who is unfamiliar with SQL queries and optimization will have significant difficulties Since SparkSQL has increased the popularity of SQL, having a solid understanding of querying datasets will be useful to you for a very long time #Spark #DataEngineering #SQL
Types of projects that I worked as Freelance Data Engineer 👇🏻 This is a common question I get about the type of work you get as a freelance data engineer and it varies a lot 📍Web Scraping: When I started my journey as a freelancer I didn't get projects related to core Data…

United States Xu hướng
- 1. $UNI 1,706 posts
- 2. Rempe 1,517 posts
- 3. Vesia 1,023 posts
- 4. Pacers 10.5K posts
- 5. Bannon 13.8K posts
- 6. Chuck 88.6K posts
- 7. Logan Wilson 1,441 posts
- 8. Dragon 89K posts
- 9. #USWNT N/A
- 10. #LightningStrikes N/A
- 11. #BoltUp 1,597 posts
- 12. #Skol 2,140 posts
- 13. Emil Heineman N/A
- 14. Georgia State 4,086 posts
- 15. #LOVERGIRL 6,194 posts
- 16. Macklin N/A
- 17. Jim Marshall N/A
- 18. Aaron Jones 2,664 posts
- 19. Sharks 8,315 posts
- 20. Siakam 1,056 posts
Bạn có thể thích
-
Mitchell van Rijkom
@MitchellvRijkom -
TANISHQ BANSAL
@istealtheworld -
Chi'ka
@Esther_Chika01 -
Subho
@subhoghosh_ -
Braxten Boyd
@boyd_braxten -
priestly_priest
@noahson08 -
echo
@jherico_nacis -
Gaurav | Data Enginner
@Gaurav_R1408 -
𝓐𝔃𝓾𝓶𝓪
@azzumar_aydid -
Gaurav Chatterjee
@iamgaurav09 -
Chiagozie Stephen
@StEpHeN_CoDeS -
Sumant Thacker
@sumant_rt -
Naveen Padidapu
@NaveenPadidapu
Something went wrong.
Something went wrong.