This repo has all the resources you need to become an amazing analytics engineer!
If you are new to analytics engineering, start by understanding what the role entails: analytics engineers sit at the intersection of data engineering and data analysis, building clean, tested, and documented data models that power business decisions.
For more applied learning:
- Check out the projects section for hands-on examples!
- Check out the interviews section for advice on how to pass analytics engineering interviews!
- Check out the books section for a list of high quality analytics engineering books
- Check out the communities section for a list of high quality analytics engineering communities to join
- Check out the newsletters section to learn via email
Great list of over 20 books
Top 3 must read books are:
- Analytics Engineering with SQL and dbt by Rui Machado & Hélder Russa
- The Data Warehouse Toolkit by Ralph Kimball
- Fundamentals of Data Engineering by Joe Reis & Matt Housley
Great list of communities to join:
Top must-join communities for Analytics Engineering:
- Transformation & Modeling
- Data Warehouses
- Semantic / Metrics Layers
- Analytics / Visualization
- Data Quality & Testing
- Data Integration / ELT
- Orchestration
- Data Cataloging & Documentation
- Data Lineage
- Education Companies
- LLM Application Libraries
- dbt Developer Blog
- Locally Optimistic
- Benn Stancil's Blog
- Netflix Data Blog
- Airbnb Data Blog
- Spotify Engineering (Data)
- Uber Data Blog
- GitLab Data Team Handbook
- Brooklyn Data Co. Blog
- Databricks Blog
- Snowflake Blog
- The Data Warehouse Insiders
- Simon Späti's Data Blog
- Start Data Engineering
- Preset Blog
- Hex Blog
- A Five-Layered Business Intelligence Architecture
- Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics
- The Data Lakehouse: Data Warehousing and More
- Big Data Quality: A Data Quality Profiling Model
- Tidy Data by Hadley Wickham
- Data Engineering Whitepapers Collection
- The Metrics Layer by dbt Labs
- Building a Mature Analytics Workflow
Here's a mostly comprehensive list of analytics engineering creators:
| Name | YouTube Channel | Follower Count |
|---|---|---|
| Zach Wilson | Data with Zach | 150,000+ |
| Seattle Data Guy | Seattle Data Guy | 100,000+ |
| Kahan Data Solutions | Kahan Data Solutions | 10,000+ |
| Data with Baraa | Data with Baraa | 195,000+ |
| TECHTFQ by Thoufiq | TECHTFQ by Thoufiq | 100,000+ |
| Alex Freberg | Alex The Analyst | 100,000+ |
| SQLBI | SQLBI | 100,000+ |
| Guy in a Cube | Guy in a Cube | 100,000+ |
| Advancing Analytics | Advancing Analytics | 10,000+ |
| Ankit Bansal | Ankit Bansal | 10,000+ |
| ByteByteGo | ByteByteGo | 1,000,000+ |
| Darshil Parmar | Darshil Parmar | 100,000+ |
| Andreas Kretz | Andreas Kretz | 100,000+ |
| Name | LinkedIn Profile | Follower Count |
|---|---|---|
| Zach Wilson | Zach Wilson | 400,000+ |
| Benn Stancil | Benn Stancil | 30,000+ |
| Claire Carroll | Claire Carroll | 20,000+ |
| Seattle Data Guy | Ben Rogojan | 100,000+ |
| Tristan Handy | Tristan Handy | 50,000+ |
| Emilie Schario | Emilie Schario | 10,000+ |
| Alex Freberg | Alex Freberg | 100,000+ |
| Darshil Parmar | Darshil Parmar | 100,000+ |
| Andreas Kretz | Andreas Kretz | 100,000+ |
| Marc Lamberti | Marc Lamberti | 50,000+ |
| Anna Filippova | Anna Filippova | 10,000+ |
| Simon Späti | Simon Späti | 10,000+ |
| Joseph Machado | Joseph Machado | 10,000+ |
| Hugo Lu | Hugo Lu | 5,000+ |
| Li Yin | Li Yin | 10,000+ |
| Mehdi Ouazza | Mehdi Ouazza | 20,000+ |
| Name | X/Twitter Profile | Follower Count |
|---|---|---|
| Tristan Handy | @jthandy | 30,000+ |
| Benn Stancil | @bennstancil | 20,000+ |
| Zach Wilson | @EcZachly | 30,000+ |
| Seattle Data Guy | @SeattleDataGuy | 10,000+ |
| Claire Carroll | @claborjaires | 10,000+ |
| dbt Labs | @daborbt_labs | 50,000+ |
| Emilie Schario | @emaborilieschario | 10,000+ |
- The Analytics Engineering Podcast
- The Data Engineering Show
- Data Engineering Podcast
- Analytics Power Hour
- The Data Coffee Break Podcast
- The Datastack Show
- Catalog & Cocktails
- DataTopics
- Drill to Detail with Mark Rittman
- The Data Chief by ThoughtSpot
- Monday Morning Data Chat
- The Joe Reis Show
- Super Data Science: ML & AI Podcast with Jon Krohn
- MLOps.community Podcast
Great list of 20+ newsletters
Top must follow newsletters for analytics engineering:
- Learn Analytics Engineering
- The Analytics Engineering Roundup by dbt Labs
- DataEngineer.io Newsletter
- Benn Stancil's Newsletter
- Locally Optimistic Blog
- dbt Glossary
- Data Engineering Vault
- Airbyte Data Glossary
- Data Engineering Wiki by Reddit
- Secoda Glossary
- Glossary Databricks
- Data Engineering Glossary by Dagster
- Kimball Group Techniques
- How We Structure Our dbt Projects (dbt Labs)
- dbt Best Practices
- dbt Style Guide by dbt Labs
- GitLab dbt Guide
- Cumulative Table Design
- Kimball Dimensional Modeling Techniques
- The Little Book of Pipelines
- Data Developer Platform Architecture
- Medallion Architecture by Databricks
- The Metrics Layer Explained
- DataExpert.io course use code HANDBOOK10 for a discount!
- dbt Learn (dbt Fundamentals) — Free official dbt course
- Technical Freelancer Academy Use code zwtech for a discount!
- Analytics Engineering with dbt (Coursera)
- The Complete dbt Bootcamp (Udemy)
- DataCamp
- LearnDataEngineering.com
- Data Engineering Zoomcamp by DataTalksClub
- IBM Data Engineering for Everyone (edX)
- Rock the JVM — Teaches Spark, Flink, and more
- DeepLearning.AI Data Engineering Professional Certificate
- dbt Analytics Engineering Certification
- Google Cloud Certified - Professional Data Engineer
- Snowflake SnowPro Core Certification
- Snowflake SnowPro Advanced: Data Engineer
- Databricks - Data Engineer Associate
- Microsoft DP-203: Data Engineering on Microsoft Azure
- Microsoft DP-600: Fabric Analytics Engineer Associate
- AWS Certified Data Engineer - Associate
- Tableau Desktop Specialist
- Google Looker Business Analyst Certificate