| Amazon Bedrock |
Amazon Bedrock |
| Amazon Neptune |
A fast, fully managed database service powering graph use cases such as identity graphs, knowledge graphs, and fraud detection. |
| Amazon Redshift |
Amazon Redshift |
| Amazon SageMaker |
Amazon SageMaker |
| Ansible |
An open-source automation tool primarily used for configuration management, application deployment and orchestration |
| Apache Airflow |
Apache Airflow |
| Apache Beam |
Apache Beam |
| Apache Flink |
Apache Flink |
| Apache Flume |
Apache Flume |
| Apache HBase |
Apache HBase |
| Apache Hive |
Apache Hive |
| Apache Iceberg |
Apache Iceberg |
| Apache Kafka |
Apache Kafka |
| Apache Spark |
Apache Spark |
| Apache Spark optimizations |
Apache Spark optimizations |
| Apache Superset |
Apache Superset |
| AWS |
AWS |
| AWS Glue |
A serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources |
| AWS Lambda |
AWS Lambda |
| AWS services |
List of AWS services and their short descriptions |
| Azure |
Azure |
| Azure Data Factory |
Azure Data Factory |
| Azure Databricks |
Azure Databricks |
| Azure DevOps |
Azure DevOps |
| Azure HDInsight |
Azure HDInsight |
| Azure Purview |
A unified data governance solution that helps organizations discover, manage, and govern their data estate across on-premises, multi-cloud, and SaaS environments |
| Azure services |
List of Azure services |
| Azure Synapse Analytics |
Azure Synapse Analytics |
| Big Data Engineering |
Big Data engineering concepts and tools. |
| Data pipelines |
Data pipelines basics |
| Data Preparation for Machine Learning |
Data Preparation for Machine Learning |
| Data Vault architecture |
Data Vault architecture |
| Data Warehouse Modeling |
data warehouse modeling |
| Data Warehousing |
Data Warehousing Architecture |
| Databricks AutoML |
Databricks AutoML |
| Databricks Data Modeling Strategies |
Databricks Data Modeling Strategies |
| Databricks data platform and AI architecture roles |
Databricks data platform and AI architecture roles |
| Databricks Data Warehousing |
Databricks Data Warehousing |
| Databricks Generative AI Application Deployment and Monitoring |
Databricks Generative AI Application Deployment and Monitoring |
| Databricks Generative AI Application Development |
Databricks Generative AI Application Development |
| Databricks Machine Learning |
Databricks Machine Learning |
| Databricks Mosaic AI |
Databricks Mosaic AI |
| Databricks Performance Optimization |
Databricks Performance Optimization |
| dbt |
dbt |
| Delta Lake |
A flexible storage pattern that is typically used for storing massive amounts of raw data in its native format |
| DynamoDB |
DynamoDB |
| Elasticsearch |
A search engine based on Apache Lucene, a free and open-source search engine. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. |
| FastAPI |
A high-performance web framework for building HTTP-based service APIs in Python |
| Fivetran |
Fivetran |
| GCP services |
Google Cloud Platform services |
| General |
General programming concepts, design patterns |
| General Data Engineer interview |
General, behavioral, communication, collaboration, problem solving from data engineering perspective |
| Golang |
Golang |
| Google BigQuery |
Google BigQuery |
| Google Cloud Platform |
Google Cloud Platform |
| Grafana |
A multi-platform open source analytics and interactive visualization web application. |
| Hadoop |
Hadoop |
| Haystack |
Haystack |
| Jenkins |
An open source automation server. It helps automate the parts of software development related to building, testing, and deploying |
| Jetpack Compose |
Basics |
| Kotlin Basics |
Basic syntax, functions, variables, classes, conditional expressions, loops, ranges, collections, nullable values |
| Kusto Query Language KQL |
Kusto Query Language KQL |
| LangChain |
LangChain |
| Machine learning |
Basic concepts |
| Matillion |
Matillion |
| Microsoft Fabric |
Microsoft Fabric |
| MLflow |
MLflow |
| MongoDB |
MongoDB |
| Palantir Foundry |
Palantir Foundry |
| Pandas |
A software library written for the Python for data manipulation and analysis |
| Polars |
Polars |
| Power BI |
A business analytics and data visualization tool |
| Power BI DAX |
Power BI DAX |
| PySpark |
PySpark |
| Python |
The basics, interpreter, numbers, text, lists, sets, dictionaries, control flow, loops, functions |
| Python Advanced |
Functions, annotations, coding style, reading and writing files, classes, iterators, standard library |
| Python How-To |
How-to's |
| RxSwift |
Basics of RxSwift |
| Scala |
Scala for data engineering |
| Scala Essential |
Essential Scala programming concepts |
| Snowflake |
A cloud data platform that at it's core features a columnar-stored data warehouse |
| Spark Structured Streaming |
Spark Structured Streaming |
| SQL |
SQL |
| SQL How to |
SQL tips & tricks |
| Streamlit |
Streamlit |
| Swift Advanced |
Properties, subscripts, concurrency, type casting, nested types, extensions, protocols, generics, Combine framework |
| Swift Basics |
The basics, string and characters, collection types, control flow, functions, closures, enumerations, structures and classes, properties, methods |
| Swift UI Advanced |
Advanced topics and how-to's |
| Swift UI Basics |
Walk through the building blocks of a SwiftUI |
| Tableau |
Tableau |
| Terraform |
An infrastructure as code tool that lets you build, change, and version infrastructure safely and efficiently |