- Blob storage
- Data Lake storage
- Blob access tiering
- Lifecycle management
- File format choice (Avro, CSV, Parquet, ORC)
- Blob access tiering
- HDInsight
- Databricks
- Premium SKU for cluster autoscaling
- Cluster type (high concurrency)
- Load into data frame, transform, write to DLS
- Synapse
- SQL Pool
- Distributions
- Partitioning T-SQL
- Partition switch and drop
- Slowly changing dimensions
- Types
- Distributions
- Spark Pool
- SQL Pool
- At-rest data encryption
- TDE
- Key Vaults
- Write encrypted data to tables or Parquet files
- Dynamic Data Masking
- Data Classification
- Row-level security
- Data retention / purge
- RBAC
- Managed identities
- Resources (Synapse Studio > Manage > Access Ctrl)
- DLS POSIX ACLs
- SQL Server and database firewall rules
- Automatic tuning (server, db level)
AZURE POLICY FOR DATABRICKS AND SYNAPSE
- ELT Data Transformations
- Spark / Databricks
- Data Factory
- Synapse pipelines
- Streaming Data Processing
- Streaming Units
- Event Hub/IoT Hub
- Stream Analytics
- Windowing aggregates
- DATEDIFF, LAST, LAG and LIMIT DURATION functions
- Protobuff deserializer as input
- Power BI output
- Batch processing solutions
- PolyBase
- Data Factory
- Copy activity
- Conditional split and sink transformations
- View flow source code
- Runtimes
- Triggers
- Azure Monitor metrics, logs, alerts
- Monitoring pipelines
- ADF run data 45 min; Log Analytics for 2 years
- Measure query / cluster performance
- Spark DAG
- Power BI output
- Item types & techniques
- Online testing process
- Grievances/problem resolution
- After the exam (dashboard, badge)
- Renewals
- Practice exams/questions
- Hands-on labs
=================