Dejan Sarka, MCT and Data Platform MVP Alumni, is an independent trainer and consultant who focuses on database development and data science. He is to founder of Slovenian community. He is author or coauthor of twenty books on SQL Server and data science.
Optimizing ETL
- Podatki in analitika
- ponedeljek 10:00 - 10:45
- Emerald 2
- Nivo 300
- Demo 0%
- Konferenca TECH
On SQL Server Side When you try to optimize Power BI, the first advice you get is to push the data processing as far upstream as possible. Query folding in Power Query tries to enforce this logic as much as possible. But what is going on SQL Server side? What if the daily extract – transform - load (ETL) process is still too slow? The real optimization of the ETL process starts with finding what has changed on the source, finding the delta. In SQL Server, you have many methods for this. You can use the Change Tracking (CT), Change Data Capture (CDC), system-versioned tables, ROWVERSION data type, triggers… Which method might be the fastest? In the next step, how do you optimize staging tables? Do you use regular tables, temporary tables, memory-optimized tables? Can you leverage minimally logged operations? Does it make sense to pre-aggregate the data? Disable indexes and constraints or not? What about table partitioning? This presentation will help you answer these questions and help finding the most suitable solution for you. Parts: • Finding the delta • Optimizing staging tables • Disabling indexes and constraints • Table partitioning
Dejan Sarka
Dejan Sarka s.p.
Komentiranje v NTK aplikaciji je mogoče za potrjene udeležence NT konference.
iPhone Android