ETL - Microsoft SSIS and Data Tools
Has anyone set up an ETL and if so what resources did you find helpful? I am a beginner in a DW environment and would like to actually be able to create ETL's to handle extracting data from different sources.
Even an ETL design or strategy guide would be helpful if you know of one.
Thanks in advance
Even an ETL design or strategy guide would be helpful if you know of one.
Thanks in advance
Comments
-
NightShade03 Member Posts: 1,383 ■■■■■■■□□□So I haven't used the two tools that you mentioned specifically, but there are some good open source ones that you can try out. The two I recommend you try out are "R" or Pandas (a Python framework). If you have a second you should go through:
https://www.udacity.com/course/ud032
https://www.udacity.com/course/ud651
Both courses walk you through the full ETL process and some samples of how to manipulate data sources as part of ETL (also called data munging). Each ETL design will be different so I doubt you'll find some reference architectures.
Hope this helps. -
N2IT Inactive Imported Users Posts: 7,483 ■■■■■■■■■■Thanks for the follow up. The platform is Windows Server and SQL Server with Data Tools that's why I am focusing on that manufacture. My boss specifically mentioned SSIS, so it looks like they want to go with that integration service.
Again thanks for posting this information, I'll review. Thanks! -
the_Grinch Member Posts: 4,165 ■■■■■■■■■■Man I am seeing data everywhere! Fell into the Matrix I think!
"I, I don't even see the code. All I see is...blond, brunette, red-head..." - CypherWIP:
PHP
Kotlin
Intro to Discrete Math
Programming Languages
Work stuff -
N2IT Inactive Imported Users Posts: 7,483 ■■■■■■■■■■Yeah all I do day in and day is write SQL. It's pretty amazing the stuff you pick up when you keep writing code over and over again.
With that said I am still to total NUB working through the challenges of RDBMS. -
RobertKaucher Member Posts: 4,299 ■■■■■■■■■■The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data has some excellent strategies and suggestions. I'd also say that just the data warehouse toolkit is required reading.