Sep 23 2008
ETL effort estimation: Points to factor-in
Estimation of ETL effort is not always fun. (as with any estimation).
There are several ways of estimating the effort needed to complete an ETL job. Work Breakdown Structure (WBS) is popular. And so is Function Point Analysis (FPA).
But the most widely used is the one that factors in complexity based on the understanding of things like source, target, resources on project, etc.
Though I haven’t really seen anyone use this method to perfection, it is a good place to start with. Some people argue against this method, but I see this as a complementary option to whatever method you have.
So, here is a list of points that I think would be useful when you do any ETL effort estimation. I have grouped it under 5 heads: Source, target, transformations, resources, other.
Source based:
- No of different sources & types
- Incremental extraction needs
- Profiling of data sources
- Cleansing / de-duplication dirty data sources
- Availability of documentation / transition of knowledge of source data
- Access control & management, if needed
- Data volumes for unit testing

