I am a new QA Tester to a large Data Warehouse and am struggling with understanding what to test and when enough is enough. Does anyone know of classes or online CBT's or books that explain IT QA testing in Data Warehouses?
Thanks! I also found thoughtworks.com. We are not testing the validity of the data, but that is one thing I am trying to change. I just have to convince the higher-ups of that. I am mainly trying to test the ETL process, the FTP process and the end marts.
I am a bit puzzled to figure out what you want to know. I presume you did not develop the warehouse engine yourself. In that context, you must have selected the warehouse to satisfy business needs, and there should be business requirements and/or functional specifications that tell you what your system is supposed to do. If you take that as the starting point to design your high-level cases it will almost certainly evolve into establishing a robust series of test cases that demonstrate that all the requirements are satisfied.
If you test the ETL processes then you need to be able to validate that the extract was correct and that the transformation and load succeeded based on your process requirements. Again, I assumy you would focus on your specifications, not on basic ETL functionality that may have been purchased to meet those requirements. Similarly, the FTP is a bit of a smokescreen, because such a standard web utility has been around for ages. If your target mart is an ISP, for example, they you need to be concerned about security, backup, etc., too much to list here. The point is, you should not think as if you have a hard time finding things to be tested. Rather, you may have more opportunities for testing than you have time to implement.