As an outline, in the rest of the chapter, we proceed with
a brief presentation about the state of the art in ETL technology. Afterwards, we
discuss why the modeling of ETL workflows is important and we indicate the main
problems that arise during all the phases of an ETL process. Moreover, we present
a modeling approach for the construction of ETL workflows, which is based on
the life cycle of the data warehouse, along with an exemplary research framework
named Arktos II. Finally, we list several open research challenges that proclaim
ETL as a commodity of future research.
Figure 1. The environment of extraction-transformation-loading processes (Simitsis,
2004)
Sources
Extract Transform.
&.Clean
DW
Load
DSA
4 Simitsis, Vassiliadis, Skiadopoulos, & Sellis
Copyright ?© 2007, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of
Idea Group Inc. is prohibited.
Background
In this section, we present ETL methodologies that are proposed by (a) commercial
studies and tools and (b) the research community. Then, we present the reasons and
the motives that signify the research on ETL processes is a valid research goal.
State.of.the.Art
??? Commercial.studies.and.tools: In terms of technological aspects, the main
characteristic of the area is the involvement of traditional database vendors
with ETL solutions built in the DBMS.
Pages:
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234