Format Conversion, ETL
ETL:
Extraction, Transformation and Loading. Activities required to populate data warehouses and OLAP applications with clean, consistent, integrated and probably summarized data.
From: www.olapreport.com/glossary.htm
Extract/Transform/Load: This is the process of extracting data from their operational data sources or external data sources, transforming the data which includes cleansing, aggregation, summarization, integration, as well as basic transformation and loading the data into some form of the data warehouse (ODS, enterprise data warehouse, data mart). ETL can also refer to the vendor software that performs these processes.
From : it.csumb.edu/departments/data/glossary.html
Octopus
Enhydra Octopus is a Java-based Extraction, Transformation, and Loading (ETL) tool. It may connect to any JDBC data sources and perform transformations defined in an XML file.
DODS data models are supported by generating oid's for new objects. Natural keys can be used to insert/update existing data and create relationships with oid's.
A loadjob-generator is provided to generate Octopus loadjob skeletons (and even DODS DOML files !) from an existing database. Many different types of databases can be mixed (MSSQL, Oracle, DB2, QED, JDBC-ODBC with Excel and Access, MySQL, CSV-files, XML-files,...) Four special JDBC drivers come with Octopus to support JDBC access to CSV-files (CSV-JDBC), MS-SQL (FreeTDS), XML (XML-JDBC) and property files (i18n-JDBC).
Octopus supports Ant and JUnit to create a database / tables and extract /load data during a build or test process. Loadjobs can be executed during execution of an application installation (e.g. NSIS, Installshield,...)
Octopus gives you a very generic way to transform data. You can define transformations by implementing Transformer interface or using JavaScript code (directly in load job XML file).
visit
Octopus
Octopus was visited : 56 times
Loading .....