-
Notifications
You must be signed in to change notification settings - Fork 32
Home
olehmberg edited this page Oct 2, 2018
·
16 revisions
Welcome to the WInte.r Wiki. The pages in this Wiki introduce the most important concepts and parts of the framework. In addition, code examples provide a guideline how to start a data integration project.
The WInte.r framework comprises a data model and methods for various data integration tasks. The general workflow of an end-to-end data integration process looks like the following.
- Load the data into dataset objects
- Apply schema matching to get correspondences between attributes
- Apply identity resolution to get correspondences between records
- Transform the data into a consolidated schema using the correspondences between the attributes
- Use the correspondences between the records to perform data fusion and create one consolidated dataset
Contents