This is an article published as part of my graduate research:
One way to publish information in the Web is to create XML data sources. In these data sources, information is contained in one or more XML documents with a particular structure and content format. In this paper, we introduce a framework to prepare and to support the comparison of XML documents from different data sources, aiming at a further integration of similar XML instances. It is composed by some processes with one or more stages. The main contribution of this framework is to facilitate the similarity score definition between heterogeneous XML instances, allowing an uniformization of XML data defined in different contexts by different authors.