A mapping approach for fully virtual data integration system processes
dc.Affiliation | October University for Modern Sciences and Arts (MSA) | |
dc.contributor.author | El Qutaany A.Z. | |
dc.contributor.author | Hegazi O.M. | |
dc.contributor.author | El Bastawissy A.H. | |
dc.contributor.other | Faculty of Computers and Information, Cairo University, Cairo, Egypt | |
dc.contributor.other | Faculty of Computer Science, MSA University, Cairo, Egypt | |
dc.date.accessioned | 2020-01-09T20:41:02Z | |
dc.date.available | 2020-01-09T20:41:02Z | |
dc.date.issued | 2018 | |
dc.description | Scopus | |
dc.description.abstract | Nowadays, organizations cannot satisfy their information needs from a single data source, and the presence of multiple data sources across an organization fuels the need for data integration. Users of a data integration system pose queries in terms of an integrated schema and expect accurate, unambiguous, and complete answers. The data integration system is therefore not limited to retrieving the answers to queries from the sources; it must also detect and resolve the data quality problems that arise from the integration process. The most crucial component of any data integration system is the set of mappings constructed between the data sources and the integrated schema. In this paper, a new mapping approach is proposed that maps not only the elements of the integrated schema, as existing approaches do, but also the other elements required to detect and resolve duplicates. It provides a means to facilitate future extensibility and changes to both the sources and the integrated schema. The proposed approach links the fundamental components required to provide accurate and unambiguous answers to users' queries from the integration system. © 2018 International Journal of Advanced Computer Science and Applications. | en_US |
dc.description.uri | https://www.scimagojr.com/journalsearch.php?q=21100867241&tip=sid&clean=0 | |
dc.identifier.doi | https://doi.org/10.14569/IJACSA.2018.091216 | |
dc.identifier.issn | 2158107X | |
dc.identifier.other | https://doi.org/10.14569/IJACSA.2018.091216 | |
dc.identifier.uri | https://t.ly/2dXxk | |
dc.language.iso | English | en_US |
dc.publisher | Science and Information Organization | en_US |
dc.relation.ispartofseries | International Journal of Advanced Computer Science and Applications | |
dc.relation.ispartofseries | 9 | |
dc.subject | Data integration | en_US |
dc.subject | Inconsistency detection | en_US |
dc.subject | Inconsistency resolution | en_US |
dc.subject | Mapping | en_US |
dc.subject | Virtual data integration | en_US |
dc.title | A mapping approach for fully virtual data integration system processes | en_US |
dc.type | Article | en_US |
dcterms.isReferencedBy | Lenzerini, M., "Data integration: A theoretical perspective" (2002) Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, Madison, Wisconsin, USA, June; Golshan, B., Halevy, A.Y., Mihaila, M., Tan, W.C., "Data integration: after the teenage years" (2017) Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS'17), Chicago, Illinois, USA; Xu, L., Embley, D.W., "Combining the best of Global-as-View and Local-as-View for data integration" (2004) Proceedings of the 3rd ISTA, Salt Lake City, Utah, USA; Rahm, E., Do, H.H., "Data cleaning: problems and current approaches" (2001) Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 23, pp. 103-113; Chu, X., Ilyas, I.F., Koutris, P., "Distributed data deduplication" (2016) Proceedings of the VLDB Endowment, 9, pp. 864-875; Elmagarmid, A., Ipeirotis, P.G., Verykios, V.S., "Duplicate record detection: a survey" (2007) IEEE Transactions on Knowledge and Data engineering, 19, pp. 1-16; Nentwig, M., Hartung, M., Ngomo, A., Rahm, E., "A survey of current link discovery frameworks" (2016) Semantic Web Journal, 8, pp. 419-436; Mudgal, S., Li, H., Rekatsinas, T., Doan, A., Park, Y., Krishnan, G., Deep, R., Raghavendra, V., "Deep learning for entity matching: a design space exploration" (2018) Proceedings of the International Conference on Management of Data SIGMOD'18, TX, USA, June; Yang, Y., Sun, Y., Tang, J., Ma, B., Li, J., "Entity matching across heterogeneous sources" (2015) Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'15), Sydney, NSW, Australia, August; Gruenheid, A., Dong, X.L., Srivastava, D., "Incremental record linkage" (2014) VLDB Endowment, 7, pp. 697-708; Rezig, E.K., Dragut, E.C., Ouzzani, M., Elmagarmid, A.K., "Query-time record linkage and fusion over Web databases" (2015) Proceedings of IEEE 31st International Conference on Data Engineering, Seoul, South Korea, April; Rezig, E.K., Dragut, E.C., Ouzzani, M., Elmagarmid, A.K., Aref, W.G., "ORLF: A flexible framework for online record linkage and fusion" (2016) Proceedings of IEEE 32nd International Conference on Data Engineering, Helsinki, Finland, May; Ilyas, I.F., Chu, X., "Trends in cleaning relational data: consistency and deduplication" (2015) Foundations and Trends in Databases Journal, 5, pp. 281-393; Bronselaer, A., Britsom, D.V., Tre, G.D., "Pointwise multi-values fusion" (2015) Proceedings of the 18th International Conference on Information Fusion, Washington, USA, July; Dubois, D., Liu, W., Ma, J., Prade, H., "The basic principles of uncertain information fusion. An organized review of merging rules in different representation frameworks" (2016) Proceedings of Information Fusion Heidelberg, Germany, July; Bronselaer, A., Britsom, D.V., Tre, G.D., "Propagation of data fusion" (2015) IEEE Tran. on Knowledge and data engineering, 27, pp. 1330-1342; Chen, X., Schallehn, E., Saake, G., "Cloud-scale entity resolution: current state and open challenges" (2018) Open Journal of Big Data (OJBD), 4, pp. 30-51; Gal, A., "Tutorial: uncertain entity resolution" (2014) VLDB Endowment, 7, pp. 1711-1712; Bleiholder, J., Naumann, F., "Conflict handling strategies in an integrated information system" (2006) Workshop on Information Integration on the Web (IIWeb), Edinburgh, UK, May; Bilke, A., Bleiholder, J., Bohm, C., Draba, K., Naumann, F., Weis, M., "Automatic data fusion with HumMer" (2005) Proceedings of the 31st VLDB, Trondheim, Norway, September; Motro, A., Berlin, J., Anokhin, P., "Multiplex, Fusionplex and Autoplex: three generations of information integration" (2004) ACM SIGMOD Record, 33, pp. 51-57; Giacomo, G.D., Lembo, D., Lenzerini, M., Rosati, R., "Tackling inconsistencies in data integration through source preferences" (2004) Proceedings of the International Workshop on Information Quality in Information Systems, Paris, France, June; Katsis, Y., Deutsch, A., Papakonstantinou, Y., Vassalos, V., "Inconsistency resolution in online databases" (2004) Proceedings of IEEE 26th International Conference on Data Engineering (ICDE), Long Beach, California, USA, March; Mendes, P.N., Muhleisen, H., Bizer, C., "Sieve: linked data quality assessment and fusion" (2012) 2nd International Workshop on Linked Web Data Management (LWDM 2012) at the 15th International Conference on Extending Database Technology, Berlin, Germany, March; Fan, W., Geerts, F., Tang, N., Yu, W., "Inferring data currency and consistency for conflict resolution" (2013) Proceedings of the 2013 IEEE International Conference on Data Engineering, Brisbane, Australia, April; Cali, A., Calvanese, D., De Giacomo, G., Lenzerini, M., "Data integration under integrity constraints" (2002) In Proceedings of the 14th International Conference on Advanced Information Systems Engineering, Ontario, Canada, May; Kirk, T., Levy, A.Y., Sagiv, Y., Srivastava, D., "The information manifold" (1995) Proceedings of the AAAI Spring Symp. On Information Gathering from Heterogeneous, Distributed Environments, Cambridge, Massachusetts, United States, November; McBrien, P.J., Poulovassilis, A., "Data integration by bi-directional schema transformation rules" (2003) Proceedings 19th International Conference on Data Engineering, Bangalore, India, March; Fagin, R., Haas, L.M., Hernandez, M., Miller, R.J., Popa, L., Velegrakis, Y., "Clio: schema mapping creation and data exchange" (2009) Conceptual Modeling: Foundations and Applications, Springer-Verlag, Berlin, Heidelberg; Rahm, E., "The case for holistic data integration" (2016) Proceedings of East European Conference on Advances in Databases and Information Systems, Prague, Czech Republic, August; Alsarkhi, A., Talburt, J.R., "A method for implementing probabilistic entity resolution" (2018) IJACSA, 9 (11), pp. 8-15, November | |
dcterms.source | Scopus |