Browsing by Author "H El-Bastawissy, Ali"
Now showing 1 - 2 of 2
- Results Per Page
- Sort Options
Item DIRA: A Framework Of Data Integration Using Data Quality(arXiv preprint, 2016) I Abdel Monem, Reham; H El-Bastawissy, Ali; M Elwakil, MohamedData integration is the process of collecting data from different data sources and providing user withunified view of answers that meet his requirements. The quality of query answers can be improved by identifying the quality of data sources according to some quality measures and retrieving data from only significant ones. Query answers that returned from significant data sources can be ranked according toquality requirements that specified in user query and proposed queries types to return only top-k query answers. In this paper, Data integration framework called Data integration to return ranked alternatives (DIRA) will be introduced depending on data quality assessment module that will use data sources quality to choose the significant ones and ranking algorithm to return top-k query answers according to different queries types.Item Using Information Gain in Data Fusion and Ranking(Recent Advances in Computer Engineering, Communications and Information Technology, 2014) M Hafez, Mohamed; H El-Bastawissy, Ali; M Hegazy, OsmanEntropy and information gain have been traditionally used to measure association between inputs and outputs. In this paper, Information gain is used to measure and decide the level of dependency or relevance between attributes. A data fusion technique based on information gain measures in a virtual data integration environment is introduced. After the detection and clustering of duplicates, the fused records are ranked and provided to the user in the final answer set with a preference score associated with each answer