Lake Data Warehouse Architecture for Big Data Solutions

dc.AffiliationOctober University for modern sciences and Arts (MSA)
dc.contributor.authorSaddad, Emad
dc.contributor.authorMokhtar, Hoda M. O.
dc.contributor.authorEl-Bastawissy, Ali
dc.contributor.authorHazman, Maryam
dc.date.accessioned2020-09-25T20:08:49Z
dc.date.available2020-09-25T20:08:49Z
dc.date.issued2020
dc.descriptionScopusen_US
dc.description.abstractTraditional Data Warehouse is a multidimensional repository. It is nonvolatile, subject-oriented, integrated, time- variant, and non-operational data. It is gathered from multiple heterogeneous data sources. We need to adapt traditional Data Warehouse architecture to deal with the new challenges imposed by the abundance of data and the current big data characteristics, containing volume, value, variety, validity, volatility, visualization, variability, and venue. The new architecture also needs to handle existing drawbacks, including availability, scalability, and consequently query performance. This paper introduces a novel Data Warehouse architecture, named Lake Data Warehouse Architecture, to provide the traditional Data Warehouse with the capabilities to overcome the challenges. Lake Data Warehouse Architecture depends on merging the traditional Data Warehouse architecture with big data technologies, like the Hadoop framework and Apache Spark. It provides a hybrid solution in a complementary way. The main advantage of the proposed architecture is that it integrates the current features in traditional Data Warehouses and big data features acquired through integrating the traditional Data Warehouse with Hadoop and Spark ecosystems. Furthermore, it is tailored to handle a tremendous volume of data while maintaining availability, reliability, and scalability.en_US
dc.description.urihttps://www.scimagojr.com/journalsearch.php?q=21100867241&tip=sid&clean=0
dc.identifier.issn2158-107X
dc.identifier.urihttp://repository.msa.edu.eg/xmlui/handle/123456789/3786
dc.language.isoenen_US
dc.publisherSAIen_US
dc.relation.ispartofseriesInternational Journal of Advanced Computer Science and Applications;, Vol. 11, No. 8, 2020
dc.subjectOctober University for sparken_US
dc.subjectHadoopen_US
dc.subjectnovel data warehouses architectureen_US
dc.subjectunstructured dataen_US
dc.subjectsemi- structured dataen_US
dc.subjectTraditional data warehouseen_US
dc.subjectbig dataen_US
dc.titleLake Data Warehouse Architecture for Big Data Solutionsen_US
dc.typeArticleen_US

Files