Lake Data Warehouse Architecture for Big Data Solutions

Show simple item record

dc.contributor.author Saddad, Emad
dc.contributor.author Mokhtar, Hoda M. O.
dc.contributor.author El-Bastawissy, Ali
dc.contributor.author Hazman, Maryam
dc.date.accessioned 2020-09-25T20:08:49Z
dc.date.available 2020-09-25T20:08:49Z
dc.date.issued 2020
dc.identifier.issn 2158-107X
dc.identifier.uri http://repository.msa.edu.eg/xmlui/handle/123456789/3786
dc.description Scopus en_US
dc.description.abstract Traditional Data Warehouse is a multidimensional repository. It is nonvolatile, subject-oriented, integrated, time- variant, and non-operational data. It is gathered from multiple heterogeneous data sources. We need to adapt traditional Data Warehouse architecture to deal with the new challenges imposed by the abundance of data and the current big data characteristics, containing volume, value, variety, validity, volatility, visualization, variability, and venue. The new architecture also needs to handle existing drawbacks, including availability, scalability, and consequently query performance. This paper introduces a novel Data Warehouse architecture, named Lake Data Warehouse Architecture, to provide the traditional Data Warehouse with the capabilities to overcome the challenges. Lake Data Warehouse Architecture depends on merging the traditional Data Warehouse architecture with big data technologies, like the Hadoop framework and Apache Spark. It provides a hybrid solution in a complementary way. The main advantage of the proposed architecture is that it integrates the current features in traditional Data Warehouses and big data features acquired through integrating the traditional Data Warehouse with Hadoop and Spark ecosystems. Furthermore, it is tailored to handle a tremendous volume of data while maintaining availability, reliability, and scalability. en_US
dc.description.uri https://www.scimagojr.com/journalsearch.php?q=21100867241&tip=sid&clean=0
dc.language.iso en en_US
dc.publisher SAI en_US
dc.relation.ispartofseries International Journal of Advanced Computer Science and Applications;, Vol. 11, No. 8, 2020
dc.subject October University for spark en_US
dc.subject Hadoop en_US
dc.subject novel data warehouses architecture en_US
dc.subject unstructured data en_US
dc.subject semi- structured data en_US
dc.subject Traditional data warehouse en_US
dc.subject big data en_US
dc.title Lake Data Warehouse Architecture for Big Data Solutions en_US
dc.type Article en_US
dc.Affiliation October University for modern sciences and Arts (MSA)


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search MSAR


Advanced Search

Browse

My Account