Incorporating a metadata layer, and a Data Lake Datasets composed of discrete objects, like image collections, scanned documents or free text is a poor match for the traditional database environment. These requirements compel rethinking of basic assumptions about data architecture and system design, assumptions that have been present for two decades. It is no longer sufficient to use only a relational database and an ETL tool, nor is there a single unified data model for all data. Instead there are many discrete data sets that can be integrated as needed, stored in their original form or in various stages of integration all the way through to the heavily standardized and quality-assured data one finds in a data warehouse. This paper present Apache Spark as a computation Engine designed to solve the challenges related to data gravity, the fact that Services and applications that use data tend to bring even more masses of data.

A Customer Intelligence Platform: bringing customer insights in a CRM platform

EUGENIO BRENTARI;ANDREA ALBERICI
2016-01-01

Abstract

Incorporating a metadata layer, and a Data Lake Datasets composed of discrete objects, like image collections, scanned documents or free text is a poor match for the traditional database environment. These requirements compel rethinking of basic assumptions about data architecture and system design, assumptions that have been present for two decades. It is no longer sufficient to use only a relational database and an ETL tool, nor is there a single unified data model for all data. Instead there are many discrete data sets that can be integrated as needed, stored in their original form or in various stages of integration all the way through to the heavily standardized and quality-assured data one finds in a data warehouse. This paper present Apache Spark as a computation Engine designed to solve the challenges related to data gravity, the fact that Services and applications that use data tend to bring even more masses of data.
2016
978-9928-148-56-8
File in questo prodotto:
File Dimensione Formato  
Proceedings Book_ISTI2016.pdf

accesso aperto

Tipologia: Abstract
Licenza: Dominio pubblico
Dimensione 1.07 MB
Formato Adobe PDF
1.07 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/515766
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact