020INTDM2 | Enterprise Data Management |
---|---|
“Enterprise Data Management (EDM) is the ability of an organization to precisely define, easily integrate and effectively retrieve data for both internal applications and external communication. EDM focuses on the creation of accurate, consistent, and transparent content.” (Wikipedia). This course addresses the challenges of enterprise data management at scale, mainly at the level of the data architecture, data modeling and data integration, on-premise as well as on the cloud. It covers different enterprise data architectures i.e DataWarehouses, and DataLakes. It details various data models (structured, semi-structured (XML), unstructured and semantic data with RDF/OWL/SPARQL, and describes various NoSQL databases (key-value, Column, Document or Graph Oriented Databases), as well as various Big Data Formats (Avro, ORC and Parquet). It describes different data integration approaches: Integration according to a materialized view (Data Warehouses/OLAP) and integration according to a virtual view (Mediators/GAV-LAV). Temps présentiel : 37.5 heures Charge de travail étudiant : 70 heures Méthode(s) d'évaluation : Examen final |
Ce cours est proposé dans les diplômes suivants | |
---|---|
Master en data sciences |