Managementul informatiilor in mediul WEB

FISA DISCIPLINEI

Anul universitar 2022 - 2023



  Departament Home


Cod : MIAM315
Titular curs : Conf. dr. M. Cosulschi
Forma de invatamant : Master
Ciclul : 2 Anul : 2
Semestrul : 1, Curs : 2h, Laborator : 2h
Nr. credite : 6
Profil : Informatica
Specializare : Metode si Modele ale Inteligentei Artificiale
Tip disciplina : obligatorie
Categoria formativa : de specialitate


Obiective:

  • Insusirea principiilor web-ului semantic;
  • Intelegerea principiilor de functionare ale unui motor de cautare;
  • Modelarea datelor pe web;
  • Insusirea paradigmei de programare MapReduce.

Continutul cursului:

  • Arhitectura Web
  • Modelul RDF (Resource Description Framework);
  • Managementul datelor RDF. Interogarea datelor RDF cu SPARQL;
  • Arhitectura aplicatiilor Web-ului semantic. Linked Data;
  • Extragerea automata a datelor din paginile Web;
  • Cloud Computing;
  • MapReduce - dezvoltarea aplicatiilor distribuite cu MapReduce;
  • Apache Hadoop - arhitectura;
  • Pig, Cassandra, etc.

Forma de evaluare : examen

Bibliografie:
  1. S. Abiteboul, I. Manolescu, P. Rigaux, M.-C. Rousset, P. Senellart: Web Data Management, Cambridge University Press, 2011.
  2. C. D. Manning, P. Raghavan and H. Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008.
  3. J. Lin and C. Dyer, Data-Intensive Text Processing with MapReduce, Morgan & Claypool Publishers, 2010.
  4. T. Heath and C. Bizer, Linked Data: Evolving the Web into a Global Data Space (1st edition), Synthesis Lectures on the Semantic Web: Theory and Technology, Morgan & Claypool, 2011.
  5. T. White, Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale, 3rd Edition, O'Reilly Media / Yahoo Press, 2012.
  6. S. Abiteboul, R. Hull, V. Vianu, Foundations of databases, Addison-Wesley, 1995.
  7. S. Abiteboul, P. Buneman, D. Suciu, Data on the Web: From Relations to Semistructured Data and XML, Morgan Kaufmann, 1999.

Material didactic:

  1. Above the Clouds: A Berkeley View of Cloud Computing
  2. Resource Description Framework (RDF)
  3. Introducere catre RDF
  4. **!!Introduction to RDF
  5. Linked data
  6. Inside search
  7. Datasets Available for Linked Open Data Initiatives
  8. PatchR Repository
  9. The Dark Face of Google.
  10. Introduction to RDF and the Semantic Web for the life sciences.
  11. Sindice.com: A Document-oriented Lookup Index for Open Linked Data

Pachete software:

  1. Lucene - este o librarie software gratuita pentru extragerea de informatii;
  2. SIREn: - Efficient semi-structured Information Retrieval for Lucene;
  3. AllegroGraph - o baza de date scalabila pentru date RDF;
  4. Swoogle - motor de cautare peste date semantice;
  5. Aplicatii - ce folosesc open data;

Prezentari:

  1. Data Sciences: From First Order Logic to the Web - Serge Abiteboul;

Alte cursuri:

  1. Dezvoltarea aplicatiilor Web
  2. CS 276: Information Retrieval and Web Search
  3. CSE 591: Semantic Web Mining
  4. CS561 Web Data Management (Spring 2017)
  5. Web and Social Information Extraction
  6. Web Data Management

Ultima actualizare: Octombrie 2022