Infographic: Linked Open Data Lifecycle

Linked Open Data Lifecycle

I know, it is not really a info graphic but the term ‘research project poster’ is not that 'en vogue' in the web :) I made it last week as a last-minute-job for the LOD2 lifecycle. Sebastian Tramp is currently at the ESWC 2012 in Crete, attending there at the EU Project networking track to represent the LOD2 project.

The lifecycle is taken from the slides “The Semantic Data Web” by Sören Auer from AKSW/Uni Leipzig. Text was also written by Sören. As the explanations on the poster aren’t that big, I add the texts here:

Linked Open Data Lifecycle

The lifecycle is supported by tools of the the Debian-based LOD2 Stack.

Extraction

RDF is the lingua franca for data integration on the Web. Other data structures, semi-structured and even unstructured information, however, are and will be always there as well. In LOD2 we develop techniques for mapping and accessing such information efficiently and effectively.

Tools: Triplify, D2R Server, DBpedia Extraction

Storage

RDF Data Mangement is still more challenging than relational Data Mangement. We aim to close this performance gap by employing column-store technology, dynamic query optimization, adaptive caching of joins, optimized graph processing, cluster/cloud scalability.

Tools: Openlink Virtuoso

Authoring

LOD2 facilitates the authoring of rich semantic knowledge bases, by leveraging Semantic Wiki technology, the WYSIWIM paradigm (What You See Is What You Mean) and distributed social, semantic collaboration and networking techniques.

Tools: OntoWiki, RDFaCE Text Annotation, Poolparty Taxonomy Editor

Interlinking

Creating and maintaining links in a (semi-)automated fashion is still a major challenge and crucial for establishing coherence and facilitating data integration. We aim at linking approaches yielding high precision and recall, which configure themselves automatically or based on end-user feedback.

Tools: Silk, LIMES, SemFM

Enrichment

Linked Data on the Web is mainly raw instance data. For data integration, fusion, search and many other applications, however, we need this raw instance data to be linked and integrated with upper level ontologies.

Tools: DL-Learner

Quality

The quality on the Data Web is varying as the quality on the document web varies. LOD2 develops techniques, which help to assess the quality based on characteristics such as provenance, context, coverage or structure.

Tools: WIQA, LODStats, LDIF Data Integration

Evolution

Data on the Web is dynamic. We need to facilitate the evolution of data while keeping things stable. Changes and modifications to knowledge bases, vocabularies and ontologies should be transparent and observable. LOD2 also develops methods to spot problems in knowledge bases and to automatically suggest repair strategies.

Tools: ORE, OntoWiki EvoPat

Exploration

For many users Data Web is still invisible below the surface. LOD2 develops search, browsing, exploration and visualization techniques for different kinds of Linked Data (i.e. spatial, temporal, statistic), which make the Data Web sensible for real users.

Tools: CubeViz, Sig.ma EE, Spatial Semantic Browser

LOD2 Project

LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme. Started 2010 it is planned for 4 years, comprises leading Linked Open Data technology researchers, companies, and service providers from across 11 European countries and one associated partner from Korea, it is coordinated by the AKSW research group at the University of Leipzig.

blog comments powered by Disqus
  1. 48techblog posted this