Submissions/Semanticpedia: why a semantic extraction of the Wikimedia projects still make sense in the Wikidata era

From Wikimania 2013 • Hong Kong

After careful consideration, the programme committee has decided not to accept the submission below at this time. Thank you to the author(s) for participating in the Wikimania 2013 programme submission; we hope to still see you at Wikimania this August.

Submission no.
5038
Subject no.
T4
Title of the submission
Semanticpedia: why a semantic extraction of the Wikimedia projects still make sense in the Wikidata era
Type of submission
presentation
Author of the submission
Julien Cojan
Country of origin
France
Affiliation
Inria
E-mail address
julien.cojan@inria.fr
Username
lejuin
Personal homepage or blog
http://www-sop.inria.fr/members/Julien.Cojan/
Abstract

DBpedia is a project that extracts and publishes data from Wikipedia. Although part of this content has been integrated into Wikidata, the DBpedia extraction provides data that was not migrated to Wikidata, such as hierarchical relations between categories, as well as data generated by the DBpedia community, such as interlinking with Linked Data resources. Links between Wikidata and DBpedia are also being extracted, so that the two datasets can be used in a complementary way.

We will first present the DBpedia extraction process and explain what data can be extracted from Wikipedia and other Wikimedia projects. We will then present applications built on the extracted data that could be useful to the Wikimedia community.

Detailed proposal

Semanticpedia is a collaboration between INRIA, the French Ministry of Culture and Wikimedia France to extract data from French-language Wikimedia content. The first project led by Semanticpedia is DBpedia in French, which extracts data from the French Wikipedia and is part of the DBpedia internationalization committee.

Part of the data currently extracted from Wikipedia has been integrated into Wikidata, or will be in the short term. However, not all of the data that can be extracted is planned for migration to Wikidata. For instance, the category hierarchy relations available in DBpedia are not migrated. In addition, an interlinking effort by the DBpedia community provides connections with many Linked Data resources. Connections between Wikidata and DBpedia are also now being extracted, so that the two datasets can be used in a complementary way[1].

We will explain how the extraction process works and what data can be extracted from Wikipedia and other Wikimedia projects. We will then present some applications built on the extracted data that could be useful to the Wikimedia community, for instance cycle detection in the category hierarchy and analysis of how images are distributed across categories.
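
As an illustration of the cycle detection mentioned above, here is a minimal sketch in Python. It is not the Semanticpedia implementation. It assumes the category hierarchy has already been loaded into a dictionary mapping each category to the list of its parent categories (in DBpedia these relations are published as skos:broader triples). The function name `find_cycle` and the sample category names are hypothetical.

```python
from typing import Dict, List, Optional


def find_cycle(broader: Dict[str, List[str]]) -> Optional[List[str]]:
    """Return one cycle as a list of categories (first == last), or None.

    `broader` maps a category to its parent categories. This input shape
    is an assumption made for the example.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current path, finished
    state: Dict[str, int] = {}

    for start in broader:
        if state.get(start, WHITE) != WHITE:
            continue
        # Iterative depth-first search, so deep hierarchies do not hit
        # Python's recursion limit.
        state[start] = GREY
        path = [start]
        stack = [iter(broader.get(start, []))]
        while stack:
            for parent in stack[-1]:
                parent_state = state.get(parent, WHITE)
                if parent_state == GREY:
                    # The parent is already on the current path: cycle found.
                    return path[path.index(parent):] + [parent]
                if parent_state == WHITE:
                    state[parent] = GREY
                    path.append(parent)
                    stack.append(iter(broader.get(parent, [])))
                    break
            else:
                # All parents explored: this category is cycle-free.
                state[path.pop()] = BLACK
                stack.pop()
    return None


# Hypothetical sample data: A -> B -> C -> A forms a cycle.
sample = {"A": ["B"], "B": ["C"], "C": ["A"], "D": ["A"]}
print(find_cycle(sample))
```

The sketch reports only the first cycle it encounters. Listing every cycle, or every strongly connected group of categories, would need a different algorithm.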


Track
Technology and Infrastructure
Length of presentation/talk
25 minutes
Language of presentation/talk
English
Will you attend Wikimania if your submission is not accepted?
Not sure
Slides or further information (optional)
Special requests
  1. https://meta.wikimedia.org/wiki/Wikidata/Notes/DBpedia_and_Wikidata


Interested attendees

If you are interested in attending this session, please sign with your username below. This will help reviewers decide which sessions are of high interest. Sign with four tildes (~~~~).

  1. --SereinWMfr (talk) 12:50, 30 April 2013 (UTC)
  2. Phoebe (talk) 04:20, 1 May 2013 (UTC) I would like to know more about DBpedia.
  3. PierreSelim (talk) 11:48, 2 May 2013 (UTC)
  4. Multichill (talk) 14:51, 4 May 2013 (UTC)