Documente Academic
Documente Profesional
Documente Cultură
Open Data
Shashi Singh
Concept Introduction
Continued
Reviewed Papers
Jain, P., Hitzler, P., Sheth, A., Verma, K.: Ontology alignment for linked open data.
BLOOMS is a system for finding schema-level links between LOD datasets in the sense of
Ontology matching.
Julius Volz1, Christian Bizer, Martin Gaedke, and Georgi Kobilarov Discovering and
maintaining links on the web of data
Data Linking: Capturing and Utilising Implicit Schemalevel Relations Andriy Nikolov Victoria
Uren Enrico Motta
Main Challenges:
LOD datasets are interlinked these interlinks are mainly on instance level(owl:sameAs)
Schema level information that is taxonomies built using rdfs:subClassof is relatively scarce.
there is a lack of interlinks between the different schemas.
Applications based on LOD face difficulties due to loosely connected pieces of information.
there are no established benchmarks or available baselines for measuring precision and
recall for LOD schema alignment.
Most competitive state-of-art ontology alignment systems performed poorly on LOD schema
datastes.
Detailed Analysis
Results
BLOOMS Approach
- The chosen datasets give significant coverage of the
LOD cloud. cover different domains such as Music,
Publication and the Web.
- Some of the dataset providers such as LinkedMDB have
not made their schema publicly available.
there are no established benchmarks or available baselines for
measuring precision and recall for LOD schema alignment
human experts familiar with the domains created reference
alignments
BLOOMS approach
1) Preprocessing of the input ontologies
2) Construction of the BLOOMS forest
3) Comparision of the constructed
BLOOMS forest
4) Post Processing
Evaluation of Results They have compared more generic schema and have used Wikipedia for handling the
diverse domain of LOD. Following were the shortcomings of various ontology alignment systems suggested by
Jain et al.
Ontology Alignment
System
Issues
RiMOM
AROMA
OMViaUo
Alignment API
Able to find few correct analogy but found some wrong analogies as
well
S-Match
The Gap there are tools available for publishing Linked Data on the Web but there is still a
lack of tools that support data publishers in setting RDF Links to other data sources and to
maintain RDF links over time as data sources change
Design Goal of Silk was to fill this gap.
Silk - Linking Framework, a toolkit for discovering and maintaining data links between Web
data sources
Components
1) A link discovery engine, which computes links between data sources based on a
declarative specification of the conditions that entities must fulfill in order to be interlinked.
2) A tool for evaluating the generated data links in order to fine-tune the linking specification
3) A protocol for maintaining data links between continuously changing datasources
Main Features
- support the greneration of owl:sameAs links as well as other types of RDF links
Flexible,declarative language for specifying link conditions
Can be employed in distributed environments without having to replicate datasets locally
Capablity of being used where terms from different vocalbularies are mixed and where no
consistent RDFS or OWL schemata exist.
Link specification Language
- Data Access
<
- Link Conditions
Evaluating Links
- Resource Comparison
Silk Implementation
Written in Python
Runs from command line
Framework can be downloaded form
Google Code(http://silk.googlecode.com)
Challenges:
- Schema-level heterogeneity represents an obstacle for auto
- Silk - Linking Framework, a toolkit for discovering and maintaining data links between
Web data sources. Silk consists of three components.
- Coreference Resolution Service
A CRS maintaines "bundles" of URIs which are deemed to be equivalent
-The newly published repositories arelinked to hub repostories e.g Dbpedia and then, in
order to obtain complete information about a certain entity we need to compute a transitive
closure of coreference links and gather all URIs used to represent this entity in dfferent
datasets. These transitive closures can be maintained
in a centralised way e.g RKB explorer
recent effort to use ontology alignment systems for aligning ontologies on Linked Open Datasets.
BLOOMS is a system for finding schema-level links between LOD datasets in the sense of
Ontology matching. I wanted to use Agreement Maker to align ontologies on Linked Open
Data and Compare the results. To be able to suggest ways to improve on the alignmnet.
- human experts familiar with the domains created reference alignments
- The experts identified all possible subclass and equivalence mappings via a subclass or an equivalence relationship
Concept Introduction..
BBC Music Data about Artists, Releases and Reviews. Largely based upon MusicBrainz and the
Music Ontology
BBC Programmes Data about TV and Radio Programmes broadcast on by the BBC. Interlinked
with MusicBrainz and DBpedia.
The Bio2RDF project, a Semantic web atlas of post-genomic knowledge about human and
mouse, has published 27 biology-, gene- and medical-related data sets (altogether 2.3 billion
triples, served up by Virtuoso instances).