Matching and Merging Anonymous Terms from Web Sources

Research output: Contribution to journalArticleScientificpeer-review

Abstract

This paper describes a workflow of simplifying and matching special language terms in RDF generated from trawling term candidates from Web terminology sites with TermFactory, a Semantic Web framework for professional terminology. Term candidates from such sources need to be matched and eventually merged with resources already in TermFactory. While merging anonymous data, it is important not to lose track of provenance. For coding provenance in RDF, TF uses a minor but apparently novel variant of RDF reification. In addition, TF implements a toolkit of methods for dealing with graphs containing anonymous (blank) nodes.
Original languageEnglish
Article numberNo.4
JournalInternational journal of web and semantic technology
Volume5
Issue number4
Pages (from-to)53-69
Number of pages17
Publication statusPublished - Oct 2014
MoE publication typeA1 Journal article-refereed

Fields of Science

  • 222 Other engineering and technologies
  • RDF
  • provenance
  • anonymous/blank nodes
  • LSP
  • professional terminology work

Cite this

@article{d42578d868cd40598052645895d7f1d7,
title = "Matching and Merging Anonymous Terms from Web Sources",
abstract = "This paper describes a workflow of simplifying and matching special language terms in RDF generated from trawling term candidates from Web terminology sites with TermFactory, a Semantic Web framework for professional terminology. Term candidates from such sources need to be matched and eventually merged with resources already in TermFactory. While merging anonymous data, it is important not to lose track of provenance. For coding provenance in RDF, TF uses a minor but apparently novel variant of RDF reification. In addition, TF implements a toolkit of methods for dealing with graphs containing anonymous (blank) nodes.",
keywords = "222 Other engineering and technologies, RDF, provenance, anonymous/blank nodes, LSP, professional terminology work",
author = "Kun Ji and Shanshan Wang and Lauri Carlson",
year = "2014",
month = "10",
language = "English",
volume = "5",
pages = "53--69",
journal = "International journal of web and semantic technology",
issn = "0976-2280",
publisher = "Academy & Industry Research Collaboration Centre",
number = "4",

}

Matching and Merging Anonymous Terms from Web Sources. / Ji, Kun; Wang, Shanshan; Carlson, Lauri.

In: International journal of web and semantic technology, Vol. 5, No. 4, No.4, 10.2014, p. 53-69.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Matching and Merging Anonymous Terms from Web Sources

AU - Ji, Kun

AU - Wang, Shanshan

AU - Carlson, Lauri

PY - 2014/10

Y1 - 2014/10

N2 - This paper describes a workflow of simplifying and matching special language terms in RDF generated from trawling term candidates from Web terminology sites with TermFactory, a Semantic Web framework for professional terminology. Term candidates from such sources need to be matched and eventually merged with resources already in TermFactory. While merging anonymous data, it is important not to lose track of provenance. For coding provenance in RDF, TF uses a minor but apparently novel variant of RDF reification. In addition, TF implements a toolkit of methods for dealing with graphs containing anonymous (blank) nodes.

AB - This paper describes a workflow of simplifying and matching special language terms in RDF generated from trawling term candidates from Web terminology sites with TermFactory, a Semantic Web framework for professional terminology. Term candidates from such sources need to be matched and eventually merged with resources already in TermFactory. While merging anonymous data, it is important not to lose track of provenance. For coding provenance in RDF, TF uses a minor but apparently novel variant of RDF reification. In addition, TF implements a toolkit of methods for dealing with graphs containing anonymous (blank) nodes.

KW - 222 Other engineering and technologies

KW - RDF

KW - provenance

KW - anonymous/blank nodes

KW - LSP

KW - professional terminology work

M3 - Article

VL - 5

SP - 53

EP - 69

JO - International journal of web and semantic technology

JF - International journal of web and semantic technology

SN - 0976-2280

IS - 4

M1 - No.4

ER -