Skip to main navigation Skip to search Skip to main content

Combining statistical and semantic approaches to the translation of ontologies and taxonomies

  • John McCrae
  • , Mauricio Espinoza
  • , Elena Montiel-Ponsoda
  • , Guadalupe Aguado-De-Cea
  • , Philipp Cimiano

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

Ontologies and taxonomies are widely used to organize concepts providing the basis for activities such as indexing, and as background knowledge for NLP tasks. As such, translation of these resources would prove useful to adapt these systems to new languages. However, we show that the nature of these resources is significantly different from the “free-text” paradigm used to train most statistical machine translation systems. In particular, we see significant differences in the linguistic nature of these resources and such resources have rich additional semantics. We demonstrate that as a result of these linguistic differences, standard SMT methods, in particular evaluation metrics, can produce poor performance. We then look to the task of leveraging these semantics for translation, which we approach in three ways: by adapting the translation system to the domain of the resource; by examining if semantics can help to predict the syntactic structure used in translation; and by evaluating if we can use existing translated taxonomies to disambiguate translations. We present some early results from these experiments, which shed light on the degree of success we may have with each approach.

Original languageEnglish
Title of host publication5th Workshop on Syntax, Semantics and Structure in Statistical Translation, SSST 2011 at the Annual Meeting of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, ACL HLT 2011 - Proceedings of the Workshop
EditorsDekai Wu, Marianna Apidianaki, Marine Carpuat, Lucia Specia
PublisherAssociation for Computational Linguistics (ACL)
Pages116-125
Number of pages10
ISBN (Electronic)9781932432992
StatePublished - 2011
Event5th Workshop on Syntax, Semantics and Structure in Statistical Translation, SSST 2011 at the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL HLT 2011 - Portland, United States
Duration: 23 Jun 2011 → …

Publication series

Name5th Workshop on Syntax, Semantics and Structure in Statistical Translation, SSST 2011 at the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL HLT 2011 - Proceedings of the Workshop

Conference

Conference5th Workshop on Syntax, Semantics and Structure in Statistical Translation, SSST 2011 at the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL HLT 2011
Country/TerritoryUnited States
CityPortland
Period23/06/11 → …

Fingerprint

Dive into the research topics of 'Combining statistical and semantic approaches to the translation of ontologies and taxonomies'. Together they form a unique fingerprint.

Cite this