Skip to main navigation Skip to search Skip to main content

Building a Gold Standard Dataset to Identify Articles About Geographic Information Science

  • Carlos Lopez-Vazquez (First Author)
  • , Maria Ester Gonzalez-Campos
  • , Miguel A. Bernabe-Poveda
  • , Daniela Moctezuma (Corresponding Author)
  • , Esther Hochsztain
  • , Maria A. Barrera
  • , Carlos Granell-Canut
  • , Maria F. Leon-Pazmino
  • , Pablo Lopez-Ramirez
  • , Villie Morocho-Zurita
  • , Javier Moya-Honduvilla
  • , Maria T. Manrique-Sancho
  • , Marcela E. Montiveros
  • , Rocio Narvaez-Benalcazar
  • , Jose De Perez-Alcazar
  • , Yuri Resnichenko
  • , Diego Seco
  • Universidad ORT Uruguay
  • Universidad de Concepción
  • Universidad Nacional de Catamarca
  • Centro de Investigación en Ciencias de la Información Geoespacial (CentroGeo)
  • Universidad de la República
  • Jaume I University
  • Instituto Geográfico Militar
  • EnvJoy Nature SL
  • Las Rozas de Madrid
  • Universidad de Cuenca
  • Universidade de São Paulo

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

To know the overall regional or international scientific production is of vital importance to many areas of knowledge. Nevertheless, in interdisciplinary areas such as Geographic Information Science (GISc) it is not enough to just count papers published in specific journals. Most of them, as is the case of the International Journal of Remote Sensing (IJRS), welcome GISc papers but are not exclusive to that area so the production assignable to authors in the region must consider not only affiliation but also whether or not each paper falls into the theme of GISc. IJRS publishes far more papers than any other GISc journal, so it is important to assess quantitatively how many of them are of GISc. In this work, a representative sample of IJRS articles published over a period of almost 30 years was analyzed using a specific GISc definition. With these data, a manual classification methodology through a set of experts was carried out, and a dataset was built, analyzed, and statistically tested. As a result we estimate that between 47 and 76% of the IJRS articles can be considered from GISc, with a confidence level of 95%. Aside from the primary goal, this set could be used as a gold standard for future classification tasks. It constitutes the first GISc dataset of this kind, that may be used to train artificial intelligence systems capable of performing the same classification automatically and in a massive way. A similar procedure could be applied to other interdisciplinary fields of knowledge as well.

Original languageEnglish
Pages (from-to)19926-19936
Number of pages11
JournalIEEE Access
Volume10
DOIs
StatePublished - 2022

Keywords

  • Gold standard
  • geographic information science
  • indexer consistency
  • manual classification

Fingerprint

Dive into the research topics of 'Building a Gold Standard Dataset to Identify Articles About Geographic Information Science'. Together they form a unique fingerprint.

Cite this