An approach to publish statistics from Open-Access Journals using Linked Data technologies
María Hallo, Sergio Luján-Mora, Juan Trujillo
Proceedings of the 9th International Technology, Education and Development Conference (INTED 2015), p. 5940-5948, Madrid (Spain), March 2-4 2015. ISBN: 978-84-606-5763-7. ISSN: 2340-1079.
(INTED'15a2)
Congreso internacional / International conference
Resumen
Semantic web encourages digital libraries that include open access journals, to collect, link and share their data across the web in order to ease its processing by machines and humans to get better queries and results. Linked Data technologies enable connecting structured data across the web using the principles and recommendations set out by Tim Berners-Lee in 2006. Several universities develop knowledge, through scholarship and research, under open access policies and use several ways to disseminate information. Open access journals collect, preserve and publish scientific information in digital form using a peer review process. The evaluation of the usage of this kind of publications needs to be expressed in statistics and linked to external resources to give better information about the resources and their relationships. The statistics expressed in a data mart facilitate queries about the history of journals usage by several criteria. This data linked to another datasets gives more information such as: the topics in the research, the origin of the authors, the relation to the national plans, and the relations with the study curriculums. This paper reports a process to publish an open access journal data mart on the web using Linked Data technologies in such a way that it can be linked to related datasets. Furthermore, methodological guidelines are presented with related activities. The proposed process was applied extracting data from a university open journal system data mart and publishing it in a SPARQL endpoint using the open source edition of the software OpenLink Virtuoso. In this process the use of open standards facilitates the creation, development and exploitation of knowledge. The RDF data cube vocabulary has been used as a model to publish the multidimensional data on the web. The visualization was made using CubeViz a faceted browser filtering observations to be presented interactively in charts. The proposed process help to publish statistical datasets in an easy way.