An initial Analysis of Topic-based Similarity among Scientific Documents based on their Rhetorical Discourse Parts

Badenes-Olmedo, Carlos and Redondo-Garcia, Jose Luis and Corcho, Oscar (2017). An initial Analysis of Topic-based Similarity among Scientific Documents based on their Rhetorical Discourse Parts. In: "SemSci 2017 workshop co-located with ISWC", 21-25 Oct 2017, Vienna, Austria.

Description

Title: An initial Analysis of Topic-based Similarity among Scientific Documents based on their Rhetorical Discourse Parts
Author/s:
  • Badenes-Olmedo, Carlos
  • Redondo-Garcia, Jose Luis
  • Corcho, Oscar
Item Type: Presentation at Congress or Conference (Article)
Event Title: SemSci 2017 workshop co-located with ISWC
Event Dates: 21-25 Oct 2017
Event Location: Vienna, Austria
Title of Book: CEUR Workshop Proceedings
Date: 2017
Subjects:
Faculty: E.T.S. de Ingenieros Informáticos (UPM)
Department: Inteligencia Artificial
UPM's Research Group: Ontology Engineering Group OEG
Creative Commons Licenses: Recognition - Share

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (817kB) | Preview

Abstract

Summaries and abstracts of research papers have been tra- ditionally used for many purposes by scientists, research practitioners, editors, programme committee members or reviewers (e.g. to identify relevant papers to read or publish, cite them, explore new fields and disciplines). As a result, many paper repositories only store or expose abstracts, what may limit the capacity of finding the right paper for a specific research purpose. Given the size limitations and the concise nature of abstracts, they usu- ally omit explicit references to some contributions and impacts of the paper. Therefore for certain information retrieval tasks they cannot be considered as the most appropriate excerpt of the paper to base these operations on. In this paper we have studied other kinds of summaries, built upon textual fragments falling under certain categories of the sci- entific discourse, such as outcome, background, approach, etc, in order to decide which one is more appropriate in order to substitute the origi- nal text. In particular, two novel measures are proposed: (1) internal- representativeness, which evaluates how well a summary describes what the full-text is about and (2) external-representativeness, which evaluates the potential of a summary to discover related texts. Results suggest that summaries explaining the method of a scientific article express a more accurate description of the full-content than oth- ers. In addition, more relevant related articles are also discovered from summaries describing the method, together with those containing the background knowledge or the outcomes of the research paper.

More information

Item ID: 52012
DC Identifier: http://oa.upm.es/52012/
OAI Identifier: oai:oa.upm.es:52012
Deposited by: Carlos Badenes-Olmedo
Deposited on: 03 Sep 2018 10:38
Last Modified: 03 Sep 2018 10:38
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM