A Knowledge Engineering Approach to Recognizing and Extracting Sequences of Nucleic Acids from Scientific Literature

García Remesal, Miguel

, Maojo García, Víctor Manuel

and Crespo del Arco, Jose

(2010). A Knowledge Engineering Approach to Recognizing and Extracting Sequences of Nucleic Acids from Scientific Literature. En: "32nd Annual International Conference of the IEEE EMBS", 31/08/2010 - 04/09/2011, Buenos Aires, Argentina. ISBN 978-1-4244-4123-5.

Descripción

Título:	A Knowledge Engineering Approach to Recognizing and Extracting Sequences of Nucleic Acids from Scientific Literature
Autor/es:	García Remesal, Miguel https://orcid.org/0000-0002-5948-8691 Maojo García, Víctor Manuel https://orcid.org/0000-0001-5103-4292 Crespo del Arco, Jose https://orcid.org/0000-0002-0772-5421
Tipo de Documento:	Ponencia en Congreso o Jornada (Artículo)
Título del Evento:	32nd Annual International Conference of the IEEE EMBS
Fechas del Evento:	31/08/2010 - 04/09/2011
Lugar del Evento:	Buenos Aires, Argentina
Título del Libro:	Proceedings of the 32nd Annual International Conference of the IEEE EMBS
Fecha:	2010
ISBN:	978-1-4244-4123-5
Materias:	Biología Informática Medicina
ODS:	03. Salud y bienestar
Escuela:	Facultad de Informática (UPM) [antigua denominación]
Departamento:	Inteligencia Artificial
Licencias Creative Commons:	Reconocimiento - Sin obra derivada - No comercial

Texto completo

Vista Previa

PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (81kB) | Vista Previa

Resumen

In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.

Más información

ID de Registro:	9123
Identificador DC:	https://oa.upm.es/9123/
Identificador OAI:	oai:oa.upm.es:9123
URL Oficial:	http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumb...
Depositado por:	Memoria Investigacion
Depositado el:	15 Nov 2011 11:40
Ultima Modificación:	20 May 2024 11:20

Estadísticas

Exportar cita

Editar (sólo personal del Archivo)

En esta página

Menú principal

Buscar

A Knowledge Engineering Approach to Recognizing and Extracting Sequences of Nucleic Acids from Scientific Literature

Cita

Descripción

Texto completo

Resumen

Más información

Acciones

Documentos

El repositorio

Agrupados por ...

Datos Investigación

Financiadores

Especiales

En otros formatos

Redes sociales

Información adicional