A Semantic Scraping Model for Web Resources - Applying Linked Data to Web Page Screen Scraping

Fernández Villamor, José Ignacio; Blasco Garcia, Jacobo; Iglesias Fernandez, Carlos Angel and Garijo Ayestaran, Mercedes (2011). A Semantic Scraping Model for Web Resources - Applying Linked Data to Web Page Screen Scraping. In: "ICAART 2011 3rd International Conference on Agents and Artificial Intelligence", 28/01/2011 - 30/01/2011, Rome, Italy. pp. 451-456.

Description

Title: A Semantic Scraping Model for Web Resources - Applying Linked Data to Web Page Screen Scraping
Author(s):
  • Fernández Villamor, José Ignacio
  • Blasco Garcia, Jacobo
  • Iglesias Fernandez, Carlos Angel
  • Garijo Ayestaran, Mercedes
Document Type: Conference or Workshop Presentation (Article)
Event Title: ICAART 2011 3rd International Conference on Agents and Artificial Intelligence
Event Dates: 28/01/2011 - 30/01/2011
Event Location: Rome, Italy
Book Title: Proceedings of ICAART 2011 - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence
Date: 2011
Subjects:
School: E.T.S.I. Telecomunicación (UPM)
Department: Telematic Systems Engineering [until 2014]
Creative Commons License: Attribution - No Derivative Works - Non-Commercial

Abstract

Despite the increasing presence of Semantic Web facilities, only a limited number of the resources available on the Internet provide semantic access. Recent initiatives such as the emerging Linked Data Web are providing semantic access to available data by porting existing resources to the Semantic Web using different technologies, such as database-semantic mapping and scraping. Nevertheless, existing scraping solutions are ad hoc, complemented with graphical interfaces for speeding up scraper development. This article proposes a generic framework for web scraping based on semantic technologies. The framework is structured in three levels: scraping services, semantic scraping model, and syntactic scraping. The first level provides an interface through which generic applications or intelligent agents can gather information from the web at a high level. The second level defines a semantic RDF model of the scraping process, in order to provide a declarative approach to the scraping task. Finally, the third level provides an implementation of the RDF scraping model for specific technologies. The work has been validated in a scenario that illustrates its application to mashup technologies.
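The three levels can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `ex:` vocabulary, the `MODEL` triples, and the names `lookup`, `FragmentExtractor` and `scrape` are all hypothetical, and plain tuples stand in for real RDF. The sketch only shows the idea of a declarative model (level two) being interpreted by a technology-specific extractor (level three) behind a simple service function (level one).

```python
from html.parser import HTMLParser

# Level 2 (assumed vocabulary, not from the paper): a declarative scraping
# model written as (subject, predicate, object) triples.
MODEL = [
    ("ex:TitleFragment", "ex:selectorTag", "h2"),
    ("ex:TitleFragment", "ex:selectorClass", "title"),
    ("ex:TitleFragment", "ex:mapsTo", "dc:title"),
]


def lookup(model, subject, predicate):
    """Return the object of the first triple matching subject and predicate."""
    for s, p, o in model:
        if s == subject and p == predicate:
            return o
    raise KeyError((subject, predicate))


class FragmentExtractor(HTMLParser):
    """Level 3: collect the text of elements with a given tag and CSS class.

    Assumes matching elements are not nested inside each other.
    """

    def __init__(self, tag, css_class):
        super().__init__()
        self.tag = tag
        self.css_class = css_class
        self.values = []
        self._inside = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == self.tag and self.css_class in classes:
            self._inside = True
            self._buffer = []

    def handle_data(self, data):
        if self._inside:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if self._inside and tag == self.tag:
            self.values.append("".join(self._buffer).strip())
            self._inside = False


def scrape(html, model, fragment):
    """Level 1: a high-level call that returns extracted data as triples."""
    extractor = FragmentExtractor(
        lookup(model, fragment, "ex:selectorTag"),
        lookup(model, fragment, "ex:selectorClass"),
    )
    extractor.feed(html)
    extractor.close()
    prop = lookup(model, fragment, "ex:mapsTo")
    return [(f"_:item{i}", prop, value) for i, value in enumerate(extractor.values)]


# Made-up page used only for illustration.
PAGE = (
    '<div><h2 class="title">First post</h2><p>text</p>'
    '<h2 class="title big">Second post</h2></div>'
)
TRIPLES = scrape(PAGE, MODEL, "ex:TitleFragment")
for triple in TRIPLES:
    print(triple)
```

Because the selector and the target property live in the model rather than in the extractor, scraping a different page layout would, under these assumptions, mean changing only the triples.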

More information

Record ID: 13159
DC Identifier: http://oa.upm.es/13159/
OAI Identifier: oai:oa.upm.es:13159
Official URL: http://www.icaart.org/ICAART2011/
Deposited by: Memoria Investigacion
Deposited on: 29 Nov 2012 11:41
Last Modified: 21 Apr 2016 12:28