Universidad Politecnica de Madrid
Search
Navegation
User Area
About Archivo Digital UPM
Dulcinea
Sherpa Romeo
Recolecta

Temporal characterization of the requests to Wikipedia

Reinoso, Antonio J. and González-Barahona, Jesus M. and Muñoz-Mansilla, Rocio and Herraiz Tabernero, Israel (2011) Temporal characterization of the requests to Wikipedia. Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval (DART 2011), 771 . ISSN 1613-0073

Ver estadisticas de descargas para este eprint (solo desde ordenadores de la UPM) Estadisticas UPM
Bookmark and Share
Item Type:Article
Authors/Creators:
Creators NameCreators email (if known)
Reinoso, Antonio J.areinpei@uax.es
González-Barahona, Jesus M.jgb@gsyc.urjc.es
Muñoz-Mansilla, Rociormunoz@dia.uned.es
Herraiz Tabernero, Israelisrael.herraiz@upm.es
Title:Temporal characterization of the requests to Wikipedia
Title of Book:Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval
Publisher:CEUR-WS.org
Journal/Publication Title:Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval (DART 2011)
Date:September 2011
Volume:771
Department:Mathematics and Computer Science applied in Civil Engineering
Faculty:E.T.S.I. Roads, Canals and Ports (UPM)
Creative Commons licenses:Recognition
Item ID:8836
Subjects:Mathematics
Computer Science

Texto completo disponible como:

[img]
Preview
PDF
196Kb - Idioma: English

Official URL: http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-771/

Abstract

This paper presents an empirical study about the temporal patterns characterizing the requests submitted by users to Wikipedia. The study is based on the analysis of the log lines registered by the Wikimedia Foundation Squid servers after having sent the appropriate content in response to users' requests. The analysis has been conducted regarding the ten most visited editions of Wikipedia and has involved more than 14,000 million log lines corresponding to the traffic of the entire year 2009. The conducted methodology has mainly consisted in the parsing and filtering of users' requests according to the study directives. As a result, relevant information fields have been finally stored in a database for persistence and further characterization. In this way, we, first, assessed, whether the traffic to Wikipedia could serve as a reliable estimator of the overall traffic to all the Wikimedia Foundation projects. Our subsequent analysis of the temporal evolutions corresponding to the different types of requests to Wikipedia revealed interesting differences and similarities among them that can be related to the users' attention to the Encyclopedia. In addition, we have performed separated characterizations of each Wikipedia edition to compare their respective evolutions over time.

Item Type:Article
Subjects:Mathematics
Computer Science
Código ID:8836
Depositado Por:Israel Herraiz
Depositado el:07 Sep 2011 07:57
Last Modified:17 Apr 2012 10:29

Sólo para Personal del Archivo: editar este registro