Reinoso, Antonio J. and González-Barahona, Jesus M. and Muñoz-Mansilla, Rocio and Herraiz Tabernero, Israel (2011) Temporal characterization of the requests to Wikipedia. Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval (DART 2011), 771. ISSN 1613-0073
| Item Type: | Article |
|---|---|
| Authors/Creators: | Reinoso, Antonio J.; González-Barahona, Jesus M.; Muñoz-Mansilla, Rocio; Herraiz Tabernero, Israel |
| Title: | Temporal characterization of the requests to Wikipedia |
| Title of Book: | Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval |
| Publisher: | CEUR-WS.org |
| Journal/Publication Title: | Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval (DART 2011) |
| Date: | September 2011 |
| Volume: | 771 |
| Department: | Mathematics and Computer Science applied in Civil Engineering |
| Faculty: | E.T.S.I. Roads, Canals and Ports (UPM) |
| Creative Commons licenses: | Recognition |
| Item ID: | 8836 |
| Subjects: | Mathematics; Computer Science |
Full text available as:
| PDF 196Kb - Language: English |
Official URL: http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-771/
Abstract
This paper presents an empirical study of the temporal patterns characterizing the requests submitted by users to Wikipedia. The study is based on the analysis of the log lines registered by the Wikimedia Foundation Squid servers after having sent the appropriate content in response to users' requests. The analysis covers the ten most visited editions of Wikipedia and involved more than 14,000 million log lines corresponding to the traffic of the entire year 2009. The methodology mainly consisted of parsing and filtering users' requests according to the study directives. The relevant information fields were then stored in a database for persistence and further characterization. We first assessed whether the traffic to Wikipedia could serve as a reliable estimator of the overall traffic to all the Wikimedia Foundation projects. Our subsequent analysis of the temporal evolution of the different types of requests to Wikipedia revealed interesting differences and similarities among them that can be related to the users' attention to the Encyclopedia. In addition, we performed separate characterizations of each Wikipedia edition to compare their respective evolutions over time.
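The parse, filter, and store pipeline outlined in the abstract might look roughly like the following minimal sketch. It is not the authors' implementation: the two-field log layout, the `requests` table, and the function names `parse_line`, `store_requests`, and `requests_per_month` are all assumptions made for illustration, and the sample lines are invented rather than taken from the Squid logs used in the study.

```python
import re
import sqlite3
from datetime import datetime

# Hypothetical log layout (an assumption, not the real Squid log format):
#   <ISO timestamp> <requested URL>
SAMPLE_LINES = [
    "2009-01-05T10:15:00 http://en.wikipedia.org/wiki/Madrid",
    "2009-01-05T11:20:00 http://es.wikipedia.org/wiki/Madrid",
    "2009-02-10T09:00:00 http://en.wikipedia.org/wiki/Squid",
    "2009-02-11T09:30:00 http://commons.wikimedia.org/wiki/Main_Page",
    "malformed line",
]

# Matches only Wikipedia editions; requests to other projects are filtered out.
WIKIPEDIA_URL = re.compile(r"^https?://([a-z\-]+)\.wikipedia\.org/")


def parse_line(line):
    """Return (timestamp, edition) for a Wikipedia request, or None otherwise."""
    parts = line.split()
    if len(parts) != 2:
        return None
    match = WIKIPEDIA_URL.match(parts[1])
    if match is None:
        return None
    try:
        timestamp = datetime.strptime(parts[0], "%Y-%m-%dT%H:%M:%S")
    except ValueError:
        return None
    return timestamp, match.group(1)


def store_requests(lines, connection):
    """Parse and filter the lines, insert the survivors, return how many were kept."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS requests (ts TEXT, edition TEXT)"
    )
    rows = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            rows.append((parsed[0].isoformat(), parsed[1]))
    connection.executemany("INSERT INTO requests VALUES (?, ?)", rows)
    connection.commit()
    return len(rows)


def requests_per_month(connection):
    """Count stored requests per (YYYY-MM, edition) pair."""
    cursor = connection.execute(
        "SELECT substr(ts, 1, 7), edition, COUNT(*) FROM requests "
        "GROUP BY substr(ts, 1, 7), edition "
        "ORDER BY substr(ts, 1, 7), edition"
    )
    return cursor.fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    kept = store_requests(SAMPLE_LINES, conn)
    print(f"kept {kept} of {len(SAMPLE_LINES)} lines")
    for month, edition, count in requests_per_month(conn):
        print(month, edition, count)
```

Grouping by month and edition is only one possible temporal aggregation; the same table could be grouped by day or hour to look at finer-grained patterns.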
| Item Type: | Article |
|---|---|
| Subjects: | Mathematics Computer Science |
| ID Code: | 8836 |
| Deposited By: | Israel Herraiz |
| Deposited On: | 07 Sep 2011 07:57 |
| Last Modified: | 17 Apr 2012 10:29 |





