creators_name: Dasdan, Ali
creators_name: Huynh, Xinh
type: conference_item
datestamp: 2009-04-06 19:13:05
lastmod: 2009-04-14 04:37:11
metadata_visibility: show
title: User-Centric Content Freshness Metrics for Search Engines
ispublished: pub
full_text_status: public
pres_type: poster
abstract: In order to return relevant search results, a search engine must keep its local repository synchronized to the Web, but it is usually impossible to attain perfect freshness. Hence, it is vital for a production search engine to continually monitor and improve repository freshness. Most previous freshness metrics, formulated in the context of developing better synchronization policies, focused on the web crawler while ignoring other parts of a search engine. But the freshness of documents in a web crawler does not necessarily translate directly into the freshness of search results as seen by users. We propose metrics for measuring freshness from a user’s perspective, which take into account the latency between when documents are crawled and when they are viewed by users, as well as the variation in user click and view frequency among different documents. We also describe a practical implementation of these metrics that were used in a production search engine.
date: 2009-04
pagerange: 1129-1129
event_title: 18th International World Wide Web Conference
event_location: Madrid, Spain
event_dates: April 20th-24th, 2009
event_type: conference
refereed: TRUE
citation: Dasdan, Ali and Huynh, Xinh (2009) User-Centric Content Freshness Metrics for Search Engines. In: 18th International World Wide Web Conference, April 20th-24th, 2009, Madrid, Spain.
document_url: http://www2009.eprints.org/145/1/p1129.pdf