IBM Skip to main contentUnited States
     Home  |  Products & services  |  Support & downloads  |  My account
 Select a country
 IBM Research
David Olshefski's homepage
Research interests
Publications
Patents
Karol Olszewski
Page Contact

 
David Olshefski's homepage   >   Publications   >
Prefetching based on Web Usage Mining

Written by: Daby Sow, David Olshefski, Mandis Beigi, Guru Banavar

Citation: Proceedings of the ACM/IFIP/USENIX International Middleware Conference 2003, p. 262-281. Rio de Janeiro, Brazil, June 16-20, 2003.

PDF  Published Version (202KB)
 Get Adobe® Reader®
Abstract:
This paper introduces a new technique for prefetching web content by learning the access patterns of individual users. The pre- diction scheme for prefetching is based on a learning algorithm, called Fuzzy-LZ, which mines the history of user access and identi es patterns of recurring accesses. This algorithm is evaluated analytically via a metric called learnability and validated experimentally by correlating learnabil- ity with prediction accuracy. A web prefetching system that incorporates Fuzzy-LZ is described and evaluated. Our experiments demonstrate that Fuzzy-LZ prefetching provides a gain of 41.5 % in cache hit rate over pure caching. This gain is highest for those users who are neither highly predictable nor highly random, which turns out to be the vast majority of users in our workload. The overhead of our prefetching technique for a typical user is 2.4 prefetched pages per user request.

  About IBM  |  Privacy  |  Terms of use  |  Contact