System Support for EBI-Proxy Caching

Media Caching

Major scenarios of importance:

Proxy server caching
Intelligent algorithms for selective default caching:
To cache or not to cache - the question is now!

Demand video caching
Intelligent algorithms for selective default prefetch:
To prefetch - if, what, and when!

Virtual, Inc.:
An industry entrepreneur in the EBI aspect of the research:

Proxy Caching and Prefetching

Dejan Petkovic
dwecci@sezampro.yu
http://galeb.etf.rs/~dejan

Veljko Milutinovic
vm@etf.rs
http://galeb.etf.rs/~vm

Traditional Proxy Servers

An add-on to WWW servers to provide caching and security
(a part of WWW server)

References:

WWW cache related articles, papers, and reports:
http://cache.kaist.ac.kr/docs/related.html

WWW cache related links:
http://w3cache.icm.edu.pl/links/software.html

The Cache Now! campaign is designed to increase the awareness and use of proxy caches
on the Web:
http://vancouver-webpages.com/CacheNow/

Proxy caching

Topics of Interest

Introduction

Client - Proxy - Server architecture

Caching

Hierarchical caching (client, proxy^(c), proxy^(i), proxy^(s), server)
Transport characteristics

Cache characteristics
Removal policy
Coherence
Implementation
Proxy configuring

Prefetching

Prefetching in theory and practice:
active and passive prefetching

Statistics
Future trends

Client - Proxy - Server Architecture

Problems with traditional client-server architectures:

Server overload and protection;
Network congestion;
Long response times.

System data flow

Proxy services:
(a) Firewall
(b) Caching
(c) Prefetching

Proxy forms:
Server
Client
Intermediate

Cache stores a local copy of the requested object:

Reducing hits to a server,
Reducing the number of bytes over the Internet,
Reducing time that users wait for an object to load.

Prefetching:
storing the local copy
of "not-yet-but-probably-soon" requested object,
reducing the latencies.

Hierarchical caching

Cache hierarchy at the School of Electrical Engineering

Client cache - built into a Web browser:

Persistent - retains its documents
between two invocations of the Web browser
(e.g., Netscape Navigator);
Non-persistent - deallocates any memory or disk used for caching when the user quits the browser
(e.g., Mosaic).

Proxy cache is located on a machine
on the path from multiple clients to multiple servers.

Parent: proxy → rti7020
Peers: proxy ↔ proxy2

Prefetch: local or server hinted

Transport characteristics

Volume of an object
(Information on server)

Internal transfer rate
(Could be estimated)

External transfer rate
(Could be estimated)

Probability of future access to the same object

Latencies

Cache characteristics

Limited storage space
⇒ Removal policy

Limited disk I/O throughput
⇒ Limited number of connections

Removal policy

Caching policy - what should be cached (html, gif,...)
and what should not (audio, video, queries, long files, dynamic docs).

Removal policy - what should be removed and when.

Removal algorithm sorts the cached objects by one or more keys
and removes them in order.

Replacement - removal on demand.

Proposed algorithms:

First in first out sorts objects by the cache entering time (CET)
and removes those with the smallest CET.

Least recently used sorts objects by last access time (LAT)
and removes those with the smallest LAT.

Least frequently used sorts objects by number of references (NR)
and removes those with the smallest NR.
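
The three policies above differ only in the sort key. A minimal sketch, assuming a hypothetical CachedObject record whose field names (cet, lat, nr) mirror the abbreviations used above; the sample values are invented for illustration:

```python
from dataclasses import dataclass


@dataclass
class CachedObject:
    url: str
    size: int    # bytes
    cet: float   # cache entering time
    lat: float   # last access time
    nr: int      # number of references


# Each policy is a sort key; the object with the smallest key is removed first.
POLICY_KEYS = {
    "FIFO": lambda o: o.cet,
    "LRU": lambda o: o.lat,
    "LFU": lambda o: o.nr,
}


def removal_order(objects, policy):
    """Return object URLs in the order the given policy would remove them."""
    return [o.url for o in sorted(objects, key=POLICY_KEYS[policy])]


# Illustrative data only.
objects = [
    CachedObject("a.html", 4_000, cet=1.0, lat=9.0, nr=5),
    CachedObject("b.gif", 20_000, cet=2.0, lat=3.0, nr=1),
    CachedObject("c.html", 8_000, cet=3.0, lat=6.0, nr=2),
]

for name in POLICY_KEYS:
    print(name, removal_order(objects, name))
```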

LRU-MIN tests whether the cache holds any documents equal to or larger in size than the incoming document;
if there are, it removes one of them by LRU;
otherwise, it considers all documents larger than half the size of the incoming document;
if there are any, it removes one of them by LRU.
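
The two steps of LRU-MIN described above can be sketched as follows. This is a minimal illustration, assuming the cache is a plain list of (url, size, last_access_time) tuples; the function name and the sample data are hypothetical:

```python
def lru_min_victim(cache, incoming_size):
    """Pick the document LRU-MIN would remove, or None if neither step applies.

    cache: list of (url, size, last_access_time) tuples.
    """
    # Step 1: documents equal to or larger than the incoming document.
    same_or_larger = [d for d in cache if d[1] >= incoming_size]
    # Step 2: documents larger than half the size of the incoming document.
    larger_than_half = [d for d in cache if d[1] > incoming_size / 2]
    for candidates in (same_or_larger, larger_than_half):
        if candidates:
            # LRU among the candidates: smallest last access time.
            return min(candidates, key=lambda d: d[2])[0]
    return None


# Illustrative data only.
cache = [("a", 100, 2.0), ("b", 60, 1.0), ("c", 120, 3.0)]
print(lru_min_victim(cache, 110))
```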

LRU-THOLD is identical to LRU,
except that no document larger than a threshold size is cached.
(Even if the cache has room, a document whose size is larger than the threshold is never cached.)

Hyper-G sorts objects by the number of references (NR) as a primary key,
LAT as a secondary key, and Size as a tertiary key.
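
A multi-key ordering of this kind maps directly onto a tuple sort key. The sketch below assumes that fewer references and older accesses are removed first, and that among ties the larger object goes first; the tie-break direction for Size is an assumption, not stated above:

```python
def hyper_g_order(objects):
    """Return URLs in removal order: NR, then LAT, then Size (largest first, assumed)."""
    ordered = sorted(objects, key=lambda o: (o["nr"], o["lat"], -o["size"]))
    return [o["url"] for o in ordered]


# Illustrative data only.
hyper_g_sample = [
    {"url": "a", "nr": 2, "lat": 5.0, "size": 10},
    {"url": "b", "nr": 1, "lat": 7.0, "size": 10},
    {"url": "c", "nr": 1, "lat": 3.0, "size": 50},
    {"url": "d", "nr": 1, "lat": 3.0, "size": 80},
]
print(hyper_g_order(hyper_g_sample))
```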

Pitkow-Recker determines the relationship
between the number of document requests during a period (called the window)
and the probability of access on a subsequent day (called the pane).

Space Working Set removes the largest file in the cache.

Space-Time Working Set removes the document
with the largest product of time since last access and byte size (size × time).

Space-Time Product removes the document with the greatest size × time^y,
where y is a parameter close to 1 (suggested 1.4).

Space-Time-Cost Working Set removes the file with the highest Size × Time / Cost,
where Size is the byte size,
Time is the time since last access, and
Cost is the time needed to fetch the document.
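
The four size-based policies above each assign a score and remove the document with the highest one. A minimal sketch, assuming documents are dicts with hypothetical keys size (bytes), idle (time since last access), and cost (fetch time); the sample values are invented to show that the policies can disagree:

```python
def space_working_set(doc):
    return doc["size"]


def space_time_working_set(doc):
    return doc["size"] * doc["idle"]


def space_time_product(doc, y=1.4):
    return doc["size"] * doc["idle"] ** y


def space_time_cost(doc):
    return doc["size"] * doc["idle"] / doc["cost"]


def victim(docs, score):
    """Return the URL of the document with the highest score."""
    return max(docs, key=score)["url"]


# Illustrative data only.
docs = [
    {"url": "big.ps", "size": 500_000, "idle": 10.0, "cost": 50.0},
    {"url": "old.html", "size": 10_000, "idle": 1000.0, "cost": 200.0},
    {"url": "new.gif", "size": 20_000, "idle": 5.0, "cost": 0.5},
]

for score in (space_working_set, space_time_working_set,
              space_time_product, space_time_cost):
    print(score.__name__, victim(docs, score))
```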

CERN httpd3 takes into account the age of a document, time since last access,
expiration date, network delay, and byte size. Each of these factors changes
linearly from 0 to 1 according to the formula:

attribute_factor=1-(document_attribute)/(max_attribute)

The worth of a document is the product of all five factors.
The max_attribute is usually set in the configuration file.
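
The worth computation can be sketched directly from the formula above. The attribute names and maximum values below are hypothetical placeholders, and clamping a factor at 0 when an attribute exceeds its maximum is an assumption:

```python
def attribute_factor(value, max_value):
    """1 - value/max, clamped at 0 (the clamp is an assumption)."""
    return max(0.0, 1.0 - value / max_value)


def worth(doc, max_attrs):
    """Product of the factors of all configured attributes."""
    result = 1.0
    for name, max_value in max_attrs.items():
        result *= attribute_factor(doc[name], max_value)
    return result


# Hypothetical maxima, standing in for values from a configuration file.
max_attrs = {"age": 100.0, "idle": 100.0, "expiry": 100.0,
             "delay": 10.0, "size": 1000.0}
doc = {"age": 50.0, "idle": 50.0, "expiry": 50.0, "delay": 5.0, "size": 500.0}

print(worth(doc, max_attrs))  # every factor is 0.5, so the worth is 0.5 ** 5
```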

Bolot-Hoschka proposed two weighting functions:

W(t_i, S_i, rtt_i, ttl_i) = w3/t_i
W(t_i, S_i, rtt_i, ttl_i) = w1 × rtt_i + w2 × S_i + (w3 + w4 × S_i)/t_i

w1, w2, w3, w4	-	Weights
t_i	-	The time since the document was last referenced
S_i	-	The size of the document
rtt_i	-	The time it took to retrieve the document
ttl_i	-	The time to live
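
Both weighting functions can be written as one Python function, since the first is the second with w1 = w2 = w4 = 0. A minimal sketch; the weight values in the example are arbitrary, and the ttl argument is kept only to mirror the signature above, as neither formula uses it:

```python
def bolot_hoschka_weight(t, size, rtt, ttl, w1, w2, w3, w4):
    """W = w1*rtt + w2*size + (w3 + w4*size)/t."""
    return w1 * rtt + w2 * size + (w3 + w4 * size) / t


# First function: only w3 is non-zero, so the weight depends on recency alone.
recency_only = bolot_hoschka_weight(t=4.0, size=100, rtt=2.0, ttl=0.0,
                                    w1=0.0, w2=0.0, w3=1.0, w4=0.0)

# Second function with arbitrary illustrative weights.
combined = bolot_hoschka_weight(t=4.0, size=8, rtt=2.0, ttl=0.0,
                                w1=1.0, w2=0.5, w3=2.0, w4=0.25)

print(recency_only, combined)
```

With only w3 non-zero the weight is proportional to 1/t_i, so ranking documents by it gives the same order as ranking them by last access time.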

Latency-based Removal (LAT) selects for replacement the object i
with the smallest estimated download time, denoted d_i:

d_i = clat_ser(i) + s_i/cbw_ser(i).

Hybrid Removal (HYB) selects for replacement the object i
with the lowest value of the following expression:

(clat_ser(i) + W_B/cbw_ser(i)) × (nref_i^W_N) / s_i,

ser(i)	-	The server on which object i resides
clat_j	-	Estimated latency of a connection to server j
cbw_j	-	Estimated bandwidth of the connection to server j (in bytes/second)
s_i	-	The object's size
nref_i	-	Number of references to the object i since it last entered the cache
W_B, W_N	-	Constants that set the relative importance of the variables
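
The LAT and HYB expressions can be evaluated as below. This is a sketch under stated assumptions: each object carries the latency and bandwidth estimates of its own server in hypothetical fields clat and cbw, and the values chosen for W_B and W_N are arbitrary:

```python
def lat_estimate(obj):
    """d_i = clat + s_i / cbw: estimated download time of the object."""
    return obj["clat"] + obj["size"] / obj["cbw"]


def hyb_value(obj, w_b, w_n):
    """(clat + W_B / cbw) * nref ** W_N / s_i."""
    return (obj["clat"] + w_b / obj["cbw"]) * obj["nref"] ** w_n / obj["size"]


# Illustrative data only.
candidates = [
    {"url": "near.html", "size": 2000, "clat": 0.5, "cbw": 1000.0, "nref": 4},
    {"url": "far.html", "size": 2000, "clat": 3.0, "cbw": 500.0, "nref": 1},
]

# Both policies remove the object with the smallest value.
lat_victim = min(candidates, key=lat_estimate)["url"]
hyb_victim = min(candidates, key=lambda o: hyb_value(o, w_b=1000.0, w_n=0.5))["url"]
print(lat_victim, hyb_victim)
```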

Removal policy could be run:

On demand - when the size of requested object exceeds free room in a cache
Periodically - every T units.
Both periodically and on demand (Pitkow-Recker).
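
The difference between the two triggers can be sketched with a small LRU cache. The class, its method names, and the low_mark parameter (the fraction of capacity a periodic sweep shrinks the cache to) are hypothetical; a real proxy would call periodic_sweep from a timer every T units:

```python
from collections import OrderedDict


class LruCache:
    def __init__(self, capacity, low_mark=0.5):
        self.capacity = capacity
        self.low_mark = low_mark      # hypothetical sweep target (fraction of capacity)
        self.used = 0
        self.items = OrderedDict()    # url -> size, least recently used first

    def _evict_until(self, limit):
        while self.used > limit and self.items:
            _, size = self.items.popitem(last=False)
            self.used -= size

    def get(self, url):
        if url not in self.items:
            return False
        self.items.move_to_end(url)   # mark as most recently used
        return True

    def put(self, url, size):
        if size > self.capacity:
            return False              # object can never fit
        if url in self.items:
            self.used -= self.items.pop(url)
        # On demand: evict only when the new object exceeds the free room.
        self._evict_until(self.capacity - size)
        self.items[url] = size
        self.used += size
        return True

    def periodic_sweep(self):
        # Periodic: shrink to the low mark regardless of pending requests.
        self._evict_until(self.capacity * self.low_mark)


demo = LruCache(capacity=100)
demo.put("a", 40)
demo.put("b", 40)
demo.get("a")
demo.put("c", 40)                     # on-demand removal evicts "b"
after_demand = list(demo.items)
demo.periodic_sweep()                 # sweep evicts "a" to reach 50% of capacity
after_sweep = list(demo.items)
print(after_demand, after_sweep)
```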

Coherence

Web browsers can be configured to validate their caches
every time an object is requested, once per session, or never.

Proxy cache coherence maintenance based on:

Explicit document's expiry date;
Predicted document's expiry date (based on last document validation or request);
"Staleness threshold" calculated by proxy cache manager;
Periodical validation;
User's demand.
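
The first two criteria can be combined in a simple freshness check. This is a minimal sketch, not how any particular proxy does it: when no explicit expiry date is known, the expiry is predicted as the last validation time plus a fixed default_ttl, which is an assumed rule chosen only for illustration:

```python
def is_fresh(now, last_validated, expires=None, default_ttl=3600.0):
    """Return True if the cached copy may be served without revalidation.

    expires: explicit expiry time, if the document supplied one.
    default_ttl: hypothetical lifetime used to predict an expiry otherwise.
    """
    expiry = expires if expires is not None else last_validated + default_ttl
    return now < expiry


print(is_fresh(now=100.0, last_validated=0.0, expires=200.0))
print(is_fresh(now=4000.0, last_validated=0.0))
```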

Implementation

Some popular proxies: Squid, Netscape proxy server, WinGate,...

The most commonly employed algorithm is LRU, with the removal policy run on demand.

Proxy configuring

Squid - configuration file, read on start-up (disk space, LRU High/Low marks).

WinGate - the GateKeeper program, for dialog-based configuration.

Prefetching

Local - clients (browsers, proxy)
use local information (e.g., access patterns)
to determine which objects to prefetch.
Prefetch policy:

-Administrator hinted

-Images

-Referenced documents

-Objects found in the access list

Server hinted - clients use information
provided by server to determine prefetch.
Server tracks access patterns
and information about object to suggest prefetch.

Statistics

External latency: 80%

Cache with unlimited storage:

Total latency reduction: 24%

Hit rate: 50-55% (30-50% in practice)

Prefetching:

Local - total latency reduction: 41%

Server hinted - total latency reduction: 57%

Weighted hit rate: 5-10% less than hit rate.
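
The two metrics differ in what they count: hit rate counts requests, while weighted hit rate counts bytes. A minimal sketch over a hypothetical request log of (size_in_bytes, was_hit) pairs; the sample log is invented and is not the data behind the figures above:

```python
def hit_rates(log):
    """Return (hit rate, weighted hit rate) for a list of (size, was_hit) pairs."""
    hits = sum(1 for _, hit in log if hit)
    hit_bytes = sum(size for size, hit in log if hit)
    total_bytes = sum(size for size, _ in log)
    return hits / len(log), hit_bytes / total_bytes


# Illustrative log: the hits are the small objects.
log = [(1000, True), (3000, False), (1000, True), (5000, False)]
hit_rate, weighted_hit_rate = hit_rates(log)
print(hit_rate, weighted_hit_rate)
```

When small objects hit more often than large ones, as in this sample, the weighted hit rate falls below the hit rate.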

Future trends

Advanced Caching

Subject recognition,

Spatial locality (server hinted and client estimated),

Prefetching based on spatial and temporal locality.

Virtual Proxy Servers

A middle layer between WWW servers and browsers of clients,
responsible not only for caching and security,
but also for search, indexing, filtering, profiling, agenting...

3 + 3

References:

Pitkow, Recker. "A Simple Yet Robust Caching Algorithm Based on Dynamic Access Patterns",
Proceedings of the Second World Wide Web Conference '94:Mosaic and the Web,
http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/DDay/pitkow/caching.html.

Williams, Abrams, Standridge, Abdulla, Fox, "Removal Policies in Network Caches for World-Wide Web Documents," http://www.acm.org/sigcomm/sigcomm96/papers/williams.html.

Roland P. Wooster, Marc Abrams, "Proxy Caching that Estimates Page Load Delays", WWW6, April 1997, pp. 325-334
http://www.cs.vt.edu/~chitra/docs/www6r./

Partl, Dingle "A Comparison of WWW Caching Algorithm Efficiency", http://webcache.ms.mff.cuni.cz:8080/paper/paper.html.

Wu, Liao "Virtual Proxy Servers for WWW and Intelligent Agents on Internet", Proceedings of the HCSS-97,
Maui, Hawaii, USA, January 1997, pp. 200-209.

V. N. Padmanabhan, J. C. Mogul, "Using Predictive Prefetching to Improve World Wide Web Latency," ACM
Computer Communication Review, pp. 22-36, vol. 27, no. 3, July 1996. http://daedalus.cs.berkeley.edu/publications/ccr-july96.ps.gz.

Ken-ichi Chinen, Suguru Yamaguchi, "An Interactive Prefetching Proxy Server for Improvement of WWW Latency".

Hiroyuki Inoue, Kanchana Kanchanasut, Suguru Yamaguchi "An Adaptive WWW Cache Mechanism in the AI3Network"
http://www.isoc.org/isoc/whatis/conferences/inet/97/proceedings/A1/A1_2.HTM.

Gihan V.Dias, Graham Cope and Ravi Wijayaratne, "A Smart Internet Caching System," INET96 Conference,
http://www.isoc.org/isoc/whatis/conferences/inet/96/proceedings/a4/a4_3.htm.

Jeffrey C Mogul, "Hinted Caching in the Web".

"Squid Internet Object Cache," http://squid.nlanr.net/.

Research at UB/IFACT

Two major research domains:

Algorithms
Exploiting spatial and temporal locality,
using past behavior and future correlation [Milutinovic97]

Tools
Efficient kernel modifications,
to enable experimenting with various algorithms.

Acknowledgments:

Vladan Dugaric
Dejan Petkovic

Exploring Spatial and Temporal Locality
in HTML Documents

Traditional caching

Based on temporal locality (LRU, LFU);
Hierarchical organization (proxy hierarchy).

Problems:

Hits in lower levels of the hierarchy hide the hits in higher levels;
Access to some objects dependent on access to some other objects.

Existing solutions:

Access patterns analyzed on a server side
⇒ suggested caching and prefetching.

Problem:

Server analyzes accesses only to local documents.
New protocols required.

Proposed solution:

To analyze fetched HTML documents and to track multiple references to objects.
Reference: pointer within the current HTML object to some other object.

Objects with more references in the current set of fetched documents
have higher probability of a repeated access in the relatively near future.

Objects with more accessed references in the current set of fetched documents
have higher probability of being accessed sooner than the referenced documents.
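
Tracking references across fetched documents can be sketched with Python's standard html.parser module. The class and function names are hypothetical, only href on a and src on img are treated as references, and each document counts at most once per target; all three choices are simplifying assumptions:

```python
from collections import Counter
from html.parser import HTMLParser


class ReferenceCollector(HTMLParser):
    """Collect link and image targets found in one HTML document."""

    REFERENCE_ATTRS = {"a": "href", "img": "src"}

    def __init__(self):
        super().__init__()
        self.references = []

    def handle_starttag(self, tag, attrs):
        wanted = self.REFERENCE_ATTRS.get(tag)
        for name, value in attrs:
            if wanted is not None and name == wanted and value:
                self.references.append(value)


def count_references(documents):
    """documents: dict url -> HTML text. Returns Counter target -> referring documents."""
    counts = Counter()
    for html in documents.values():
        collector = ReferenceCollector()
        collector.feed(html)
        counts.update(set(collector.references))
    return counts


# Illustrative documents only.
fetched_docs = {
    "Doc1.html": '<img src="Gif1.gif"><a href="Doc2.html">next</a>',
    "Doc2.html": '<img src="Gif1.gif">',
}
reference_counts = count_references(fetched_docs)
print(reference_counts.most_common())
```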

Problem:

Parsing takes CPU time (CPU used for proxy caching 5-10%).
Takes disk space for building tree structure.

Expected improvement in all levels of cache hierarchy.
Thesis research of Dejan Petkovic...

Example:

Two documents that share the same GIF
with equal probability of access p.
Browser fetches in the following order:
Doc1.html, Gif1.gif, Doc2.html.

Picture is fetched only once.
Probability of fetching GIF is 2∙p

LRU, LFU, and FIFO remove in the following order:
Doc1.html, Gif1.gif, Doc2.html.
If one estimates probability considering the number of references to each object, the removal order could be Doc1.html, Doc2.html, and then Gif1.gif,
giving more chances to Gif1.gif.
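
The example can be replayed in a few lines. The fetch times follow the order given above; the reference counts (2 for the shared GIF, 0 for each document) and the use of fetch time as a tie-breaker are assumptions made for this sketch:

```python
# (url, fetch_time, number_of_references_from_fetched_documents)
fetched = [
    ("Doc1.html", 1, 0),
    ("Gif1.gif", 2, 2),
    ("Doc2.html", 3, 0),
]

# Recency-only ordering: oldest first.
lru_order = [url for url, _, _ in sorted(fetched, key=lambda o: o[1])]

# Reference-aware ordering: fewest references first, oldest first among ties.
reference_order = [url for url, _, _ in sorted(fetched, key=lambda o: (o[2], o[1]))]

print(lru_order)
print(reference_order)
```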