Question Regarding TinyLFU increment() #1016

kevins981 · 2023-06-12T22:56:18Z

kevins981
Jun 12, 2023

Hi Caffeine Developer(s),

A quick question regarding the TinyLFU implementation:

I see from the TinyLFU paper that upon an increment, only the minimal counters are incremented and the rest are untouched (Section 3.2 of https://arxiv.org/pdf/1512.00727.pdf). In the Caffeine implementation, I cannot see where this happens. It seems like we are incrementing all 4 counters by calling incrementAt() four times. If I understood the code correctly, is there a particular reason for this difference?

Please correct me if I missed something.

Thank you,
Kevin

Answered by ben-manes

Jun 12, 2023

I believe @gilga1983 intended to use that scheme in his GuessingBloomFilter, but NeedToIncrement is always true (sources). It was likely mentioned for background material and to help alleviate fears about a sketch's accuracy. As readers might take their new knowledge and apply it elsewhere, awareness of this boosting mechanism would avoid them suffering lower quality if their use-cases were less tolerant to error.

The simulator has an option to evaluate that approach, as well as a perfect histogram. In my analysis it did not make a noticeable improvement to the hit rate (noise) so the small cost was not worthwhile. You are welcome to perform a fresh analysis to see the impact.

caffeine…

View full answer

ben-manes · 2023-06-12T23:44:05Z

ben-manes
Jun 12, 2023
Maintainer

I believe @gilga1983 intended to use that scheme in his GuessingBloomFilter, but NeedToIncrement is always true (sources). It was likely mentioned for background material and to help alleviate fears about a sketch's accuracy. As readers might take their new knowledge and apply it elsewhere, awareness of this boosting mechanism would avoid them suffering lower quality if their use-cases were less tolerant to error.

The simulator has an option to evaluate that approach, as well as a perfect histogram. In my analysis it did not make a noticeable improvement to the hit rate (noise) so the small cost was not worthwhile. You are welcome to perform a fresh analysis to see the impact.

caffeine/simulator/src/main/resources/reference.conf

Lines 179 to 185 in 3f4c159

    
           tiny-lfu { 
        
             # CountMinSketch: count-min-4 (4-bit), count-min-64 (64-bit) 
        
             # Table: random-table, tiny-table, perfect-table 
        
             sketch = count-min-4 
        
             # If increments are conservative by only updating the minimum counters for CountMin sketches 
        
             count-min.conservative = false

My conclusion was that it is an excellent idea when the sketch's accuracy is of significant importance for application correctness. A cache is best effort to improve performance, where TinyLFU only needs to consider relative value rather than report statistics externally. If both candidates are poor than a slightly less accurate choice still evicts an item that has a low probability for reuse, and similarly if both are very popular than they were both likely to be worth keeping. At very similar frequencies it is a toss-up, so Gil's insight was actually to separate the wheat from the chaff by retaining the heavy hitters instead of cache pollutants. Therefore a small accuracy improvement won't dramatically change our decisions and hit rates over a long run since we care about large difference in frequencies, halve the counters periodically, use small saturating counters, and recency ordering to pick the best victims.

Later on we introduced an hash flooding protection and an adaptive window, which would further cloud a straightforward analysis of whether improved accuracy could be beneficial. You are welcome to revisit this and suggest improvements, there is always something that could be done better!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question Regarding TinyLFU increment() #1016

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Question Regarding TinyLFU increment() #1016

kevins981 Jun 12, 2023

Replies: 1 comment

ben-manes Jun 12, 2023 Maintainer

kevins981
Jun 12, 2023

ben-manes
Jun 12, 2023
Maintainer