Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Partitioned Learned Bloom Filter

Jun 05, 2020

Kapil Vaidya, Eric Knorr, Tim Kraska, Michael Mitzenmacher

Figure 1 for Partitioned Learned Bloom Filter

Figure 2 for Partitioned Learned Bloom Filter

Figure 3 for Partitioned Learned Bloom Filter

Share this with someone who'll enjoy it:

Abstract:Learned Bloom filters enhance standard Bloom filters by using a learned model for the represented data set. However, a learned Bloom filter may under-utilize the model by not taking full advantage of the output. The learned Bloom filter uses the output score by simply applying a threshold, with elements above the threshold being interpreted as positives, and elements below the threshold subject to further analysis independent of the output score (using a smaller backup Bloom filter to prevent false negatives). While recent work has suggested additional heuristic approaches to take better advantage of the score, the results are only heuristic. Here, we instead frame the problem of optimal model utilization as an optimization problem. We show that the optimization problem can be effectively solved efficiently, yielding an improved {partitioned learned Bloom filter}, which partitions the score space and utilizes separate backup Bloom filters for each region. Experimental results from both simulated and real-world datasets show significant performance improvements from our optimization approach over both the original learned Bloom filter constructions and previously proposed heuristic improvements.

* 13 pages, 3 figures

View paper on

Share this with someone who'll enjoy it:

Title:Partitioned Learned Bloom Filter

Paper and Code