BitSet-Bloom-Filters

This is a fork of Callidon/bloom-filters that uses lemire/FastBitSet.js as its underlying data structure. The original project stores its bits in plain JS arrays of numbers, which can be heavy on memory.
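
To illustrate why a bit set tends to be lighter than an array holding one number per slot, here is a minimal sketch that packs 32 flags into each 32-bit word. `TinyBitSet` is a hypothetical name used only for this example; it is not the FastBitSet.js API, and actual memory savings depend on the JavaScript engine.

```javascript
// Minimal fixed-size bit set: 32 flags per Uint32Array word.
// Illustrative only; FastBitSet.js has its own, richer implementation.
class TinyBitSet {
  constructor(size) {
    this.size = size
    this.words = new Uint32Array(Math.ceil(size / 32))
  }

  // Set the bit at position `index`.
  add(index) {
    this.words[index >>> 5] |= 1 << (index & 31)
  }

  // Return true if the bit at position `index` is set.
  has(index) {
    return (this.words[index >>> 5] & (1 << (index & 31))) !== 0
  }
}

const bits = new TinyBitSet(1000)
bits.add(42)
console.log(bits.has(42)) // true
console.log(bits.has(43)) // false
console.log(bits.words.byteLength) // 128 (bytes backing 1000 flags)
```

In this sketch 1000 flags fit in 32 words, i.e. 128 bytes of backing storage.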

JavaScript/TypeScript implementation of probabilistic data structures: Bloom Filter (and its derivatives), HyperLogLog, Count-Min Sketch, Top-K and MinHash. This package relies on non-cryptographic hash functions.

📕Online documentation

Keywords: bloom filter, cuckoo filter, HyperLogLog, MinHash, Top-K, probabilistic data structures.

add(element: string) -> void: Add an element to the filter.
has(element: string) -> boolean: Test an element for membership, returning false if the element is definitely not in the filter and true if the element might be in the filter.
equals(other: BloomFilter) -> boolean: Test whether two filters are equal.
rate() -> number: Compute the filter's false positive rate (or error rate).

const { BloomFilter } = require('bloom-filters')
// create a Bloom Filter with a size of 10 and 4 hash functions
let filter = new BloomFilter(10, 4)
// insert data
filter.add('alice')
filter.add('bob')

// look up some data
console.log(filter.has('bob')) // output: true
console.log(filter.has('daniel')) // output: false (barring a false positive)

// print the error rate
console.log(filter.rate())

// alternatively, create a bloom filter optimal for a number of items and a desired error rate
const items = ['alice', 'bob']
const errorRate = 0.04 // 4 % error rate
filter = BloomFilter.create(items.length, errorRate)

// or create a bloom filter optimal for a collection of items and a desired error rate
filter = BloomFilter.from(items, errorRate)
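
For intuition about what "optimal for a number of items and a desired error rate" means, the sketch below applies the standard textbook sizing formulas for a classic Bloom filter. The function names `optimalSize`, `optimalHashCount` and `estimatedRate` are hypothetical, and this is an assumption about the general technique rather than a copy of what `BloomFilter.create` or `rate()` do internally.

```javascript
// Textbook sizing for a classic Bloom filter.
// n: expected number of items, p: target false positive rate.

// Number of bits: m = -n * ln(p) / (ln 2)^2
function optimalSize(n, p) {
  return Math.ceil(-(n * Math.log(p)) / (Math.LN2 * Math.LN2))
}

// Number of hash functions: k = (m / n) * ln 2, at least 1
function optimalHashCount(m, n) {
  return Math.max(1, Math.round((m / n) * Math.LN2))
}

// Approximate false positive rate after n insertions: (1 - e^(-k*n/m))^k
function estimatedRate(m, k, n) {
  return Math.pow(1 - Math.exp(-(k * n) / m), k)
}

const n = 2
const p = 0.04
const m = optimalSize(n, p)
const k = optimalHashCount(m, n)
console.log(m, k, estimatedRate(m, k, n))
```

With two items and a 4 % target, these formulas give a filter of 14 bits and 5 hash functions, with an estimated rate slightly below the target.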

Every hash function is seeded

By default, every hash function is seeded with an internal seed equal to 0x1234567890. To change it:

const { BloomFilter } = require('bloom-filters')
const bl = new BloomFilter(...)
console.log(bl.seed) // 78187493520
bl.seed = 0xABCD
console.log(bl.seed) // 43981
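
To show why the seed matters, here is a minimal sketch of a seeded hash and of how bit positions could be derived from it. It uses a seeded variant of 32-bit FNV-1a and double hashing purely for illustration; `seededHash` and `indexes` are hypothetical names, and the package's actual hash functions and index derivation may differ.

```javascript
// Seeded 32-bit FNV-1a variant (illustrative; NOT the package's hash function).
// The seed is truncated to 32 bits and mixed into the initial state.
function seededHash(str, seed) {
  let h = (0x811c9dc5 ^ seed) >>> 0
  for (let i = 0; i < str.length; i++) {
    h ^= str.charCodeAt(i)
    h = Math.imul(h, 0x01000193) >>> 0
  }
  return h
}

// Derive `hashCount` bit positions from two seeded hashes (double hashing).
function indexes(element, size, hashCount, seed) {
  const h1 = seededHash(element, seed)
  const h2 = seededHash(element, seed + 1)
  const result = []
  for (let i = 0; i < hashCount; i++) {
    result.push((h1 + i * h2) % size)
  }
  return result
}

console.log(indexes('alice', 10, 4, 0x1234567890))
console.log(indexes('alice', 10, 4, 0xABCD))
```

In a scheme like this, the same element generally maps to different bit positions under different seeds, so two filters are only meaningfully comparable when they were built with the same seed.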

Documentation

See documentation online or generate it in directory doc/ with: npm run doc

Tests

Tests run with Mocha and Chai:

# run tests
npm test

References

Classic Bloom Filter: Bloom, B. H. (1970). Space/time trade-offs in hash coding with allowable errors. Communications of the ACM, 13(7), 422-426.

Changelog

Version	Release date	Major changes
`v0.1.0`	08/05/2021	Classic only implementation with FastBitSet

License

MIT License
