A C++ template library for summarizing data streams.
StreamingCC implements various streaming algorithms and probabilistic data structures. They can summarize a data stream efficiently even when the data is too large to fit into memory.
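To give a feel for what such a summary looks like, here is a minimal, standalone Count-Min Sketch written in C++11. It is only a sketch of the idea and does not use or mirror StreamingCC's API: the class name `ToyCountMinSketch`, its methods, and the helper `demo_estimate` are hypothetical names made up for this illustration, and the row hashes are an ad hoc mix of `std::hash` rather than a proper pairwise-independent family.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <limits>
#include <string>
#include <vector>

// Illustrative only: NOT part of StreamingCC. All names here are hypothetical.
class ToyCountMinSketch {
public:
    // depth = number of hash rows, width = counters per row (both must be >= 1).
    ToyCountMinSketch(std::size_t depth, std::size_t width)
        : depth_(depth), width_(width), table_(depth * width, 0) {
        assert(depth > 0 && width > 0);
    }

    // Record `count` occurrences of `item`: bump one counter in every row.
    void add(const std::string& item, std::uint64_t count = 1) {
        for (std::size_t row = 0; row < depth_; ++row) {
            table_[row * width_ + bucket(item, row)] += count;
        }
    }

    // Estimated frequency: the minimum over the rows. Collisions can only
    // inflate a counter, so the estimate is never below the true count.
    std::uint64_t estimate(const std::string& item) const {
        std::uint64_t best = std::numeric_limits<std::uint64_t>::max();
        for (std::size_t row = 0; row < depth_; ++row) {
            best = std::min(best, table_[row * width_ + bucket(item, row)]);
        }
        return best;
    }

private:
    // Assumption: a simple row-dependent mix of std::hash stands in for the
    // pairwise-independent hash family used in the textbook analysis.
    std::size_t bucket(const std::string& item, std::size_t row) const {
        std::uint64_t h = std::hash<std::string>()(item);
        h ^= static_cast<std::uint64_t>(row + 1) * 0x9e3779b97f4a7c15ULL;
        h *= 0xff51afd7ed558ccdULL;
        h ^= h >> 33;
        return static_cast<std::size_t>(h % width_);
    }

    std::size_t depth_;
    std::size_t width_;
    std::vector<std::uint64_t> table_;  // depth_ x width_ counters, row-major
};

// Feed a tiny stream into the sketch and query one item.
std::uint64_t demo_estimate() {
    ToyCountMinSketch sketch(4, 64);
    const char* stream[] = {"a", "b", "a", "c", "a"};
    for (const char* s : stream) {
        sketch.add(s);
    }
    return sketch.estimate("a");  // never less than the true count of 3
}
```

The memory used is fixed at `depth * width` counters no matter how long the stream is, which is the trade-off these structures make: a bounded footprint in exchange for approximate answers.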
Algorithms and data structures included in StreamingCC:
- Count-Min Sketch
- Count-Sketch
- AMS Sketch
- Distinct Elements Counter (section 3)
- Reservoir Sampling
- Streaming Submodular Maximization
- Bloom Filter and its variants
- ...
- See StreamingCC to get started.
- See API Docs to dive straight into the API.
- CMake (>= 2.8.7)
- C++11 support required
- Armadillo (optional, required by some features)
The source code compiles to a static library. See xmerge.me/StreamingCC for details.
- Jiecao Chen (currently supported by NSF CCF-1525024)
- Qin Zhang
MIT License