@fabian@floss.social
aprxc β A command-line tool to estimate the number of distinct elements in a file/stream using Chakraborty/Vinodchandran/Meel approximation algorithmΒΉ.
βΈ Easier to remember & faster than sort | uniq | wc -l.
βΈ Bound memory usage
βΈ Exact until 80k unique values, 0.4β1% deviation beyond
βΈ v2 with less BS in the README
Useful? You decide! Try out:
βΈ uvx aprxc -h
βΈ pipx run aprxc -h
https://codeberg.org/fa81/aprxc
https://github.com/hellp/aprxc
ΒΉ https://arxiv.org/pdf/2301.10191#section.2
#math #ComputerScience #Python #CLI