Skip to content

Approximate cardinality estimation with HyperLogLog, as a Hive function

License

Notifications You must be signed in to change notification settings

MLnick/hive-udf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

An implementation of the HyperLogLog approximate cardinality estimation algorithm (as well as Linear Counting), as a Hive User-defined Aggregation Function (UDAF).

Relies on stream-lib for implementation of the relevant algorithms.

See the Wiki for usage instructions.

About

Approximate cardinality estimation with HyperLogLog, as a Hive function

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages