Skip to content

Script for calculating complexity of a data set according to complexity curve methodology.

License

Notifications You must be signed in to change notification settings

mipopolo/complexity_curve

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Complexity curve

Script for calculating complexity of a data set according to complexity curve methodology.

More information: https://peerj.com/articles/cs-76/

Script contains the following functions:

  • complexity_curve -- calculates full complexity curve for a given data set.
  • conditional_complexity_curve -- calculates conditional complexity curve for a given data set.
  • find_minimal_subset -- finds minimal subset of given data set with acceptable similarity to the original set (according to a given threshold).
  • find_minimal_subset_cond -- finds minimal subset of given data set using conditional complexity curve.

Usage example:

import numpy as np
from complexity_curve import *

X = np.random.random((100, 10))
y = (np.random.random(100) > 0.5)

print(complexity_curve(X))

print(conditional_complexity_curve(X, y))

print(find_minimal_subset(X, 0.01))

print(find_minimal_subset_cond(X, y, 0.01))

About

Script for calculating complexity of a data set according to complexity curve methodology.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%