AutoML-Benchmark/binary-classification/datasets/kc1 at master · Alex-Lekov/AutoML-Benchmark

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
datapackage.json		datapackage.json
features.json		features.json
kc1.arff		kc1.arff
kc1.csv		kc1.csv
kc1.csv_profiling.html		kc1.csv_profiling.html

README.md

The resources for this dataset can be found at https://www.openml.org/d/1067

Author: Mike Chapman, NASA
Source: tera-PROMISE - 2004
Please cite: Sayyad Shirabad, J. and Menzies, T.J. (2005) The PROMISE Repository of Software Engineering Databases. School of Information Technology and Engineering, University of Ottawa, Canada.

KC1 Software defect prediction
One of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead features extractors of source code. These features were defined in the 70s in an attempt to objectively characterize code features that are associated with software quality.

Attribute Information

loc : numeric % McCabe's line count of code
v(g) : numeric % McCabe "cyclomatic complexity"
ev(g) : numeric % McCabe "essential complexity"
iv(g) : numeric % McCabe "design complexity"
n : numeric % Halstead total operators + operands
v : numeric % Halstead "volume"
l : numeric % Halstead "program length"
d : numeric % Halstead "difficulty"
i : numeric % Halstead "intelligence"
e : numeric % Halstead "effort"
b : numeric % Halstead
t : numeric % Halstead's time estimator
lOCode : numeric % Halstead's line count
lOComment : numeric % Halstead's count of lines of comments
lOBlank : numeric % Halstead's count of blank lines
lOCodeAndComment: numeric
uniq_Op : numeric % unique operators
uniq_Opnd : numeric % unique operands
total_Op : numeric % total operators
total_Opnd : numeric % total operands
branchCount : numeric % of the flow graph
problems : {false,true} % module has/has not one or more reported defects

Relevant papers

Shepperd, M. and Qinbao Song and Zhongbin Sun and Mair, C. (2013) Data Quality: Some Comments on the NASA Software Defect Datasets, IEEE Transactions on Software Engineering, 39.
Tim Menzies and Justin S. Di Stefano (2004) How Good is Your Blind Spot Sampling Policy? 2004 IEEE Conference on High Assurance Software Engineering.
T. Menzies and J. DiStefano and A. Orrego and R. Chapman (2004) Assessing Predictors of Software Defects", Workshop on Predictive Software Models, Chicago

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kc1

kc1

README.md

Attribute Information

Relevant papers

Files

kc1

Directory actions

More options

Directory actions

More options

Latest commit

History

kc1

Folders and files

parent directory

README.md

Attribute Information

Relevant papers