-
Notifications
You must be signed in to change notification settings - Fork 4
/
datapackage.json
151 lines (151 loc) · 6.61 KB
/
datapackage.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
{
"name": "kc1",
"title": "Kc1",
"resources": [
{
"path": "kc1.arff",
"pathType": "local",
"name": "kc1",
"format": "arff",
"encoding": "ISO-8859-1"
},
{
"path": "kc1.csv",
"pathType": "local",
"name": "kc1",
"format": "csv",
"mediatype": "text/csv",
"encoding": "ISO-8859-1",
"dialect": {
"delimiter": ",",
"quoteChar": "\""
},
"schema": {
"fields": [
{
"name": "loc",
"type": "number",
"format": "default"
},
{
"name": "v(g)",
"type": "number",
"format": "default"
},
{
"name": "ev(g)",
"type": "number",
"format": "default"
},
{
"name": "iv(g)",
"type": "number",
"format": "default"
},
{
"name": "n",
"type": "number",
"format": "default"
},
{
"name": "v",
"type": "number",
"format": "default"
},
{
"name": "l",
"type": "number",
"format": "default"
},
{
"name": "d",
"type": "number",
"format": "default"
},
{
"name": "i",
"type": "number",
"format": "default"
},
{
"name": "e",
"type": "number",
"format": "default"
},
{
"name": "b",
"type": "number",
"format": "default"
},
{
"name": "t",
"type": "number",
"format": "default"
},
{
"name": "lOCode",
"type": "number",
"format": "default"
},
{
"name": "lOComment",
"type": "number",
"format": "default"
},
{
"name": "lOBlank",
"type": "number",
"format": "default"
},
{
"name": "locCodeAndComment",
"type": "number",
"format": "default"
},
{
"name": "uniq_Op",
"type": "number",
"format": "default"
},
{
"name": "uniq_Opnd",
"type": "number",
"format": "default"
},
{
"name": "total_Op",
"type": "number",
"format": "default"
},
{
"name": "total_Opnd",
"type": "number",
"format": "default"
},
{
"name": "branchCount",
"type": "number",
"format": "default"
},
{
"name": "defects",
"type": "boolean",
"format": "default"
}
],
"missingValues": [
""
]
}
}
],
"readme": "The resources for this dataset can be found at https://www.openml.org/d/1067\n\nAuthor: Mike Chapman, NASA \nSource: [tera-PROMISE](http://openscience.us/repo/defect/mccabehalsted/kc1.html) - 2004 \nPlease cite: Sayyad Shirabad, J. and Menzies, T.J. (2005) The PROMISE Repository of Software Engineering Databases. School of Information Technology and Engineering, University of Ottawa, Canada. \n \nKC1 Software defect prediction \nOne of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead features extractors of source code. These features were defined in the 70s in an attempt to objectively characterize code features that are associated with software quality.\n\n### Attribute Information \n\n1. loc : numeric % McCabe's line count of code\n2. v(g) : numeric % McCabe \"cyclomatic complexity\"\n3. ev(g) : numeric % McCabe \"essential complexity\"\n4. iv(g) : numeric % McCabe \"design complexity\"\n5. n : numeric % Halstead total operators + operands\n6. v : numeric % Halstead \"volume\"\n7. l : numeric % Halstead \"program length\"\n8. d : numeric % Halstead \"difficulty\"\n9. i : numeric % Halstead \"intelligence\"\n10. e : numeric % Halstead \"effort\"\n11. b : numeric % Halstead \n12. t : numeric % Halstead's time estimator\n13. lOCode : numeric % Halstead's line count\n14. lOComment : numeric % Halstead's count of lines of comments\n15. lOBlank : numeric % Halstead's count of blank lines\n16. lOCodeAndComment: numeric\n17. uniq_Op : numeric % unique operators\n18. uniq_Opnd : numeric % unique operands\n19. total_Op : numeric % total operators\n20. total_Opnd : numeric % total operands\n21. branchCount : numeric % of the flow graph\n22. problems : {false,true} % module has/has not one or more reported defects\n\n### Relevant papers \n\n- Shepperd, M. and Qinbao Song and Zhongbin Sun and Mair, C. (2013)\nData Quality: Some Comments on the NASA Software Defect Datasets, IEEE Transactions on Software Engineering, 39.\n\n- Tim Menzies and Justin S. Di Stefano (2004) How Good is Your Blind Spot Sampling Policy? 2004 IEEE Conference on High Assurance\nSoftware Engineering.\n\n- T. Menzies and J. DiStefano and A. Orrego and R. Chapman (2004) Assessing Predictors of Software Defects\", Workshop on Predictive Software Models, Chicago\n",
"description": "The resources for this dataset can be found at https://www.openml.org/d/1067\n\nAuthor: Mike Chapman, ",
"licenses": [
{
"name": "ODC-PDDL",
"path": "http://opendatacommons.org/licenses/pddl/",
"title": "Open Data Commons Public Domain Dedication and License"
}
]
}