-
Notifications
You must be signed in to change notification settings - Fork 13
/
Copy pathCHANGES.txt
194 lines (139 loc) · 7.32 KB
/
CHANGES.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
Pangool Change Log
====================
Pangool 0.71 (DEVELOPMENT)
BUG FIXES
- 2014-11-10 - HCatTupleInputFormat must read all fields as nullable.
Pangool 0.70
BUG FIXES
- 2013-11-27 - Solved an issue with not throwing an exception if repeated field-names for a Schema are provided.
- 2013-10-30 - Bug in TupleOutputFormat (NPE when not emitting any record) solved.
- 2013-10-04 - MultipleInputs was not handling special characters properly.
NEW FEATURES
- 2013-12-18 - #20 Backwards compatibility for tuple input formats and readers.
- 2013-12-17 - #34 TupleTextOutputFormat supports nulls as well
- 2013-10-01 - New strong typed getters for Tuple
- 2013-09-26 - New class Mutator for mutating schemas.
- 2013-09-11 - More flexible named outputs. By setting default parameters, any number of named outputs can be opened in runtime.
Pangool 0.60.6
NEW FEATURES
- 2013-09-01 - Official support for CDH4, with a small fix added. Maven profile "cdh4" added.
Pangool 0.60.3
NEW FEATURES
- 2013-04-03 - Support for MapReduce 2 and YARN. Includes Maven pom
modifications to support the compilation to MapReduce 1 and
MapReduce 2. A new maven profile "mr2" can be used to
create a Pangool version ready for MapReduce 2 and YARN.
Pangool 0.60.2
BUG FIXES
- 2013-03-13 - Text tokenizer was using escape character always, but it should use it only when there are quotes.
Pangool 0.60.1 (minor release)
NEW FEATURES
- 2013-03-12 - Specific per-Input-Format context configuration properties, like in Multiple outputs.
Pangool 0.60.0
IMPROVEMENTS
- 2013-02-21 - Native integration with HCatalog through HCatTupleInputFormat.
- 2013-02-12 - Native integration with Cascading through CascadingTupleInputFormat.
- 2013-02-12 - StoreFunc for Pig for saving Pangool Tuple files directly from Pig.
- 2013-01-07 - Nulls support. Now you can declare fields as nullable in schemas. Routines for
serializing and performing the proper comparison between nulls and objects
is properly provided.
- 2013-01-16 - Deep copy support for Tuples.
BUG FIXES
- 2013-03-07 - Solved problem with Doubles and Floats comparisons (using Double.compare and Float.compare now)
- 2013-02-22 - MapOnlyJobBuilder needed also to do a deep copy of the Configuration object
to avoid strange things in Job flows.
- 2013-02-14 - Strange race condition in BitField solved.
- 2013-01-22 - Changed default instances folder to user-located folder ./pangool-instances .
Before it was /tmp which caused users who didn't have access to it in the HDFS
to fail with this default value.
- 2013-01-22 - SolrOutputFormat needed to use an explicit absolute Path for a strange behavior
in Cygwin to be solved.
Pangool 0.50.1
IMPROVEMENTS
- 2012-12-24 - TupleMRBuilder was not creating a copy of the Configuration object it received.
While this may seem irrelevant, it is important as you might want to execute Jobs
in parallel and share some common Configuration. But it wasn't possible to do that.
- 2013-01-02 - DCUtils refactored to InstancesDistributor. No more use of postfixing.
- 2013-01-04 - Added method "cleanupInstanceFiles" in TupleMRBuilder and MapOnlyJobBuilder for
actively deleting temporary instance files in HDFS generated by Pangool.
BUG FIXES
- 2012-12-12 - On mappers, when user overwritting a "string" field with a String,
there were a casting exception. Now it is detected and a new Utf8 is created.
Pangool 0.5
IMPROVEMENTS
- 2012-11-27 - TupleFile for reading SequenceFiles with Tuples, natively, without using
Avro. So, from this version Avro is not mandatory anymore.
- Rewritten TupleInputFormat & TupleOutputFormat to use the new TupleFile
BUG FIXES
- 2012-11-20 - Integers or longs with "+" at the start failed with
TupleTextInputFormat
- 2012-11-21 - Improved TupleTextInputFormat. Better handling of escaping
and quotes
Pangool 0.43.5
BUG FIXES
- 2012-10-30 - Solr 4.0 dependency introduced jcl-over-slf4j as depedency.
That is completely incompatible with Hadoop, as Hadoop uses
a very old version of commons-logging.
- 2012-11-12 - Skip header in TupleTextInputFormat was not working. Solved.
IMPROVEMENTS
- 2012-10-30 - Changed TupleTextInputFormat for being able to support compression in input files and added a unit test for it.
- 2012-10-29 - Improved TupleTextInputFormat wiht a custom CSV Tokenizer for being able to handle nulls and differentiate them from empty strings.
- 2012-11-08 - TupleTextInputFormat now supports fixed width fields input
files.
CHANGES
- 2012-10-30 - Mark Solr as optional dependency on pom.xml. That means
that projetcs using this functionality should explicitely
declare Solr as dependency.
- 2012-10-30 - Change Hadoop dependency scope to "provided" in order to
reduce conflicts with other Hadoop library versions.
Pangool 0.43.4
BUG FIXES
- 2012-10-25 - Solved a bug in TupleTextInputFormat with headers. Headers was not properly
skipped
IMPROVEMENTS
- 2012-10-24 - Added Field Selection to TupleTextInputFormat
CHANGES
- 2012-10-09 - Migrated to Solr 4.0.0-BETA
Pangool 0.43.3 (2012-10-01)
IMPROVEMENTS
- 2012-10-1 - Solved bug in TupleSolrOutputFormat inherited from SolrOutputFormat where Hadoop Temp folder was not deleted if it existed
- 2012-07-30 - Added optional schema strict validation in ser(config parameter : "pangool.schema.validation")
- 2012-07-25 - Added some utility methods and refactored them a little bit
- 2012-06-29 - Miscellaneous improvements to Pangool-flow
Pangool 0.43.2 (2012-06-28)
BUG FIXES
- 2012-06-28 - NullPointerException with RollupReducer for empty reduce
tasks
Pangool 0.43.1 (17/05/2012)
NEW FEATURES
- 16/05/2012 - Possibility of serializing Tuples inside Tuples
- 17/05/2012 - Improved TupleSolrOutputFormat to copy all SOLR config into the output
Pangool 0.43.0 (11/05/2012)
NEW FEATURES
- 11/05/2012 - Implemented seamlessly integration with SOLR (TupleSolrOutputFormat)
Pangool 0.42.1 (09/05/2012)
NEW FEATURES
- 09/05/2012 - Minor : added utility in Fields to create Avro fields easily.
Pangool 0.42.0 (24/04/2012)
NEW FEATURES
- 24/04/2012 - Avro{Input,Output}Format Pangool's version using Serializable instances, not statically configured via Configuration
- 23/04/2012 - Avro{Input,Output}Format implemented in new hadoop's API (mapreduce.lib.{input,output}). Should be in avro-mapred artifact.
IMPROVEMENTS
- 23/04/2012 - Redesign of the custom serialization API
Pangool 0.41.0 (20/04/2012)
NEW FEATURES
- 18/04/2012 - Added custom serialization for type OBJECT
- 16/04/2012 - Added type BYTES
IMPROVEMENTS
- Simplification:custom serialization and comparison just for type OBJECT.
- Moved from Avro 1.6.2 to 1.6.3 (efficiency enhancement in Utf8)
Pangool 0.40.2 (2/04/2012)
NEW FEATURES
- 29/03/2012 - Added field aliases in TupleMRBuilder.
IMPROVEMENTS
- 29/03/2012 - Solved NullPointerException when used unknown field in Tuple.set(fieldname,value) and Tuple.get(fieldname)
Pangool 0.40.1 (27/03/2012)
BUG FIXES
- Fixed HashPartitioner bug when treating Utf8 or String with more than 1 reducer
Pangool 0.40 (5/03/2012)
- Initial Release!