Cannot parse multi-line pica records with 1.2.0 (MissingIdException) #155

dmj · 2013-12-05T09:04:00Z

PicaDecoder in 1.2.0 uses a regular expression to find the record id. The expression is defined as:

"003@ " + 0x1F + "0(.*)?" + 0x1E

By default "." does not match the newline character (cf. http://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html#DOTALL), thus the regexp fails if a multi-line record like the one in https://gist.github.com/dmj/7116314 is used.

cboehme · 2013-12-05T14:16:50Z

Version 1.2.0 still uses the old regex-based PicaDecoder which was not designed to handle multi-line pica records. The new PicaDecoder is not included in this minor release because it behaves differently from the old version in some situations. The new decoder is already merged into master and will be part of the next major release. Sorry about that.

dmj · 2013-12-08T09:48:32Z

Thanks. Wasn't sure if `as-records' is supposed to work with multi-line pica records or not. Turns out it isn't.

Add integration test for #155

ghost assigned cboehme Dec 5, 2013

cboehme closed this as completed Dec 5, 2013

cboehme mentioned this issue Dec 5, 2013

Field-separating newline in multi-line pica records part of subfield value #156

Closed

blackwinter pushed a commit that referenced this issue Dec 13, 2024

Add integration test for #155

ff2edee

blackwinter pushed a commit that referenced this issue Dec 13, 2024

Rename folder for integration test of #155

86f9e1c

blackwinter pushed a commit that referenced this issue Dec 13, 2024

Merge pull request #174 from metafacture/160-addIntegrationTestFor155

2a35b97

Add integration test for #155

blackwinter pushed a commit that referenced this issue Dec 13, 2024

Update passing integration tests (#102, #92, #121, #149, #155)

61e3748

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot parse multi-line pica records with 1.2.0 (MissingIdException) #155

Cannot parse multi-line pica records with 1.2.0 (MissingIdException) #155

dmj commented Dec 5, 2013

cboehme commented Dec 5, 2013

dmj commented Dec 8, 2013

Cannot parse multi-line pica records with 1.2.0 (MissingIdException) #155

Cannot parse multi-line pica records with 1.2.0 (MissingIdException) #155

Comments

dmj commented Dec 5, 2013

cboehme commented Dec 5, 2013

dmj commented Dec 8, 2013