Skip to content

Latest commit

 

History

History
440 lines (310 loc) · 21 KB

jpWrappaUserManual.md

File metadata and controls

440 lines (310 loc) · 21 KB

Jpwrappa

User Manual 25 November 2013.

Author: Johan van der Knijff, KB / National Library of the Netherlands; contact: johan.vanderknijff@kb.nl

What is jpwrappa?

Jpwrappa is a Python tool that provides an easy-to-use interface to the j2kdriver application that is part of Aware’s JPEG 2000 SDK. Aware's encoder is very feature rich, however since all encoding options are provided as command-line arguments, this may result in excessively lengthy command lines. Here's an example:

j2kdriver --set-input-image balloon.tif --set-output-type JP2 --set-output-j2k-xform
I97 5 --set-output-j2k-color-xform YES --set-output-j2k-progression-order RPCL --set-
output-j2k-tile-size 1024 1024 --set-output-j2k-codeblock-size 6 6 --set-output-j2k-
channel-precinct-size ALL 0 7 7 --set-output-j2k-channel-precinct-size ALL 1 7 7 --
set-output-j2k-channel-precinct-size ALL 2 7 7 --set-output-j2k-channel-precinct-size
ALL 3 7 7 --set-output-j2k-channel-precinct-size ALL 4 8 8 --set-output-j2k-channel-
precinct-size ALL 5 8 8 --set-output-j2k-layers 8 --set-output-j2k-layer-ratio 0 2560
--set-output-j2k-layer-ratio 1 1280 --set-output-j2k-layer-ratio 2 640 --set-output-
j2k-layer-ratio 3 320 --set-output-j2k-layer-ratio 4 160 --set-output-j2k-layer-ratio
5 80 --set-output-j2k-layer-ratio 6 40 --set-output-j2k-layer-ratio 7 20  --set-
output-j2k-error-resilience ALL  --set-output-jp2-add-metadata-box XML balloon.xmp --
output-file-name balloon.jp2

Although it’s entirely possible to call the encoder from a batch file or shell script in the above way, this is not always a practical solution: it is quite hard to keep an overview of what’s going on, and the meaning of most of the arguments is not immediately obvious. Moreover, any modifications of existing sets of options will introduce the risk of something going wrong, and this may go completely unnoticed.

The jpwrappa software solves this problem by putting all encoding options in a formatted XML file. These files can be re-used, and may include comments (making them largely self-documenting). The wrapper software simply parses the options in the XML file, uses this information to create the Aware command line, and finally launches the encoder. Optionally metadata from a source image may be extracted from the source image, and embedded as an XML box in the JP2 (this functionality requires ExifTool).

Using jpwrappa, the sequence of command-line arguments in the example above can be reduced to something as simple as this:

jpwrappa.py balloon.tif balloon.jp2 –p demoAccesslossy.xml

Jpwrappa can either be used as a command-line tool, or as an importable module that can be called from other Python scripts.

Funding

The development of jpwrappa was partially funded by the EU FP 7 project SCAPE (SCAlabable Preservation Environments). More information about SCAPE can be found here:

http://www.scape-project.eu/

License

Jpwrappa is licensed under the Apache License, Version 2.0. (the "License"). You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Installation and configuration

Before you start installing jpwrappa, make sure you have a licensed copy of Aware's JPEG 2000 SDK installed on your system. Optionally, you may also want to install Exiftool, which is available for free.

Installation as a Python script (any platform)

To be written.

Debian package (Linux)

Simply download the package and open in with your package installer (double-clicking usually does the trick). Alternatively you may wish to use the command terminal, in which case you can use apt-get, e.g.:

sudo apt-get install jpwrappa_0.2.0_amd64.deb

Jpwrappa wraps around the Aware j2kdriver tool and (optionally) ExifTool as well. The locations of these tools are defined in a configuration file (config.xml), which is located in the directory /etc/jpwrappa. It looks like this:

<?xml version="1.0"?>
<!-- jpwrappa configuration file.
-->

<config>

<!-- location of Aware j2kdriver application -->

<j2kDriverApp>/usr/bin/j2kdriver</j2kDriverApp>

<!-- location of ExifTool application -->

<exifToolApp>/usr/bin/exiftool</exifToolApp>

</config>

The default values of j2kDriverApp and exifToolApp will probably be fine for most situations, but if these binaries are located somewhere else on your system you can just update the configuration file accordingly. Make sure not to wrap these file paths in quotation marks, even if they contain spaces, as Python's os.path library doesn't appear to like this. Note that in order to make any changes to the configuration file you will need to open it as administrator!

Windows executable

Download the zip file (e.g. jpwrapppa_0.2.0_win32.zip) and extract it to any directory you like. Jpwrappa wraps around the Aware j2kdriver tool and (optionally) ExifTool as well. The locations of these tools are defined in a configuration file (config.xml), which is located in the jpwrappa installation directory. It looks like this:

<?xml version="1.0"?>
<!-- jpwrappa configuration file. This file MUST be in the same directory as jpwrappa.py/ jpwrappa.exe!
-->
<config>

<!-- location of Aware j2kdriver application -->

<j2kDriverApp>C:\Program Files (x86)\Aware\JPEG2000_SDK\demo\j2kdriver.exe</j2kDriverApp>

<!-- location of ExifTool application -->

<exifToolApp>C:\exiftool\exiftool.exe</exifToolApp>

</config>

Update the values of j2kDriverApp and exifToolApp to the locations of these tools on your machine. Make sure not to wrap these file paths in quotation marks, even if they contain spaces, as Python's os.path library doesn't appear to like this.

Optionally, you may also want to include the directory containing jpylyzer.exe in your Windows Path environment variable. This will allow you to execute jpwrappa from any directory without having to type the full path each time.

Command-line syntax

usage: jpwrappa [-h] [-v] [-p PROFILE] [-s SUFFIX] [-m] [-l] imageIn imageOut

Positional arguments

  • imageIn: input image(s)
  • imageOut: output JP2 image (or directory where output images are written)

Optional arguments

  • -h, --help:show help message and exit
  • -v, --version: show program's version number and exit
  • -p PROFILE: JP2 profile
  • -s SUFFIX: suffix added to base name of output images
  • -m: embed metadata from input image (requires ExifTool)
  • -l: print XML-formatted log to stdout

Examples

Convert TIFF to JP2 using the default profile: jpwrappa balloon.tif balloon.jp2

As above, but using the demoAccessLossy profile: jpwrappa balloon.tif balloon.jp2 -p demoAccessLossy.xml

Convert all TIFF images in working directory to JP2, adding suffix _test to output JP2s: jpwrappa *.tif . -p demoAccessLossy.xml -s _test

(In this case, balloon.tif will be converted to balloon_test.jp2.)

As above, but generate XML-formatted log file:

jpwrappa *.tif . -p demoAccessLossy.xml -s _test -l > log.xml

Using jpwrappa as an importable Python module

The jpwrappa functions can also be imported in (and used by) other Python scripts. This involves 2 steps:

  1. Import jpwrappa in your own script by adding the following line at the top: import jpwrappa

  2. Now call the imagesToJP2 function, which uses the following syntax:

imagesOut,log=jpwrappa.imagesToJP2(imagesIn,fileOut,jp2Profile,suffixOut,flagMetadata)

Arguments

  • imagesIn: list of input image(s)
  • fileOut: name (including path) of output JP2 image, or directory for writing output images
  • jp2Profile: JP2 profile file
  • flagMetadata: flag that is True if metadata are extracted from the input image(s), and False otherwise

Output

  • imagesOut: a list object with the names of all generated output images
  • log: an element object that contains logging information (can easily be written as XML if needed)

JP2 profiles

The JP2 profile contains the encoding settings. The minimal example below shows the basic layout of the file:

<?xml version="1.0"?>

<options>
<aware>

<!--
jpwrappa default profile: Set output format to JP2 and use Aware defaults for everything else.
-->

<set-output-type>JP2</set-output-type>
<!--
	Sets output format to JP2
-->

</aware>
</options>

So the file's root element is options, which has a child element aware which contains the encoding options. The naming of these options is based on the naming used in Chapter 9 (‘Command Line Tools’) of the Aware manual, using the following naming convention:

  • Aware option: --aware-option value
  • Jpwrappa equivalent: <aware-option>value</aware-option>

Example 1: set progression order

  • Aware: -- set-output-j2k-progression-order RPCL
  • Jpwrappa: <set-output-j2k-progression-order>RPCL</set-output-j2k-progression-order>

Example 2: set transformation type and number of decomposition levels

  • Aware: --set-output-j2k-xform I97 5
  • Jpwrappa: <set-output-j2k-xform>I97 5</set-output-j2k-xform>

Note that the order in which Aware's options are given on the command line influences the results. Jpwrappa always uses the exact order that is used in the profile. Note that you can use XML comments to annotate your profiles and make them self-documented.

Example profiles

The jpwrappa installation includes a collection of example profiles for lossless and lossy encoding. Linux users who are using the Debian package of jpwrappa can find these profiles in the /etc/jpwrappa/profiles directory. Windows users should look for the profiles directory in the installation root. See Annex 1 of this User Manual (below). Annex 2 lists the full XML of one example.

Logging

You can generate detailed logging information on each converted image using the -l switch.

Each log entry contains the following information:

  • Log entry creation date and time
  • Names of input and output images
  • Information on whether the output image was created
  • Information on whether metadata extraction was enabled
  • A reference to the JP2 profile that was used
  • The exit status of the Aware encoder (and ExifTool, if metadata extraction was enabled)
  • Standard output and standard error output from the Aware encoder (and ExifTool,if metadata extraction was enabled)

As an example, below is a log entry for an image that could not be converted (in fact this was a zero-byte file):

<?xml version='1.0' encoding='ascii'?>
<jpwrappa>
<image>
<time>Tue Oct 02 17:45:01 2012</time>
<imageIn>E:\testjpwrappa\empty.tif</imageIn>
<imageOut>E:\testjpwrappa\empty.jp2</imageOut>
<jp2Created>False</jp2Created>
<exifExitStatus>0</exifExitStatus>
<exifStdOut />
<exifStdErr />
<awareExitStatus>0</awareExitStatus>
<awareStdOut />
<awareStdErr>Error: unable to read `empty.tif'

Warning: aw_j2k_get_output_image returned error code 203

</awareStdErr>
<profile>F:\johan\pythonCode\jpwrappa\profiles\default.xml</profile>
</image>
</jpwrappa>

Note

If imageIn refers to an image / a set of images that doesn't exist, jpwrappa will exit with an error message. For example:

Error: E:\TIFFSamples\dpp_colour_00990.tiff does not exist!

If the -l switch was used no error log is reported in this special case! This behaviour may change in future versions.

Error handling

Especially in case of the batch processing on multiple images it is important that one or two ‘bad’ input images do not cause a crash of the whole batch process. Perhaps even more importantly, any errors should be traceable. One problem is that the exit status of the Aware j2kdriver application is not a reliable indicator for establishing whether an image was converted successfully. Jpwrappa tries to handle any problems with ExifTool and/or the j2kdriver application in the following way:

  1. If metadata extraction is enabled, it calls ExifTool to extract metadata to an XMP file. If ExifTool encounters any problems, ExifTool generates an error which is reported in its exit status. This exit status is reported back to jpwrappa and added to the log file.
  2. If ExifTool’s exit status is not equal to 0 (meaning that ExifTool encountered some problem), jpwrappa will not call Aware’s j2kdriver application.
  3. If ExifTool’s exit status is equal to 0 (meaning that ExifTool ran without any problems), jpwrappa will call Aware’s j2kdriver application to do the conversion.
  4. In order to ensure that the file really was converted, jpwrappa performs a final check on the existence of the output JP2 image. Based on the outcome of this, it then either reports a message to the log file saying that the output image was created, or a warning that it was not created (see example above).
  5. In case jpwrappa encounters any of the aforementioned errors while processing one input image,it will simply proceed to the next image (after reporting the relevant message(s) to the log file).

Environment limitations

Depending on the operating system that is used, there is a limit on the number of characters that can be used in a command line. This is important, as jpwrappa works by transforming the information in the options file to a valid Aware command line. For Windows XP and more recent, the maximum number of characters equals 8191; for older versions a limit of 2047 characters applies [1]. On Linux/Unix systems, the actual value depends on the specific flavour, but in general it will exceed the Windows value by several orders of magnitude. Just to give an indication: the example command line that is given in the introduction in this User Manual is made up of 1067 characters. So, these limitations are unlikely to cause any major problems, unless extremely long file paths are used for the in- and output files.

Annex 1: Example profiles

This Annex documents example profiles that are supplied with jpwrappa. Linux users who are using the Debian package of jpwrappa can find these profiles in the /etc/jpwrappa/profiles directory. Windows users should look for the profiles directory in the installation root.

default.xml

Synopsis: minimalistic profile that produces JP2s according to the default settings of Aware's encoder (lossy compression). Note that this is also the default profile that is used by jpwrappa (i.e. if the user doesn't specify anything with the -p option).

demoMasterLossless.xml

Synopsis: lossless compression with error resilience features.

Parameter Value
File format JP2 (JPEG 2000 Part 1)
Compression type Lossless (reversible 5-3 wavelet filter)
Colour transform Yes (only for colour images)
Number of decomposition levels 5
Progression order RPCL
Tile size 1024 x 1024
Code block size 64 x 64 (26 x 26)
Number of quality layers 1
Target compression ratio N/A
Error resilience Start-of-packet headers; end-of-packet headers; segmentation symbols
Codestream comment demoMasterLossless

demoAccessLossy.xml

Synopsis: lossy compression with 20:1 target compression ratio, 8 quality layers, precincts and error resilience features.

Parameter Value
File format JP2 (JPEG 2000 Part 1)
Compression type Lossy (irreversible 9-7 wavelet filter)
Colour transform Yes (only for colour images)
Number of decomposition levels 5
Progression order RPCL
Tile size 1024 x 1024
Code block size 64 x 64 (26 x 26)
Precinct size 256 x 256 (28) for 2 highest resolution levels; 128 x 128 (27) for remaining resolution levels
Number of quality layers 8
Target compression ratio layer 1 2560:1
Target compression ratio layer 2 1280:1
Target compression ratio layer 3 640:1
Target compression ratio layer 4 320:1
Target compression ratio layer 5 160:1
Target compression ratio layer 6 80:1
Target compression ratio layer 7 40:1
Target compression ratio layer 8 20:1
Error resilience Start-of-packet headers; end-of-packet headers; segmentation symbols
Codestream comment demoAccessLossy

demoLossyJpylyzer.xml

Synopsis: lossy compression with 50:1 target compression ratio, 6 quality layers, precincts and error resilience features (profile used for creation of jpylyzer test files).

Parameter Value
File format JP2 (JPEG 2000 Part 1)
Compression type Lossy (irreversible 9-7 wavelet filter)
Colour transform Yes (only for colour images)
Number of decomposition levels 5
Progression order RPCL
Tile size 1024 x 1024
Code block size 64 x 64 (26 x 26)
Precinct size 256 x 256 (28) for 2 highest resolution levels; 128 x 128 (27) for remaining resolution levels
Number of quality layers 6
Target compression ratio layer 1 1600:1
Target compression ratio layer 2 800:1
Target compression ratio layer 3 400:1
Target compression ratio layer 4 200:1
Target compression ratio layer 5 100:1
Target compression ratio layer 6 50:1
Error resilience Start-of-packet headers; end-of-packet headers; segmentation symbols
Codestream comment demoAccessLossy

demoLosslessHarvard.xml

Synopsis: lossless profile based on Harvard College Library 'recipe' by Bill Comstock, described in this blog post.

demoLossyHarvard.xml

Synopsis: lossy profile based on Harvard College Library 'recipe' by Bill Comstock, described in this blog post. Defines target compression as Peak Signal-to-Noise Ratio (PSNR) of 46 dB.

optionsKBMasterLossless.xml

Synopsis: same as demoMasterLossless.xml (see above), but with different codestream comment that refers to KB-specific specifications.

optionsKBAccessLossy.xm

Synopsis: same as demoAccessLossy.xml (see above), but with different codestream comment that refers to KB-specific specifications.

Annex 2: profile layout example

The code below corresponds to the demoMasterLossless.xml example described in Annex 1 of this document.

<?xml version="1.0"?>

<options>
<aware>

<!--
****
jpwrappa options file 

Description: demo lossless JP2 encoding settings for preservation masters
Author: Johan van der Knijff
****
-->


<set-output-j2k-add-comment>demoMasterLossless TEXT MAIN</set-output-j2k-add-comment>
<!--
	Inserts reference to this options profile as codestream comment

-->


<set-output-type>JP2</set-output-type>
<!--
	Sets output format to JP2
-->


<set-output-j2k-xform>R53 5</set-output-j2k-xform>
<!--
	Sets compression type and number of decomposition levels
	- First argument: compression type:
 		* R53: reversible 5-3 wavelet filter
		* I97: irreversible 9-7 filter 
	- Second argument: number of decomposition levels (dl)
		Note that this controls the number of resolution levels (rl): rl = dl + 1

-->


<set-output-j2k-color-xform>YES</set-output-j2k-color-xform>
<!--
	Performs color rotation from RBG to YUV color spaces if input image 
	is in RGB format.
-->


<set-output-j2k-progression-order>RPCL</set-output-j2k-progression-order>
<!--
	Sets progression order
-->


<set-output-j2k-tile-size>1024 1024</set-output-j2k-tile-size>
<!--
	Sets tile size
-->


<set-output-j2k-codeblock-size>6 6</set-output-j2k-codeblock-size>
<!--
	Sets code block size: not actual block size but its base 2 logarithm!
	Value of 6 corresponds to code block size 64 (2^6)

-->


<set-output-j2k-ratio>0</set-output-j2k-ratio>
<!--
	Sets compression ratio. A ratio of 0 indicates that all the quantized
	data should be included in the image. This creates lossless images 
	if the R53 wavelet is chosen using set-output-j2k-xform
-->


<set-output-j2k-error-resilience>ALL</set-output-j2k-error-resilience>
<!--
	Activates all error resilience features: start and end-of packet markers
	+ segmentation symbols
-->