- LibreOffice OOXML QA Automation Proposal (Mar/2013)
LibreOffice OOXML import/export/roundtrip testing is time consuming, and efficiency matters most in product release phases, when the testing plan is more intensive than usual.
This project is motivated by automating as much of the results verification as possible given the current state of the software, while streamlining the analysis of manually verified test results through user-friendly reporting tools.
- OOXML: MSOffice OpenXML format, with file name suffix docx/xlsx/pptx
- MSO03: MSOffice 97-03 binary document format, with file name suffix doc/xls/ppt
- ODF: Open Document Format, with file name suffix odt/ods/odp
Generally, the current OOXML testing processes can be summarized as:
OOXML --import--> PDF
- Open the OOXML document with LibreOffice
- Open the OOXML document with MSOffice
- Visually compare the documents line by line. Interactive elements in the document also need to be tested here (e.g. hyperlinks, indexes, fields, etc.).
- Verification:
  - Pass - If the document is rendered exactly the same in both applications
  - Feature Fail - If the document is not rendered the same, or interactive elements inside the document do not work correctly
  - Hang/Crash Fail - If LibreOffice hangs or crashes during the opening procedure
- If the test FAILs and it is a REGRESSION:
  - Feature fail:
    Describe the issue and report the bug in bugzilla with the following documents attached:
    - the original OOXML
    - a PDF of the OOXML produced through LibreOffice
    - a PDF of the OOXML produced through MSOffice
  - Hang/crash fail:
    Describe the issue and report the bug in bugzilla with the following documents attached:
    - the original OOXML
    - the crash backtrace
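Much of the attachment preparation for this path can be scripted on Linux. Below is a minimal sketch, assuming LibreOffice is installed and the soffice binary is on the PATH; the sample file name and output directory are only examples, and the PDF showing MSOffice's rendering still has to be exported on a machine with MSOffice installed.

    # Sketch: render an OOXML sample to PDF through LibreOffice headless mode,
    # to prepare the LibreOffice-side PDF attachment for a bug report.
    # Assumes the "soffice" binary (LibreOffice) is on the PATH.
    import subprocess
    import sys

    def convert_with_lo(source, target_format, outdir):
        """Convert 'source' to 'target_format' (e.g. "pdf" or "docx") via LibreOffice."""
        cmd = ["soffice", "--headless",
               "--convert-to", target_format,
               "--outdir", outdir, source]
        return subprocess.call(cmd)

    if __name__ == "__main__":
        sample = sys.argv[1]  # e.g. hyperlink-basic.docx (a hypothetical sample name)
        if convert_with_lo(sample, "pdf", "lo-pdf") != 0:
            print("LibreOffice failed to convert %s" % sample)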
MSO03 --export--> OOXML
- Open the MSO03 document with LibreOffice, then save it as OOXML
- Open the MSO03 document with MSOffice, then save it as OOXML
- Open both of the saved OOXML documents in MSOffice and visually compare them line by line. Interactive elements in the document also need to be tested here (e.g. hyperlinks, indexes, fields, etc.).
- Verification:
  - Pass - If the document is rendered exactly the same in both applications
  - Feature Fail - If the document is not rendered the same, or interactive functions inside the document do not work correctly (e.g. hyperlinks, indexes, fields, etc.)
  - Hang/Crash Fail - If LibreOffice hangs or crashes during the open/save procedure
- If the test FAILs and it is a REGRESSION:
  - Feature Fail:
    Describe the issue and report the bug in bugzilla with the following documents attached:
    - the original MSO03
    - a PDF of the OOXML produced through LibreOffice
    - a PDF of the OOXML produced through MSOffice
  - Hang/crash Fail:
    Describe the issue and report the bug in bugzilla with the following documents attached:
    - the original MSO03
    - the crash backtrace
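The LibreOffice half of this path (open MSO03, save as OOXML) can be batched on Linux; the MSOffice-side conversion and the visual comparison still have to be done on Windows. A minimal sketch, assuming soffice is on the PATH and the samples live in a local directory (the directory names and suffix mapping are assumptions):

    # Sketch: batch-convert MSO03 samples to OOXML through LibreOffice headless mode.
    import glob
    import subprocess

    SUFFIX_MAP = {"doc": "docx", "xls": "xlsx", "ppt": "pptx"}

    for old_suffix, new_suffix in SUFFIX_MAP.items():
        # "samples/" is a hypothetical location for the MSO03 test documents
        for sample in glob.glob("samples/*." + old_suffix):
            subprocess.call(["soffice", "--headless",
                             "--convert-to", new_suffix,
                             "--outdir", "lo-export", sample])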
Roundtrip testing should be done for both OOXML and MSO03 with the same steps; the testing path is determined by the input format:
MSO03 --export--> OOXML --import--> MSO03
OOXML --import--> MSO03 --export--> OOXML
Since the testing logic of the two paths is identical, the detailed steps are described below using MSO03 as the example. The OOXML testing shares exactly the same steps, only with the input/output document formats swapped:
- With LibreOffice, open the MSO03 document and save it as OOXML
- With LibreOffice, open the produced OOXML and save it back as MSO03
- With MSOffice, open both MSO03 documents mentioned in the above two steps and visually compare them line by line. Interactive elements in the document also need to be tested here (e.g. hyperlinks, indexes, fields, etc.).
- Verification:
  - Pass - If the documents are rendered exactly the same in MSOffice
  - Feature Fail - If the documents are not rendered the same, or interactive elements inside the document do not work consistently in MSOffice
  - Hang/Crash Fail - If LibreOffice hangs or crashes during either of the two conversion steps above
- If the test FAILs and it is a REGRESSION:
  - Feature failure:
    Identify whether it is an import or an export problem. Describe the issue and report the bug in bugzilla with the following documents attached:
    - the original MSO03
    - PDFs of the relevant OOXML or MSO03, depending on whether the import or export step failed
  - Hang/crash failure:
    Describe the issue and report the bug in bugzilla with the following documents attached:
    - the original MSO03
    - the crash backtrace
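The two LibreOffice conversion steps of the roundtrip can be chained, and the exit code or a timeout can be used to pre-classify Hang/Crash failures before any visual comparison. A minimal sketch, again assuming soffice on the PATH; the 5-minute timeout, directory names and the Writer-only suffixes are assumptions:

    # Sketch: MSO03 --export--> OOXML --import--> MSO03 roundtrip through LibreOffice,
    # with a timeout as a crude hang detector and the exit code as a crash indicator.
    import os
    import subprocess

    TIMEOUT = 300  # seconds per conversion step (an assumed value)

    def convert(source, target_format, outdir="roundtrip"):
        cmd = ["soffice", "--headless", "--convert-to", target_format,
               "--outdir", outdir, source]
        try:
            return subprocess.run(cmd, timeout=TIMEOUT).returncode == 0
        except subprocess.TimeoutExpired:
            return False  # treat a stuck conversion as a Hang/Crash Fail

    def roundtrip(mso03_sample):
        # step 1: MSO03 -> OOXML, e.g. samples/foo.doc -> roundtrip/foo.docx
        if not convert(mso03_sample, "docx"):
            return "Hang/Crash Fail (export step)"
        base = os.path.splitext(os.path.basename(mso03_sample))[0]
        # step 2: OOXML -> MSO03, e.g. roundtrip/foo.docx -> roundtrip/foo.doc
        if not convert(os.path.join("roundtrip", base + ".docx"), "doc"):
            return "Hang/Crash Fail (import step)"
        return "needs visual comparison in MSOffice"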
Imagine hundreds of test sample inputs: following the procedures described above, manually testing each of them quickly becomes a challenge. It therefore makes sense to review all the steps as a whole, since they share many common characteristics and much of the work can potentially be automated, made more efficient, or better organized. I will review those items from the following aspects one by one:
- Test documents management
- Reference documents generation
- Automated verification
- Regression identification
Considering the management of hundreds of test documents, three major problems arise in the testing practice.
The test input can be either an OOXML or an MSO03 file, for the import or export test respectively. For a particular testing topic (e.g. hyperlink, font size, embedded picture, etc.), we want the same test content to be tested in all of the import, export and roundtrip scenarios.
On the other hand, when the testing fails, we want PDFs to be exported for sharing bugs with other people (who probably do not have MSOffice available to see the correct behavior).
As a result, the manifest of what we need for sample documents is:
- OOXML, MSO03 documents
- PDF documents for each of the testing samples
We once considered using an external test case management tool to tag or name the testing topics of test documents. However, relying on external tools to classify test documents does not bring much convenience, especially when the test documents are passed around as standalone files. Instead, encoding the high-level test topic in the file name itself is simpler and more convenient.
The test document set will evolve over time as samples are added, updated and removed, so it makes sense to store the test samples in a centralized, maintainable storage. With the same up-to-date test document set fetched, tests can then be executed on any platform (SUSE, Ubuntu, Fedora, etc.) at any time.
Since we usually execute and analyze the tests on Linux, where MSOffice cannot be installed natively to verify the correct rendering behavior, it makes sense to have a corresponding PDF for each of the original documents, as well as for all documents generated throughout the testing procedures. Consequently we will not have to review testing results by switching between Linux and Windows.
As shown in the test procedure section, we ultimately want to verify every test result by visual comparison. The most direct way to do this automatically is to compare images derived from the testing results of the various testing phases using graphics techniques.
The limitation of visual comparison is that we cannot test the import/export quality of interactive elements. Even if graphics techniques help identify differences between images, random factors may still influence the accuracy. Hence manual intervention in the verification of testing results seems inevitable.
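As an illustration of such a graphics technique, the sketch below rasterizes the first page of two PDFs with pdftoppm (from poppler-utils) and performs a naive pixel difference with Pillow. This is only a sketch under those tool assumptions; real verification would need per-page handling and a tolerance threshold to absorb the random factors mentioned above.

    # Sketch: naive visual comparison of two PDFs (e.g. LibreOffice output vs. MSOffice output).
    # Assumes pdftoppm (poppler-utils) and the Pillow library are installed.
    import glob
    import subprocess
    from PIL import Image, ImageChops

    def rasterize_first_page(pdf_path, out_prefix):
        # -f 1 -l 1 limits rendering to the first page; -r 150 sets the resolution in DPI
        subprocess.check_call(["pdftoppm", "-png", "-r", "150",
                               "-f", "1", "-l", "1", pdf_path, out_prefix])
        return sorted(glob.glob(out_prefix + "-*.png"))[0]  # pdftoppm appends the page number

    def pages_differ(pdf_a, pdf_b):
        # Assumes both pages rasterize to the same pixel size;
        # ImageChops.difference requires images of equal size.
        img_a = Image.open(rasterize_first_page(pdf_a, "a")).convert("RGB")
        img_b = Image.open(rasterize_first_page(pdf_b, "b")).convert("RGB")
        diff = ImageChops.difference(img_a, img_b)
        return diff.getbbox() is not None  # None means the pages are pixel-identical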
Once a test is spotted as failed, we usually want to identify the regression status, i.e. whether the bug has been there for some time. Meanwhile we want to tell whether the bug has already been reported.
This section gives specific requirements for each of the aspects analyzed above.
A test documents management system is supposed to play the following major roles:
- The documents in the storage are classified mainly by file names.
- Easily generate sample documents. As stated before, each testing topic should have 4 sample documents:
  - OOXML for import testing
  - MSO03 for export testing
  - 2 PDFs, one for each of them
  What we want to make a bit easier is to provide one testing file, either OOXML or MSO03, and generate the remaining 3 files automatically (see the sketch after this list).
- Uniformly store the generated original test documents.
- Uniformly store the PDF counterparts of these test documents.
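A minimal sketch of the "provide one file, generate the rest" idea, reusing LibreOffice headless conversion; file names such as hyperlink-basic.docx and the output directories are hypothetical, and the PDF showing MSOffice's rendering still needs to be exported on a machine with MSOffice.

    # Sketch: given one OOXML test file, generate the MSO03 counterpart and PDF references.
    # The test topic is encoded in the file name, e.g. hyperlink-basic.docx (hypothetical).
    import os
    import subprocess
    import sys

    PAIR = {"docx": "doc", "doc": "docx",
            "xlsx": "xls", "xls": "xlsx",
            "pptx": "ppt", "ppt": "pptx"}

    def convert(source, target_format, outdir):
        subprocess.call(["soffice", "--headless", "--convert-to", target_format,
                         "--outdir", outdir, source])

    def generate_set(sample):
        base, suffix = os.path.basename(sample).rsplit(".", 1)
        convert(sample, PAIR[suffix], "generated")      # the counterpart document format
        convert(sample, "pdf", "pdf-of-original")       # LibreOffice PDF of the provided file
        # LibreOffice PDF of the generated counterpart; an MSOffice-produced reference PDF
        # would have to be exported on Windows instead.
        convert(os.path.join("generated", base + "." + PAIR[suffix]),
                "pdf", "pdf-of-counterpart")

    if __name__ == "__main__":
        generate_set(sys.argv[1])  # e.g. python generate_set.py hyperlink-basic.docx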
The testing results can be produced on various testing machines. It makes sense to gather them automatically onto a single machine for future reference.
Since testing results are frequently compared against the test documents, it is reasonable to keep all the test results in the same storage as the test documents.
We should provide a system that makes manual verification easier:
- Testing results should be viewable intuitively, providing:
  - raw images rendered from the corresponding PDF files
  - a single level of UI that shows the images for easier visual comparison
- The testing results report view should be sharable so that other people can see the results easily, which makes a web-based view preferable.
- Testing results, test documents and the corresponding PDFs should be easy to navigate and retrieve from the report view.
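As a sketch of what such a report view could look like, the snippet below writes a static HTML page that shows, per test topic, the LibreOffice-rendered page image next to the MSOffice reference image, with a link back to the test document. The directory layout and file naming (<topic>-lo.png, <topic>-mso.png, <topic>.docx) are assumptions; a real implementation would more likely be a small web application than static files.

    # Sketch: generate a static HTML report placing the LibreOffice and MSOffice
    # renderings side by side for visual comparison. Paths and names are hypothetical.
    import glob
    import os

    ROW = """
    <tr>
      <td>{topic}<br/><a href="{doc}">test document</a></td>
      <td><img src="{lo_png}" width="400"/></td>
      <td><img src="{mso_png}" width="400"/></td>
    </tr>"""

    def write_report(result_dir, out_path="report.html"):
        rows = []
        for lo_png in sorted(glob.glob(os.path.join(result_dir, "*-lo.png"))):
            topic = os.path.basename(lo_png)[:-len("-lo.png")]
            rows.append(ROW.format(topic=topic,
                                   doc=os.path.join(result_dir, topic + ".docx"),
                                   lo_png=lo_png,
                                   mso_png=os.path.join(result_dir, topic + "-mso.png")))
        with open(out_path, "w") as out:
            out.write("<html><body><table border='1'>"
                      "<tr><th>topic</th><th>LibreOffice</th><th>MSOffice</th></tr>"
                      + "".join(rows) + "</table></body></html>")

    if __name__ == "__main__":
        write_report("results")  # "results/" with <topic>-lo.png / <topic>-mso.png is an assumed layout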
Currently, we will use graphics techniques to verify the similarity of the PDF results.
The automated verification results should be reflected and merged into the manual verification UI mentioned before.
We may also bring a certain level of automation to regression identification by:
- Searching bugzilla automatically for similar reports and attachments
- Comparing the current test results automatically with old testing results, leveraging the techniques from the automated verification section
The automatic regression analysis should be reflected and merged into the manual verification UI mentioned before.
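A sketch of the automated bugzilla search, assuming the bugzilla instance exposes the standard XML-RPC web service (the Bug.search method); the URL, product name and search terms below are examples only, and matching on attachments rather than bug summaries would need more work than what is shown here.

    # Sketch: query bugzilla for possibly related existing reports through the
    # standard XML-RPC web service. URL, product and search terms are examples.
    import xmlrpc.client

    def find_candidate_bugs(terms, url="https://bugs.freedesktop.org/xmlrpc.cgi"):
        proxy = xmlrpc.client.ServerProxy(url)
        result = proxy.Bug.search({"product": "LibreOffice", "summary": terms})
        return [(bug["id"], bug["summary"]) for bug in result["bugs"]]

    if __name__ == "__main__":
        for bug_id, summary in find_candidate_bugs(["docx", "hyperlink"]):
            print(bug_id, summary)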