Release v1.1.0 #368

tcojean · 2019-10-19T08:23:52Z

This PR is for pushing the release to the master branch. This brings all commits from develop plus the fix_jacobi branch and a few commits for adaptation.

DO NOT MERGE THIS. Currently, tests are still ongoing !
https://gitlab.com/ginkgo-project/ginkgo-public-ci/pipelines/89916672

- Added used LaTeX packages into `Doxyfile-usr.in` - Improved code coverage by adding tests for + `IteratorFactory` (specifically for the `operator<` in Reference) + `ParIlu` (Added additional test with sorted CSR matrix) - re-added the overview description for `ParIlu`

To prevent conflicts with other users on the CI system, we now restrict it to only use the first GPU (device ID 0) for all tests. Note: That also restricts the CUDA executor copy test to a single GPU, meaning data will be copied internally on a single GPU instead of across devices.

+ Move `matrix_from` to the external loop. + Benchmark directly into `matrix_to`.

+ The same `grid_dim` was used for the reduction and for the `calculate_nnz_per_row` kernel + The `grid_dim` was limited to maximum default_block_size^2 elements. + For bigger matrices, the extracted `max_nnz_per_row` could be wrong, due to omitted values.

When compiling Ginkgo with gcc 6.4.0, there was an `internal compiler error` when compiling the `IteratorFactory`. This was resolved by changing the return type of the helper functions to `const &` instead of a value copy.

+ Move the construction of `matrix_to` in `convert_matrix` function. This allows to catch the related exceptions. + Do not construct the `matrix_to` object until the use of `copy_from` in the warmup. This allows to catch non-existing conversions quickly, without the overhead of loading big matrix data first. + Catch the `AllocationError` related exceptions for `matrix_from` and add a `completed: false` entry to the results on failure to instantiate a matrix from the data.

+ The previous grid dimensions for `initialize_zero_ell` were `stride * num_rows`, i.e. roughly the dense matrix dimension. + Using `max_nnz_per_row * num_rows` reduces significantly the amount of threads created which makes this kernel call more efficient (less useless thread creation).

- Instead of just computing the nnz of L and U, the CSR `row_ptrs` are computed in the first kernel, allowing for better parallelization - Now checking if the `system_matrix` given to factory is square (including a test that checks if it is working) - Fixed errors in omp test (leading to a lower epsilon in comparison)

- Use a strategy object (the default one that is used if no strategy is provided) in ParIlu - parallized the omp `initialize_row_ptrs_l_u` kernel

- Added additional test with more iterations for OpenMP ParIlu test (Therefore, creating a function `compute_lu` to simplify that) - Renaming variables to better fit their purpose

- Added functions that generate unsorted matrices for Csr tests - Added Omp kernel to check if a CSR matrix is sorted and the sorting kernel itself - Added Omp tests for the newly implemented kernels

See the updated [xSDK template](https://github.com/xsdk-project/xsdk-policy-compatibility/blob/master/template.md). + Add support information in the README.md + Add a CHANGELOG.md file which tracks Ginkgo's changes. The full version 1.0.0 description is provided and changes on top of this base version are detailed.

Additionally, separated the storage of the matrices to a single location to prevent matrix file duplication between CUDA and OpenMP tests.

hartwiganzt · 2019-10-19T18:24:19Z

@tcojean I think the problem persists - at least in the current version. However, I still approve it as I feel this is acceptable.

tcojean · 2019-10-19T18:30:26Z

I think this is ready. The tools do not show anything terrible. I fixed a few things from clang-tidy and the documentation issue shown by Hartwig. The problem I found was the submenus had different level (black dot vs white dot), but I still kept the submenus and the same layout.

The full pipeline from this PR can be accessed here:
https://gitlab.com/ginkgo-project/ginkgo-public-ci/pipelines/89916672

All the data is available on the dashboard:
https://my.cdash.org/index.php?project=Ginkgo+Project

Here is the updated documentation:
https://ginkgo-project.github.io/ginkgo/doc/release/v1.1.0/
https://ginkgo-project.github.io/ginkgo/doc/pdf/release/v1.1.0.pdf

The commits which have not been reviewed are the following:
https://github.com/ginkgo-project/ginkgo/pull/368/files/8256a6cc1fd78b1deeabbd92297afcd66d9beaae..7106bb8f0110af3d98dadd556ceb2369f8acf90a

yhmtsai · 2019-10-19T19:07:05Z

@tcojean Does it pick the fix_jacobi branch?

tcojean · 2019-10-19T20:56:05Z

@yhmtsai yes this include the commits from the fix jacobi as you can see in the last link

pratikvn

One minor typo. LGTM!

CHANGELOG.md

yhmtsai

LGTM

CHANGELOG.md

thoasm

looks really good, I have some minor comments.
I also just realized that I forgot to create an issue for add_new_algorithm.sh for factorization, which I have done just now (see #369). I am not sure if we want to have this also as part of this release
(I will work on this tomorrow regardless).

INSTALL.md

README.md

tcojean · 2019-10-20T17:44:36Z

I think most of the typos are fixed.

yhmtsai

LGTM

pratikvn

LGTM!

Thomas Grützmacher and others added 30 commits October 19, 2019 10:16

Updated documentation of ParIlu

e25c866

Added changes from the review comments

b0f64a1

Made further style changes.

4e7cb7c

Make the conversions benchmark faster.

eadf3ac

+ Move `matrix_from` to the external loop. + Benchmark directly into `matrix_to`.

add OMP kernels for ParILU

6f84c0b

formatting

267d031

missing CMakeLists

2878545

add test matrix

feb2b6e

apply path that separates test matrix

964b6c9

include thomas comments

4fdcb46

seperate test functions

eb1c713

minor changes according to review

93a0ea0

typo

2353b4b

Fixed gcc 6.4 internal compiler error

98b4fc8

When compiling Ginkgo with gcc 6.4.0, there was an `internal compiler error` when compiling the `IteratorFactory`. This was resolved by changing the return type of the helper functions to `const &` instead of a value copy.

Small improvements for ParIlu

d81b371

- Use a strategy object (the default one that is used if no strategy is provided) in ParIlu - parallized the omp `initialize_row_ptrs_l_u` kernel

ParIlu improvements according to reviewer

6c5f880

- Added additional test with more iterations for OpenMP ParIlu test (Therefore, creating a function `compute_lu` to simplify that) - Renaming variables to better fit their purpose

Added support for sorting CSR on OpenMP

d212dbd

- Added functions that generate unsorted matrices for Csr tests - Added Omp kernel to check if a CSR matrix is sorted and the sorting kernel itself - Added Omp tests for the newly implemented kernels

Fix custom-logger typo.

caf0f02

Started implementing CUDA kernels for ParILU

ae29a82

Added CUDA test for ParILU

f8419db

Additionally, separated the storage of the matrices to a single location to prevent matrix file duplication between CUDA and OpenMP tests.

Fixed kernel and test mistake in CUDA, ParILU

8794148

Added initialize kernel for ParILU on CUDA

77fda9b

Added remaining kernel (compute) for CUDA ParILU

5234eba

Update all the version references to version 1.1.0.

7106bb8

tcojean force-pushed the release/v1.1.0 branch 2 times, most recently from 109ad6b to 7106bb8 Compare October 19, 2019 18:22

tcojean added 1:ST:ready-for-review This PR is ready for review and removed 1:ST:do-not-merge Please do not merge PR this yet. 1:ST:WIP This PR is a work in progress. Not ready for review. labels Oct 19, 2019

hartwiganzt previously approved these changes Oct 19, 2019

View reviewed changes

pratikvn previously approved these changes Oct 19, 2019

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

yhmtsai previously approved these changes Oct 20, 2019

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

CHANGELOG.md Outdated Show resolved Hide resolved

CHANGELOG.md Outdated Show resolved Hide resolved

thoasm reviewed Oct 20, 2019

View reviewed changes

INSTALL.md Outdated Show resolved Hide resolved

INSTALL.md Outdated Show resolved Hide resolved

INSTALL.md Outdated Show resolved Hide resolved

README.md Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

README.md Show resolved Hide resolved

Fix typos.

c217c76

tcojean dismissed stale reviews from yhmtsai, pratikvn, and hartwiganzt via c217c76 October 20, 2019 17:39

hartwiganzt requested review from hartwiganzt, yhmtsai, pratikvn and thoasm October 20, 2019 18:25

hartwiganzt approved these changes Oct 20, 2019

View reviewed changes

yhmtsai approved these changes Oct 20, 2019

View reviewed changes

pratikvn approved these changes Oct 20, 2019

View reviewed changes

tcojean merged commit b9bec82 into master Oct 20, 2019

tcojean added 1:ST:ready-to-merge This PR is ready to merge. and removed 1:ST:ready-for-review This PR is ready for review labels Oct 20, 2019

tcojean mentioned this pull request Oct 20, 2019

Release v1.1.0 for develop #370

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release v1.1.0 #368

Release v1.1.0 #368

tcojean commented Oct 19, 2019

hartwiganzt commented Oct 19, 2019

tcojean commented Oct 19, 2019

yhmtsai commented Oct 19, 2019

tcojean commented Oct 19, 2019

pratikvn left a comment

yhmtsai left a comment

thoasm left a comment

tcojean commented Oct 20, 2019

yhmtsai left a comment

pratikvn left a comment

Release v1.1.0 #368

Release v1.1.0 #368

Conversation

tcojean commented Oct 19, 2019

hartwiganzt commented Oct 19, 2019

tcojean commented Oct 19, 2019

yhmtsai commented Oct 19, 2019

tcojean commented Oct 19, 2019

pratikvn left a comment

Choose a reason for hiding this comment

yhmtsai left a comment

Choose a reason for hiding this comment

thoasm left a comment

Choose a reason for hiding this comment

tcojean commented Oct 20, 2019

yhmtsai left a comment

Choose a reason for hiding this comment

pratikvn left a comment

Choose a reason for hiding this comment