Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[hotfix] FPGA kernels generate an extra bracket #596

Merged
merged 2 commits into from
Nov 29, 2024

Conversation

stratika
Copy link
Collaborator

@stratika stratika commented Nov 28, 2024

Description

This PR fixes issue #590.

Problem description

After close inspection the problem was that for FPGAs, we apply a compiler optimization for loop flattening of the outermost for loop. In this case, the OCLBlockVisitor.java class did not account for the flattened loop and was emitting the closing bracket.

Backend/s tested

Mark the backends affected by this PR.

  • OpenCL
  • PTX
  • SPIRV

OS tested

Mark the OS where this PR is tested.

  • Linux
  • OSx
  • Windows

Did you check on FPGAs?

If it is applicable, check your changes on FPGAs.

  • Yes
  • No

How to test the new patch?

make && make fast-tests
tornado-test -V uk.ac.manchester.tornado.unittests.branching.TestLoopConditions

In my case the FPGA device is device 0:4. To test on FPGAs kernels, run:

rm -rf fpga-source-comp/ && tornado --igv --jvm "-Ds0.t0.device=0:4 -Xmx20g -Xms20g" --printKernel --threadInfo -m tornado.examples/uk.ac.manchester.tornado.examples.dynamic.DFTMT --params="256 default 1"

rm -rf fpga-source-comp/ && tornado --igv --jvm "-Ds0.t0.device=0:4 -Xmx20g -Xms20g" --printKernel --threadInfo -m tornado.examples/uk.ac.manchester.tornado.examples.dynamic.BlackScholesMT --params="256 default 1"

rm -rf fpga-source-comp/ && tornado --igv --jvm "-Ds0.t0.device=0:4 -Xmx20g -Xms20g" --printKernel --threadInfo -m tornado.examples/uk.ac.manchester.tornado.examples.dynamic.MontecarloMT --params="256 default 1"

rm -rf fpga-source-comp/ && tornado --igv --jvm "-Ds0.t0.device=0:4 -Xmx20g -Xms20g" --printKernel --threadInfo -m tornado.examples/uk.ac.manchester.tornado.examples.dynamic.NBodyMT --params="256 default 1"

rm -rf fpga-source-comp/ && tornado --igv --jvm "-Ds0.t0.device=0:4 -Xmx20g -Xms20g" --printKernel --threadInfo -m tornado.examples/uk.ac.manchester.tornado.examples.dynamic.SaxpyMT --params="256 default 1"

…fore the FPGA loop flattening optimization. This is necessary as later the count of the closed loops is used to emit the close bracket for loop blocks.
@stratika stratika self-assigned this Nov 28, 2024
@stratika stratika added bug Something isn't working fpga FPGA labels Nov 28, 2024
@jjfumero
Copy link
Member

Thanks @stratika. I can reproduce it using Intel oneAPI and the FPGA emulator mode.

Copy link
Collaborator

@mairooni mairooni left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jjfumero jjfumero merged commit d01e3bc into beehive-lab:develop Nov 29, 2024
2 checks passed
@stratika stratika deleted the hotfix/ocl/fpga-bracket branch November 29, 2024 11:22
jjfumero added a commit to jjfumero/TornadoVM that referenced this pull request Dec 20, 2024
Improvements
=============

- beehive-lab#573: Enhanced output of unit-tests with a summary  of pass-rates and fail-rates.
- beehive-lab#576: Extended support for 3D matrices.
- beehive-lab#580: Extended debug information for execution plans.
- beehive-lab#584: Added helper menu for the ``tornado`` launcher script when no arguments are passed.
- beehive-lab#589: Enable partial loop unrolling for all backends.
- beehive-lab#594: Added RISC-V 64 CPU port support to run OpenCL with vector instructions RVV 1.0 (using the Codeplay OCK Toolkit).
- beehive-lab#598: OpenCL low-level buffers tagged as read, write and read/write based on the data dependency analysis.
- beehive-lab#601: Feature to select an immutable task graph to execute from a multi-task graph execution plan.

Compatibility
=============

- beehive-lab#570:  Extended timeout for all suite of unit-tests.
- beehive-lab#579: Removed legacy JDK 8 and JDK11 build options from the TornadoVM installer.
- beehive-lab#582: Restored tornado runner scripts for IntellIJ.
- beehive-lab#583: Automatic generation of IDE IntelliJ configuration runner files from the TornadoVM command.
- beehive-lab#597: Updated white-list of unit-test and checkstyle improved.

Bug Fixes
=============

- beehive-lab#571: Fix issues with bracket closing for if/loops conditions.
- beehive-lab#572: Fix for printing default execution plans (execution plans with default parameters).
- beehive-lab#575: Fix the Level Zero version used for building the SPIR-V backend.
- beehive-lab#577: Fix checkstyle.
- beehive-lab#587: Fix thread scheduler for new NVIDIA Drivers.
- beehive-lab#592: Fix ``Float.POSITIVE_INFINITY`` and ``Float.NEGATIVE_INFINITIVE`` constants for the OpenCL, CUDA and SPIR-V backends.
- beehive-lab#596: Fix extra closing bracket during the code-generation for the FPGAs.
- Remove the intermediate CUDA pinned memory regions in the JNI code: [link](beehive-lab@9c3f8ce)
- Fix bitwise negation operations for the PTX backend:  [link](beehive-lab@0db1cd3)
- `GetBackendImpl::getAllDevices` thread-safe: [link](beehive-lab@0d44252)
- Check size elements for memory segments: [link](beehive-lab@4360385)
@jjfumero jjfumero mentioned this pull request Dec 20, 2024
8 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working fpga FPGA
Projects
Development

Successfully merging this pull request may close these issues.

3 participants