Preserving libtorch_python in package #246

jeongseok-meta · 2024-07-09T05:16:25Z

Our project requires linking to libtorch_python. Currently, the shared library is not included in the official TorchConfig.cmake, so we use find_library() to locate it. However, this method fails to find libtorch_python, although it successfully locates other symlinked libraries in the same directory (e.g., libtorch_cpu or libtorch_global_deps).

+ ls -l /home/conda/feedstock_root/build_artifacts/momentum_1720498881696/_build_env/lib/python3.12/site-packages/torch/lib
total 22012
lrwxrwxrwx  1 conda conda       21 Jul  9 04:23 libc10.so -> ../../../../libc10.so
lrwxrwxrwx  1 conda conda       21 Jul  9 04:23 libshm.so -> ../../../../libshm.so
lrwxrwxrwx  1 conda conda       27 Jul  9 04:23 libtorch_cpu.so -> ../../../../libtorch_cpu.so
lrwxrwxrwx  1 conda conda       35 Jul  9 04:23 libtorch_global_deps.so -> ../../../../libtorch_global_deps.so
-rwxr-xr-x 49 conda conda 22537640 Jun 17 01:29 libtorch_python.so
lrwxrwxrwx  1 conda conda       23 Jul  9 04:23 libtorch.so -> ../../../../libtorch.so

This PR aims to prevent the removal of libtorch_python by this line. Instead, it moves the library to PREFIX/lib and creates a symbolic link in the torch lib folder, similar to other libraries, hoping this change resolves the issue with find_library() not finding the shared library.

Checklist

Used a personal fork of the feedstock to propose changes
Bumped the build number (if the version is unchanged)
Reset the build number to 0 (if the version changed)
Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
Ensured the license file is being packaged.

conda-forge-webservices · 2024-07-09T05:16:33Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

jeongseok-meta · 2024-07-09T05:17:37Z

@conda-forge-admin, please rerender

github-actions · 2024-07-09T05:19:22Z

Hi! This is the friendly automated conda-forge-webservice.

I tried to rerender for you, but it looks like there was nothing to do.

This message was generated by GitHub actions workflow run https://github.com/conda-forge/pytorch-cpu-feedstock/actions/runs/9851406294.

h-vetinari · 2024-07-09T05:33:25Z

recipe/build.sh

@@ -218,7 +218,6 @@ if [[ "$PKG_NAME" == "libtorch" ]]; then
  for f in ATen caffe2 tensorpipe torch c10; do
    mv torch/include/$f ${PREFIX}/include/$f
  done
-  rm ${PREFIX}/lib/libtorch_python.*


This won't work AFAIU; libtorch_python.so depends on a specific python version, while the conda package libtorch intentionally does not depend on python at all.

If we want to ship libtorch_python.so, we should most likely package it in the pytorch output (or perhaps create a separate output).

Thank you for pointing out that libtorch should not depend on Python. Moving it or creating a separate output seems like a viable solution.

I am relatively new to the build script of this feedstock and find it more complex than other packages I've worked with. Could you please provide some guidance on how to implement your suggestion?

Could you please provide some guidance on how to implement your suggestion?

As a first approach, it should work to "install" libtorch_python.so in build_pytorch.sh, by copying from the build cache (that's created & populated during the execution of build.sh) into $PREFIX/lib.

hmaarrfk · 2024-07-09T10:52:26Z

you are asking for for the python specific library, you should be able to replace torch_python with torch in your setup script.

h-vetinari · 2024-07-09T11:03:25Z

you are asking for for the python specific library, you should be able to replace torch_python with torch in your setup script.

We genuinely don't seem to package libtorch_python.so, also not in pytorch itself (which is empty).

hmaarrfk · 2024-07-09T11:26:19Z

~/miniforge3/pkgs
07:25 $ find -name libtorch_python.so
./pytorch-2.3.1-cpu_mkl_py312h3b258cc_100/lib/python3.12/site-packages/torch/lib/libtorch_python.so
./pytorch-2.3.1-cpu_mkl_py310h75865b9_100/lib/python3.10/site-packages/torch/lib/libtorch_python.so
./pytorch-2.3.1-cpu_generic_py310ha4c588e_0/lib/python3.10/site-packages/torch/lib/libtorch_python.so

in a typical workflow, people often are not linking to it, so i guess we left it in lib so it could be python specific.

hmaarrfk · 2024-07-09T11:27:49Z

And a computer that has cuda120

~/miniforge3/pkgs
$ find -name libtorch_python.so
./pytorch-2.3.1-cuda120_py310h2c91c31_300/lib/python3.10/site-packages/torch/lib/libtorch_python.so

hmaarrfk · 2024-07-09T11:29:36Z

We genuinely don't seem to package libtorch_python.so, also not in pytorch itself (which is empty).

I guess that stremlit should be taken with a grain of salt, in this case a 30MB package can't be empty ;)

h-vetinari · 2024-07-10T00:06:11Z

I guess that streamlit should be taken with a grain of salt, in this case a 30MB package can't be empty ;)

h-vetinari · 2024-07-10T08:45:00Z

OK, my view of the world is restored (thanks @jaimergp 🙏): the files for pytorch now show up in streamlit, and in particular, libtorch_python.so can be found in $SP_DIR/torch/lib.

It would be within the realm of possibility to move them to $PREFIX/lib and then play around with patchelf to correct the path in the binaries. Here's an example where ray does something similar. Not sure if that's worth it though. 🤷

hmaarrfk · 2024-07-10T12:32:37Z

@jeongseok-meta can you comment as to whether or not you are able to link directly to the non-python libraries?

it also seems that we do include the TorchConfig.cmake file

./libtorch-2.3.1-cuda120_h2b0da52_300/share/cmake/Torch/TorchConfig.cmake

are you using conda-forge cmake or an other?

jeongseok-meta · 2024-07-10T15:31:00Z

@hmaarrfk Yes, I can link to other libraries by using find_package(Torch CONFIG REQUIRED) and I am using cmake from conda-forge:

https://github.com/facebookincubator/momentum/blob/0d60cde3a5654c7b21599eefd1cace99da4edb45/pymomentum/CMakeLists.txt#L11

However, the problem is that I want to link to libtorch_python, which is not included in the TorchConfig.cmake.

By the way, thank you for the other comments. I will be AFK for the next few days, but I will catch up when I return.

hmaarrfk · 2024-07-11T00:55:45Z

@jeongseok-meta thanks for giving more context.

In my experience there is often little value in linking to the "Python library" when a "c" library exists.
I can't speak to your exact usecase, but my hunch is that linking to python will give little benefit.

Its also unclear what those at pytorch meant to have as public vs private.

I would consider what you need from the python side, and maybe just avoid using it for maintainability.

One thing that would be helpful for us to understand is, does this work from packages from the pytorch channel? While we try to do it the "conda-forge way" we do try to follow upstream "intentions". So if it works with the pytorch + default channel, then it would give us one extra push to try to make it work here (but again, seems like a very niche usecase, so we would need your help to do it well -- some good suggestions came from h-vetinari)

isuruf · 2024-08-02T22:07:27Z

@jeongseok-meta, this is an issue with our split build. I think the best solution is to move $PREFIX/lib/python$PY_VER/site-packages/torch/lib/libtorch_python.so to $PREFIX/lib and make a symlink back at site-packages.

This reverts commit 74c5d7a.

This reverts commit 84af8ac.

conda-forge-webservices · 2024-08-02T22:13:39Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found it was in an excellent condition.

jeongseok-meta · 2024-08-07T01:20:41Z

@hmaarrfk Thank you for sharing your thoughts. Specifically, we need to use python_ivalue.h where the definitions are included in libtorch_python.so for Python binding with PyTorch interop. Unfortunately, we don't have a clear path to avoid depending on the API at the moment, and it's also unclear if PyTorch intended for it to be public or private as well.

One thing that would be helpful for us to understand is, does this work from packages from the pytorch channel?

It works with packages from the PyTorch channel, as we can build and import the Python binding linking to libtorch_python.so without errors locally (pixi.toml of our project)

jeongseok-meta · 2024-08-07T01:21:11Z

Hi @isuruf, your solution looks promising! However, I noticed some build failures in some jobs. Do you have any ideas on how to address them?

hmaarrfk · 2024-08-16T21:56:47Z

@conda-forge-admin please rerender

…nda-forge-pinning 2024.08.16.17.50.00

hmaarrfk · 2024-08-17T03:43:29Z

i thought i had access to the GPU runners... maybe they are down.

the package fails to build in docker locally for me.

hmaarrfk · 2024-08-17T03:45:22Z

i think perhaps a new flag was added in gcc 12.4 or something.
conda-forge/ctng-compiler-activation-feedstock#121

perhaps we can try to set the upper bound to 12.3???

h-vetinari · 2024-08-17T04:52:02Z

the package fails to build in docker locally for me.

Can you post an error log?

i think perhaps a new flag was added in gcc 12.4 or something.

The only thing that changed in our packaging was the way how meson sets a release flag, and we specifically didn't apply that change to 12.4, only 13.x and up.

hmaarrfk · 2024-08-17T17:38:53Z

Here is the log, had to pull it out of my tmux
log.txt

hmaarrfk · 2024-08-17T17:40:44Z

I think that this would help users that have libtorch.so in their missing dso whitelist:
https://github.com/search?q=org%3Aconda-forge+++%22libtorch_python.so%22&type=code

hmaarrfk · 2024-08-21T16:32:21Z

I think some compilation issues might stem from libmagma having been updated to 2.8 from 2.7.2: conda-forge/libmagma-feedstock#18

That said, i'm not too sure.

2.4.0 seems to build fine with libmagma 2.8.0 https://github.com/conda-forge/pytorch-cpu-feedstock/actions/runs/10477101071/job/29017474679?pr=250#step:3:734

conda-forge/libmagma-feedstock#21

Keep libtorch_python

74c5d7a

Increase build number

7e83af6

Add test for libtorch_python

84af8ac

h-vetinari reviewed Jul 9, 2024

View reviewed changes

isuruf added 3 commits August 2, 2024 17:08

Revert "Keep libtorch_python"

ab28630

This reverts commit 74c5d7a.

Revert "Add test for libtorch_python"

4ae4eb3

This reverts commit 84af8ac.

move libtorch_python.so to PREFIX/lib

a620157

trigger

11ed4ee

MNT: Re-rendered with conda-build 24.7.1, conda-smithy 3.38.0, and co…

e1ce95b

…nda-forge-pinning 2024.08.16.17.50.00

no need to specify c_stdlib for linux anymore

8ed8fec

hmaarrfk force-pushed the libtorch_python branch from 76c9848 to 8ed8fec Compare August 20, 2024 04:05

try to pin magma to 2.7.2

31956b1

jeongseok-meta mentioned this pull request Aug 23, 2024

Update to 2.4.0 and add libpytorch.so to lib location #250

Merged

5 tasks

hmaarrfk closed this Aug 23, 2024

jeongseok-meta deleted the libtorch_python branch August 26, 2024 15:47

jakirkham mentioned this pull request Aug 28, 2024

Rebuild for CUDA 12 conda-forge/autoawq-feedstock#13

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserving libtorch_python in package #246

Preserving libtorch_python in package #246

jeongseok-meta commented Jul 9, 2024 •

edited

Loading

conda-forge-webservices bot commented Jul 9, 2024

jeongseok-meta commented Jul 9, 2024

github-actions bot commented Jul 9, 2024

h-vetinari Jul 9, 2024

jeongseok-meta Jul 9, 2024

h-vetinari Jul 9, 2024

hmaarrfk commented Jul 9, 2024

h-vetinari commented Jul 9, 2024

hmaarrfk commented Jul 9, 2024

hmaarrfk commented Jul 9, 2024

hmaarrfk commented Jul 9, 2024

h-vetinari commented Jul 10, 2024

h-vetinari commented Jul 10, 2024 •

edited

Loading

hmaarrfk commented Jul 10, 2024

jeongseok-meta commented Jul 10, 2024

hmaarrfk commented Jul 11, 2024

isuruf commented Aug 2, 2024

conda-forge-webservices bot commented Aug 2, 2024

jeongseok-meta commented Aug 7, 2024

jeongseok-meta commented Aug 7, 2024

hmaarrfk commented Aug 16, 2024

hmaarrfk commented Aug 17, 2024

hmaarrfk commented Aug 17, 2024

h-vetinari commented Aug 17, 2024

hmaarrfk commented Aug 17, 2024

hmaarrfk commented Aug 17, 2024

hmaarrfk commented Aug 21, 2024 •

edited

Loading

Preserving libtorch_python in package #246

Preserving libtorch_python in package #246

Conversation

jeongseok-meta commented Jul 9, 2024 • edited Loading

conda-forge-webservices bot commented Jul 9, 2024

jeongseok-meta commented Jul 9, 2024

github-actions bot commented Jul 9, 2024

h-vetinari Jul 9, 2024

Choose a reason for hiding this comment

jeongseok-meta Jul 9, 2024

Choose a reason for hiding this comment

h-vetinari Jul 9, 2024

Choose a reason for hiding this comment

hmaarrfk commented Jul 9, 2024

h-vetinari commented Jul 9, 2024

hmaarrfk commented Jul 9, 2024

hmaarrfk commented Jul 9, 2024

hmaarrfk commented Jul 9, 2024

h-vetinari commented Jul 10, 2024

h-vetinari commented Jul 10, 2024 • edited Loading

hmaarrfk commented Jul 10, 2024

jeongseok-meta commented Jul 10, 2024

hmaarrfk commented Jul 11, 2024

isuruf commented Aug 2, 2024

conda-forge-webservices bot commented Aug 2, 2024

jeongseok-meta commented Aug 7, 2024

jeongseok-meta commented Aug 7, 2024

hmaarrfk commented Aug 16, 2024

hmaarrfk commented Aug 17, 2024

hmaarrfk commented Aug 17, 2024

h-vetinari commented Aug 17, 2024

hmaarrfk commented Aug 17, 2024

hmaarrfk commented Aug 17, 2024

hmaarrfk commented Aug 21, 2024 • edited Loading

jeongseok-meta commented Jul 9, 2024 •

edited

Loading

h-vetinari commented Jul 10, 2024 •

edited

Loading

hmaarrfk commented Aug 21, 2024 •

edited

Loading