[Feature Suggestion] Halide-lang codegen / generators #676

Closed
eszdman opened this issue Apr 1, 2023 · 2 comments

eszdman commented Apr 1, 2023

Prerequisites

Examples of work with neural networks in Halide:
https://github.com/halide/Halide/blob/main/apps/resnet_50/Resnet50Generator.cpp
https://github.com/halide/Halide/tree/main/apps/hannk
https://github.com/halide/Halide/blob/main/apps/onnx/model.cpp

Expected Behavior

Cross-platform binary and code generation, with the best schedule for the computational graph applied either by hand in Halide or by the Halide autoschedulers, and reduced memory usage from scheduling each network model before execution.

Current Behavior

Many conditional defines for different platforms and instruction-specific code bloat; a lack of GPU support (OpenCL, OpenGL Compute, CUDA); and a computational graph that has great parallelism but poor memory locality.
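To illustrate the parallelism-versus-locality point, here is a minimal sketch in plain C++ (not Halide, and not code from this project). The three-node graph `y = relu(a * x + b)` and the function names `run_unfused` and `run_fused` are hypothetical, chosen only for illustration. Evaluating the graph node by node keeps every loop trivially parallel but writes two full intermediate buffers; fusing the nodes touches each element once. In Halide the same choice is made in the schedule rather than in the algorithm: a `Func` is inlined into its consumer by default, while `compute_root()` materializes it in full first.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical graph for illustration: y = relu(a * x + b), elementwise.

// Node-by-node evaluation: each node writes a full intermediate tensor.
// Every loop is easy to parallelize, but the data streams through memory
// three times and two temporaries are allocated.
std::vector<float> run_unfused(const std::vector<float>& x, float a, float b) {
    std::vector<float> scaled(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) scaled[i] = a * x[i];

    std::vector<float> shifted(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) shifted[i] = scaled[i] + b;

    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) out[i] = std::max(shifted[i], 0.0f);
    return out;
}

// Fused evaluation: the same arithmetic in a single pass with no
// intermediate buffers. The loop is still parallel over i.
// Note: results can differ in the last bit from run_unfused if the
// compiler contracts a * x[i] + b into a fused multiply-add.
std::vector<float> run_fused(const std::vector<float>& x, float a, float b) {
    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = std::max(a * x[i] + b, 0.0f);
    }
    return out;
}
```

Both functions compute the same values; only the evaluation order and the amount of intermediate storage differ, which is the kind of decision a Halide schedule would control.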

ei-grad commented Dec 3, 2023

Despite the April 1 date and the somewhat offensive and outdated "Current Behavior" section, it should be fairly straightforward to translate the llama.cpp graph for some model to Halide, as an experiment to see what performance improvement could be achieved.

@github-actions github-actions bot added the stale label Mar 25, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.
