@LeiWang1999
Member

This pull request includes updates to the README.md file, a new script for building Python distributions, and modifications to the tox configuration to support multiple Python versions. The most important changes are summarized below:

Documentation updates:

  • README.md: Corrected the capitalization of "WebGPU Codegen" in the Latest News section.

Build and distribution improvements:

  • maint/scripts/pypi_distribution_tox.sh: Added a new script to install multiple Python versions, install build requirements, and build wheels for different Python versions using tox.
  • tox.ini: Updated the tox configuration to include environments for building wheels for multiple Python versions, and added a new environment for building PyPI distributions.
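
The per-version wheel build described above can be sketched roughly as follows. This is only an illustration: the version list, the `pyXY` environment names, and the helper names `tox_env_name` and `build_commands` are assumptions, not taken from the PR's `tox.ini` or `pypi_distribution_tox.sh`. The sketch prints the `tox -e <env>` invocations instead of running them.

```python
import shlex

# Hypothetical list of interpreter versions; the PR's actual set may differ.
PYTHON_VERSIONS = ["3.8", "3.9", "3.10", "3.11", "3.12"]


def tox_env_name(version: str) -> str:
    """Map a version such as '3.10' to the conventional tox env name 'py310'."""
    major, minor = version.split(".")
    return f"py{major}{minor}"


def build_commands(versions):
    """Return one `tox -e <env>` argument list per Python version."""
    return [["tox", "-e", tox_env_name(v)] for v in versions]


if __name__ == "__main__":
    for cmd in build_commands(PYTHON_VERSIONS):
        # Print rather than execute, so the sketch has no side effects.
        print(shlex.join(cmd))
```

Keeping each command as an argument list makes it easy to hand the same data to `subprocess.run` later without re-parsing a shell string.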

- Rename ThreadSync and TileLangThreadSync functions in C++ code
- Update Python docstring for ThreadSync with more detailed description
- Reorder library path detection in tilelang environment setup
- Minor comment and code cleanup in CUDA and warp specialization modules
- Standardize pointer type spacing in storage_access.h and storage_access.cc
- Update whitespace and indentation in thread_storage_sync.cc
- Reorder include statements in thread_partial_sync.cc
- Minor code formatting improvements across thread synchronization files
- Correct global function registration to use ThreadSync instead of TileLangThreadSync
- Update TVM global registration to match recent refactoring efforts
- Remove unnecessary whitespace in global function registration
- Compact the TVM global registration line for ThreadSync
- Implement WebGPU code generator (codegen_webgpu.cc and codegen_webgpu.h)
- Add WebGPU target support in lower.py and target.py
- Update CMakeLists.txt to include WebGPU codegen source files
- Introduce WebGPU-specific code generation for WGSL shader language
- Enhance code formatting in codegen_webgpu.cc and codegen_webgpu.h
- Standardize pointer type spacing and indentation
- Improve line breaks and reduce line length for better readability
- Minor code style improvements in WebGPU code generation
- Implement test_webgpu_codegen.py for WebGPU matrix multiplication
- Add assert_gemm_codegen function to validate WebGPU code generation
- Include basic matrix multiplication kernel test case
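
For readers unfamiliar with what a block-tiled matrix multiplication kernel computes, here is a minimal pure-Python sketch. It is not the PR's `test_webgpu_codegen.py` and does not exercise WebGPU code generation; the function names `matmul_reference`, `matmul_tiled`, and `assert_gemm_matches`, and the `block_M`/`block_N`/`block_K` parameters, are hypothetical and chosen only for illustration. It checks that a tiled loop nest produces the same result as a naive reference.

```python
def matmul_reference(A, B):
    """Naive triple-loop matrix product, used as ground truth."""
    M, K, N = len(A), len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(K)) for j in range(N)]
            for i in range(M)]


def matmul_tiled(A, B, block_M, block_N, block_K):
    """Same product, computed tile by tile the way a blocked kernel would."""
    M, K, N = len(A), len(B), len(B[0])
    C = [[0] * N for _ in range(M)]
    for i0 in range(0, M, block_M):          # tile rows of C
        for j0 in range(0, N, block_N):      # tile columns of C
            for k0 in range(0, K, block_K):  # reduction tiles
                for i in range(i0, min(i0 + block_M, M)):
                    for j in range(j0, min(j0 + block_N, N)):
                        acc = C[i][j]
                        for k in range(k0, min(k0 + block_K, K)):
                            acc += A[i][k] * B[k][j]
                        C[i][j] = acc
    return C


def assert_gemm_matches(M, N, K, block_M, block_N, block_K):
    """Build small integer inputs and compare tiled output to the reference."""
    A = [[(i + k) % 7 for k in range(K)] for i in range(M)]
    B = [[(2 * k + j) % 5 for j in range(N)] for k in range(K)]
    assert matmul_tiled(A, B, block_M, block_N, block_K) == matmul_reference(A, B)


if __name__ == "__main__":
    assert_gemm_matches(8, 8, 8, 4, 4, 2)
    assert_gemm_matches(5, 7, 3, 2, 3, 2)  # sizes not divisible by the tile sizes
```

Integer inputs are used so the comparison can be exact; a test on floating-point output would need a tolerance instead.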
@LeiWang1999 merged commit e5f5ca6 into tile-ai:main on Feb 18, 2025
3 checks passed