feat: Qwen3 VLM support #443

bradhilton · 2025-10-23T02:10:59Z

No description provided.

…andling

* Set execution counts for code cells to reflect their order of execution. * Added HTML output styling for better visualization. * Included detailed output logs for model tracking and warnings related to imports.

* Updated the LocalBackend to include AutoImageProcessor for handling image inputs. * Modified tokenize_trajectory_groups to accept an image processor, enabling image tokenization. * Improved type annotations for better clarity and maintainability. * Adjusted error handling for image processing to ensure robustness.

* Added support for pixel_values and image_grid_thw in PackedTensors and DiskPackedTensors. * Updated packed_tensors_from_tokenized_results to handle new tensor types. * Improved type annotations for clarity and maintainability. * Refactored methods to ensure proper handling of image-related tensors throughout the preprocessing pipeline.

…a notebook * Updated execution counts for code cells to maintain proper order. * Enhanced output handling with detailed HTML and stream outputs for better visualization and tracking. * Refactored tensor handling in preprocessing to ensure correct data types for pixel_values and image_grid_thw.

* Updated the tokenization logic to use an offset for indexing, ensuring correct handling of image tokens. * Improved the efficiency of token replacement in the token_ids list.

* Added conditional loading of LoRA adapter in _set_lora method to handle cases where load_lora is not available. * Updated ModelState initialization to directly assign peft_model if it is already an instance of PeftModelForCausalLM, enhancing type safety and clarity.

* Updated pixel_values and image_grid_thw assignments to conditionally return [None] during warmup, improving flexibility in tensor management. * Ensured consistent handling of tensor data types for better integration in the processing pipeline.

* Introduced a new script to generate images based on yes/no/maybe prompts, utilizing PIL for rendering. * Created a training notebook that integrates OpenAI's API for model training with generated images. * Enhanced the image processing pipeline with improved font handling and text wrapping for better visual output. * Updated the math-vista notebook to reset execution counts and clean up outputs for consistency.

…swering * Added a new script to train a model using image and question pairs from the MathVista dataset. * Integrated asynchronous processing for efficient training and trajectory logging. * Enhanced image handling by saving decoded images to a temporary directory for model input. * Improved argument parsing for customizable training runs.

…ctory logging * Enhanced pixel_values and image_grid_thw assignments to conditionally return [None] during warmup, improving flexibility in tensor management. * Added type ignore comment for content conversion in trajectory_logging to suppress type errors. * Updated test notebook to reflect changes in tokenized results for better accuracy in outputs.

* Bumped versions of `unsloth` to 2025.10.8 and `unsloth-zoo` to 2025.10.9 in `pyproject.toml` and `uv.lock`. * Updated `transformers` version to 4.56.2 in both `pyproject.toml` and `uv.lock`. * Modified the training notebook to change the base model from `Qwen/Qwen2.5-VL-7B-Instruct` to `Qwen/Qwen3-VL-8B-Instruct` and updated execution counts for consistency.

bradhilton added 24 commits September 17, 2025 16:08

chore: Start adding Math Vista task

fdef126

feat: Add text-only rollout function

87de7f1

refactor: Simplify message handling in Math Vista notebook

5297dd9

feat: Enhance Math Vista notebook with execution tracking and image h…

ff8bbb7

…andling

fix: Update run data paths and timestamps in Math Vista notebook

e316551

fix: Reset execution counts and clean up outputs in Math Vista notebook

4a1c34e

Merge branch 'main' into feat/vlm-support

657960e

Merge branch 'main' into feat/vlm-support

030ff87

chore: Ruff linting autofix

5343690

chore: Cast types

12124b0

fix: Update execution counts and outputs in math-vista notebook

dbfbcd3

* Set execution counts for code cells to reflect their order of execution. * Added HTML output styling for better visualization. * Included detailed output logs for model tracking and warnings related to imports.

Merge branch 'main' into feat/vlm-support

9645b86

refactor: Optimize token indexing in trajectory tokenization

6a8977c

* Updated the tokenization logic to use an offset for indexing, ensuring correct handling of image tokens. * Improved the efficiency of token replacement in the token_ids list.

chore: Clean up math-vista.ipynb

b81c2dc

Merge branch 'main' into feat/vlm-support

fb745b6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Qwen3 VLM support #443

feat: Qwen3 VLM support #443

bradhilton commented Oct 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Qwen3 VLM support #443

Are you sure you want to change the base?

feat: Qwen3 VLM support #443

Conversation

bradhilton commented Oct 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants