There's probably a typo here:
```python
from torchtune.models.llama3_2_vision import llama3_2_vision_transform
from torchtune.datasets.multimodal import multimodal_chat_dataset

transform = Llama3VisionTransform(
    path="/tmp/Meta-Llama-3-8B-Instruct/original/tokenizer.model",
    prompt_template="torchtune.data.QuestionAnswerTemplate",
    max_seq_len=8192,
    image_size=560,
)
ds = multimodal_chat_dataset(
    model_transform=model_transform,
    source="json",
    data_files="data/my_data.json",
    column_map={
        "dialogue": "conversations",
        "image_path": "image",
    },
    image_dir="/home/user/dataset/",  # /home/user/dataset/images/clock.jpg
    image_tag="<image>",
    split="train",
)
tokenized_dict = ds[0]
print(transform.decode(tokenized_dict["tokens"], skip_special_tokens=False))
# '<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\nQuestion:<|image|>What time is it on the clock?Answer:<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nIt is 10:00AM.<|eot_id|>'
print(tokenized_dict["encoder_input"]["images"][0].shape)
# (num_tiles, num_channels, tile_height, tile_width)
# torch.Size([4, 3, 224, 224])
```
Shouldn't it be just `transform`, not `model_transform`?
Ah yeah, good catch. We should use either `transform` or `model_transform` consistently across the example.
If you're able to fix it with a quick PR, happy to stamp it. Otherwise, I can address it.
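For anyone hitting this: as written, the example raises a `NameError` at the `multimodal_chat_dataset(...)` call, because `model_transform` is never defined — only `transform` is. A minimal self-contained sketch of the failure mode and the fix (plain Python stand-ins, no torchtune; `multimodal_chat_dataset` here is a dummy that just records its arguments):

```python
def multimodal_chat_dataset(model_transform, source):
    # Stand-in for the real torchtune factory: records its inputs.
    return {"model_transform": model_transform, "source": source}

# Stand-in for the Llama3VisionTransform instance in the example.
transform = "Llama3VisionTransform instance"

# Buggy version from the docs: the name `model_transform` was never
# defined, so referencing it raises NameError.
try:
    ds = multimodal_chat_dataset(model_transform=model_transform, source="json")
except NameError as e:
    print(f"NameError: {e}")

# Fixed version: pass the variable that was actually defined above.
ds = multimodal_chat_dataset(model_transform=transform, source="json")
print(ds["model_transform"])
```

Either renaming the variable to `model_transform` at definition time or passing `model_transform=transform` would make the example consistent.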