Rebase index_copy fix #62

wonjoolee95 · 2024-05-24T20:34:03Z

Rebase @bhavya01's index_copy fix from llama2-google-next-inference branch

JackCaoG · 2024-05-24T21:18:55Z

@wonjoolee95 can you do a run and see if this change has any performance implication?

wonjoolee95 · 2024-05-29T23:02:31Z

This rebase seems to give me the buffer with shape bf16[1,2048,32,128] on device SPMD:0 is null error (full paste: https://gist.github.com/wonjoolee95/73b1590e9432eabe39697708f8b2da71).

And when we change to openxla (instead of openxla_eval), the performance seems significantly worse.

We can tackle the SPMD inference work and this issue separately. For now, we can either keep using the existing out-of-place .index_copy and manually set the dynamo_cache_hit or use continuing using the 05012024 wheels.

cc @bhavya01

bhavya01 added 4 commits May 24, 2024 20:30

Add debug prints

8df68a9

Updated indexx_copy to in-place index_copy_

e07f0bf

Remove assignment

1a0a426

remove extra space

2b1217e

wonjoolee95 assigned bhavya01 and JackCaoG and unassigned bhavya01 and JackCaoG May 24, 2024

wonjoolee95 requested review from alanwaketan, bhavya01 and JackCaoG May 24, 2024 21:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rebase index_copy fix #62

Rebase index_copy fix #62

wonjoolee95 commented May 24, 2024

JackCaoG commented May 24, 2024

wonjoolee95 commented May 29, 2024

Rebase index_copy fix #62

Are you sure you want to change the base?

Rebase index_copy fix #62

Conversation

wonjoolee95 commented May 24, 2024

JackCaoG commented May 24, 2024

wonjoolee95 commented May 29, 2024