Help regarding understanding Alpaca output without sampling. #342

aniquetahir · 2023-04-14T18:43:23Z

aniquetahir
Apr 14, 2023

Can someone help me understand what is happening if I call Alpaca without the sampling boilerplate. Intuitively, without sampling, the output should just be the most probable next words. However when I call the model, the output does not start with the sentence start token. Additionally, the words seem to be mixed up.

e.g.
Following is my input:

What is an alpaca?

I run the following to get the output:

    inp = tokenizer(prmpt, return_tensors='pt')
    output = model.base_model.model(inp['input_ids'].to(device), inp['attention_mask'].to(device))

Following is the output of tokenizer.decode(torch.argmax(output.logits[0], dim=1).tolist()):

' Below is the examplekalaca?\n'

This is not what I would expect.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Help regarding understanding Alpaca output without sampling. #342

{{title}}

Replies: 0 comments

Select a reply

Help regarding understanding Alpaca output without sampling. #342

aniquetahir Apr 14, 2023

Replies: 0 comments

aniquetahir
Apr 14, 2023