Int4(llama-2-chat-7b) converted model generates response in German language #1683

@bhardwaj-nakul

Description

Describe the bug

  1. Install all the pip dependencies for the latest 254-llmchatbot notebook.
  2. Follow the steps to convert the "llama-2-chat-7b" model to INT4 format with the default configuration.
  3. Select "CPU" as the device.
  4. Select "INT4" as the model to run.
  5. Run the step to load and compile the model.
  6. Set max_new_tokens=500 and run ov_model.generate with the prompt "Describe Intel in 100 words or less".
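One thing worth checking alongside the steps above: chat-tuned llama-2 checkpoints expect the Llama-2 chat prompt template, and passing a bare prompt to generate can produce erratic completions (occasionally in the wrong language). A minimal sketch of that template in pure Python (the default system message here is an assumption, not the notebook's exact code):

```python
def build_llama2_chat_prompt(user_message: str,
                             system_message: str = "You are a helpful assistant.") -> str:
    """Wrap a user message in the Llama-2 chat prompt template.

    Chat-tuned llama-2 models are trained on this [INST]/<<SYS>> framing;
    a raw, unwrapped prompt is out-of-distribution for them.
    """
    return (
        f"<s>[INST] <<SYS>>\n{system_message}\n<</SYS>>\n\n"
        f"{user_message} [/INST]"
    )

# The wrapped prompt would then be tokenized and passed to ov_model.generate.
prompt = build_llama2_chat_prompt("Describe Intel in 100 words or less")
```

This does not rule out a quantization problem, but it separates prompt-formatting effects from INT4-compression effects.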

Expected behavior

Output should be produced in English. However, the output is generated in German.

  1. Is there any issue with converting the llama-2-chat-7b model to INT4 format with OpenVINO?
  2. Is the issue caused by the latest openvino==2023.3.0 or nncf==2.9.0.dev0+84b46f58?

Screenshots
Screenshots 1–3 (images not reproduced here)

Additional context
I tried varying model_compression_params, but it did not resolve the issue. The following four variants of the "llama-2-chat-7b" entry were tried (one at a time; listed together here for comparison):

"llama-2-chat-7b": {
    "mode": nncf.CompressWeightsMode.INT4_SYM,
    "group_size": 128,
    "ratio": 0.8,
},
"llama-2-chat-7b": {
    "mode": nncf.CompressWeightsMode.INT4_ASYM,
    "group_size": 128,
    "ratio": 0.8,
},
"llama-2-chat-7b": {
    "mode": nncf.CompressWeightsMode.INT4_SYM,
    "group_size": 64,
    "ratio": 0.8,
},
"llama-2-chat-7b": {
    "mode": nncf.CompressWeightsMode.INT4_SYM,
    "group_size": 64,
    "ratio": 0.6,
},
