Problem in nltkvid18.py (the charset shows unknown 8 bit in positive.txt and negative.txt) #3

Open
@pratimaupadhyay02

Sir,
When I run nltkvid18.py, I get the following error:
UnicodeDecodeError: 'ascii' codec can't decode byte 0xed in position 6: ordinal not in range(128)
I am attaching the screenshot here
[Screenshot: issue1]
I found that the files positive.txt and negative.txt are reported with an unknown charset instead of ascii or utf-8, so the script was not able to read them.
I ran "cat -v negative.txt > negative1.txt" and "cat -v positive.txt > positive1.txt" to create ASCII-only copies named negative1.txt and positive1.txt (cat -v rewrites non-ASCII bytes in M- notation, so the output is plain ASCII). With these files, the script worked.
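
An alternative that leaves the original files untouched is to decode them with an explicit encoding when reading. The sketch below is only an assumption about what might work here: the helper name read_reviews is hypothetical, and latin-1 is a guess at the encoding (byte 0xed is "í" in latin-1, and latin-1 can decode any byte, so it never raises this error, though the resulting characters are only right if the guess is correct).

```python
import io


def read_reviews(path, encoding="latin-1", errors="strict"):
    """Read a text file, decoding its bytes with an explicit encoding.

    The latin-1 default is an assumption about the review files,
    not something confirmed by the repository.
    """
    # io.open accepts an encoding argument on both Python 2 and Python 3
    # and returns unicode text instead of raw bytes.
    with io.open(path, "r", encoding=encoding, errors=errors) as handle:
        return handle.read()


# Hypothetical usage, assuming the files sit next to the script:
# short_pos = read_reviews("positive.txt")
# short_neg = read_reviews("negative.txt")
```

Unlike the cat -v workaround, this keeps accented characters as real characters instead of replacing them with M- sequences.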
