You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Tika-python also doesn't work properly under Python 3:
from tika import detector
print(detector.from_file('/path/to/file'))
Results in:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa9 in position 6: invalid start byte
There are some Python modules that wrap around libmagic (e.g. python-magic and filemagic) , but they require libmagic binaries to be installed separately which is a bit of a pain on Windows.
Other possibility: write custom detection function for only WAV, FLAC and ISO based on the Tika signatures:
Perhaps FIDO could be used for this?
The text was updated successfully, but these errors were encountered: