You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
.msg email file format has had several versions and it seems that Aleph doesn't parse all of them correctly. This leads to us needing to convert them to eml format before ingesting into Aleph. The tool I've been using to convert the msg emails is msgconvert (https://www.matijs.net/software/msgconv/) The current state is problematic as Aleph gives the perception that it does process them, but some might be processed correctly and some seem to only show parts of the body of the email and none of the attachments. If it is possible to detect the different versions and parse them accordingly, then we wouldn't necessarily need to pre-process them and journalists wouldn't be surprised by the results.
To Reproduce
Steps to reproduce the behavior:
Will share with you separately as the only examples I have are sensitive.
Expected behavior
All msg versions get parsed and ingested properly in Aleph.
Aleph version
4.0.0rc1
Screenshots
Cannot share.
Additional context
None
The text was updated successfully, but these errors were encountered:
Describe the bug
.msg email file format has had several versions and it seems that Aleph doesn't parse all of them correctly. This leads to us needing to convert them to eml format before ingesting into Aleph. The tool I've been using to convert the msg emails is msgconvert (https://www.matijs.net/software/msgconv/) The current state is problematic as Aleph gives the perception that it does process them, but some might be processed correctly and some seem to only show parts of the body of the email and none of the attachments. If it is possible to detect the different versions and parse them accordingly, then we wouldn't necessarily need to pre-process them and journalists wouldn't be surprised by the results.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
All msg versions get parsed and ingested properly in Aleph.
Aleph version
4.0.0rc1
Screenshots
Cannot share.
Additional context
None
The text was updated successfully, but these errors were encountered: