Replies: 4 comments
-
2023-05-16
fswatch -r -x /Volumes/vault
/Volumes/vault/AND08journal.pdf IsFile Renamed
/Volumes/vault/fits-1.5.1.zip IsFile Renamed
/Volumes/vault/libmediainfo-23.04-x86_64.Arch_Extra.pkg.tar.zst IsFile Renamed
/Volumes/vault/.DS_Store AttributeModified IsFile Updated
/Volumes/vault/test_files IsDir Renamed
/Volumes/vault/.DS_Store AttributeModified IsFile Updated
/Volumes/vault/zettelkasten_v3.alfredworkflow IsFile Removed
/Volumes/vault/ocr2text Created AttributeModified IsDir AttributeModified
/Volumes/vault/ocr2text/.DS_Store IsFile
/Volumes/vault/ocr2text/.DS_Store OwnerModified Created PlatformSpecific AttributeModified IsFile AttributeModified
/Volumes/vault/ocr2text/LICENSE IsFile
2023-05-15
2023-05-14
-
2023-06-07
-
2023-06-10
-
2023-07-17
2023-07-05
{
  "tools": {
    "exiftool": {
      "requires": [],
      "provides": ["date", "format", "coverage"]
    },
    "tesseract_ocr": {
      "requires": [
        "format:image/jpeg",
        "format:image/png",
        "format:image/gif"
      ],
      "provides": ["description"]
    },
    "ocrmypdf": {
      "requires": ["format:application/pdf"],
      "provides": []
    },
    "pdftotext": {
      "tool": "pdftotext",
      "requires": ["format:application/pdf"],
      "provides": ["description"]
    },
    "pdftotext_postprocess": {
      "tool": "pdftotext",
      "requires": ["format:application/pdf", "*ocrmypdf"],
      "provides": ["description"]
    },
    "context": {
      "requires": [],
      "provides": [
        "creator",
        "type",
        "contributor",
        "coverage",
        "publisher",
        "source"
      ]
    },
    "garmin_connect": {
      "requires": ["format:application/json", "type", "source"],
      "provides": ["subject", "title", "description", "%image"]
    },
    "poliglot": {
      "requires": ["title", "description"],
      "provides": ["language"]
    },
    "video2text": {
      "requires": ["format:video/mp4"],
      "provides": ["subject"]
    },
    "video-transcription": {
      "requires": ["format:video/mp4"],
      "provides": ["description"]
    },
    "video-ocr": {
      "requires": ["format:video/mp4"],
      "provides": ["description"]
    }
  }
}
Some tools can only run on certain file types, or only if specific information is available. This should also be configurable. Once a document has been scanned, the analyze task would determine the order in which the tools need to run, and then run each of them. This way, tools that don't depend on others can run first, and once we have their output, we can run the tools that depend on it. Tools that don't provide anything are usually used as input to another tool: in the config above, ocrmypdf provides nothing, and pdftotext_postprocess lists *ocrmypdf among its requirements. The purpose is to add as much metadata as possible, in order to be able to make connections between documents.
2023-06-21
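To make the ordering idea behind the requires/provides config concrete, here is a minimal sketch in Python. It is not the project's analyze task. The function name `plan_order` is hypothetical, and the way the entries are read is an assumption: `format:...` entries are treated as "the file must be one of these types", `*name` as "tool `name` must already have run", and any other entry as a metadata field that an earlier tool must have provided. The `TOOLS` dict is a subset of the config above, deliberately listed out of order.

```python
# Subset of the config above, listed out of order on purpose.
TOOLS = {
    "poliglot": {"requires": ["title", "description"], "provides": ["language"]},
    "pdftotext_postprocess": {
        "requires": ["format:application/pdf", "*ocrmypdf"],
        "provides": ["description"],
    },
    "ocrmypdf": {"requires": ["format:application/pdf"], "provides": []},
    "exiftool": {"requires": [], "provides": ["date", "format", "coverage"]},
    "tesseract_ocr": {
        "requires": ["format:image/jpeg", "format:image/png", "format:image/gif"],
        "provides": ["description"],
    },
}


def plan_order(tools, mime_type):
    """Return the tool names that can run for this file, in a valid order.

    Assumed semantics (not confirmed by the thread):
      "format:<mime>"  file type must match one of the listed formats
      "*<tool>"        that tool must already be scheduled
      anything else    metadata field that an earlier tool must provide
    """
    available = set()   # metadata fields produced by scheduled tools
    done = []           # scheduled tools, in run order
    pending = dict(tools)

    def is_ready(spec):
        formats = [r.split(":", 1)[1] for r in spec["requires"]
                   if r.startswith("format:")]
        if formats and mime_type not in formats:
            return False
        for req in spec["requires"]:
            if req.startswith("format:"):
                continue
            if req.startswith("*"):
                if req[1:] not in done:
                    return False
            elif req not in available:
                return False
        return True

    # Keep sweeping until a full pass schedules nothing new.
    progress = True
    while progress:
        progress = False
        for name in list(pending):
            if is_ready(pending[name]):
                done.append(name)
                available.update(pending.pop(name)["provides"])
                progress = True
    return done


if __name__ == "__main__":
    print(plan_order(TOOLS, "application/pdf"))
    print(plan_order(TOOLS, "image/png"))
```

Tools whose requirements are never met (here, poliglot, because nothing in this subset provides a title) are simply left out of the plan. Whether that is the desired behavior, or whether it should be reported, is a design choice the notes leave open.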
2023-06-14
2023-06-13
2023-06-12
-
Whenever I get a chance to work on this, I write down notes or run small experiments. Since I've adopted the "build in public" mindset, and to maybe spark a conversation with contributors, I'm going to post my notes in this thread.