Using as a package/construct in external cdk apps #32
Comments
Hi @OperationalFallacy! Yes, it is possible to use the constructs externally by simply referencing them as NPM modules. All packages are available on the NPM registry. However, we deliberately do not document this, as the NPM packages might break or change until we reach
Hey @HQarroum |
Hi Roman, I'm so sorry I didn't get back to you on this! To answer your first question on the issue, I think that the issue you were encountering with external dependencies is because ESM needs to be enabled. I use the following
Next, regarding the roadmap, it is here, but I have failed to keep it up to date over the past months because of the rate at which I was investing in new middlewares.
Regarding Step Functions integration, I thought about it for many months, and it is something I wanted to get, using a
Regarding your OCR use case, can't you try to use the
That was a bit long, sorry for that :).
Oh, yeah, ESM, thanks for pointing that out. I've started moving my projects away from CommonJS, and that will probably solve the problem.
Regarding Step Functions, they're not necessary. Textract async processing publishes its results to an SNS topic, so it probably integrates with your framework. An async call can process multiple documents in parallel. https://docs.aws.amazon.com/textract/latest/dg/api-async.html
I think they used Step Functions in the amazon-textract-idp-cdk repo specifically for routing and pre-processing logic. Textract has a lot of options and output formats for processing.
The models can't do OCR well; Textract and similar services produce structured output, including form data. I'm even using it to extract text from PDF documents that are not images, i.e. documents that are already text. Why not use open-source packages to parse PDFs? Because of the structured output and how robust the parsing is.
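To make the SNS point above concrete, here is a minimal TypeScript sketch, assuming a Lambda function is subscribed to the topic that Textract notifies when an async job finishes. The message fields follow the Textract async API documentation linked above; the `SnsEvent` type is a trimmed-down stand-in for the real Lambda event, and `succeededJobs` is a hypothetical helper name, not part of Lakechain or the AWS SDK.

```typescript
// Completion message that Textract publishes to SNS for asynchronous jobs.
interface TextractJobNotification {
  JobId: string;
  Status: string; // for example "SUCCEEDED" or "FAILED"
  API: string; // the async operation that was started
  JobTag?: string;
  Timestamp: number;
  DocumentLocation: { S3ObjectName: string; S3Bucket: string };
}

// Assumption: minimal subset of an SNS-triggered Lambda event.
interface SnsEvent {
  Records: { Sns: { Message: string } }[];
}

// Hypothetical helper: parse every SNS record and keep only the jobs
// that completed successfully, so their results can be fetched next.
function succeededJobs(event: SnsEvent): TextractJobNotification[] {
  return event.Records
    .map((record) => JSON.parse(record.Sns.Message) as TextractJobNotification)
    .filter((job) => job.Status === 'SUCCEEDED');
}
```

Each returned `JobId` could then be passed to the matching Textract "get" operation to retrieve the structured output. Several documents processed in parallel simply show up as several notifications.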
Got it!
Definitely, it's possible to create a Textract middleware, or an additional engine in the existing PDF Text Converter middleware.
Great input, I'm adding it to my roadmap. Thanks, Roman.
You're right. The existing one could be easily refactored to use a different underlying service. I might get to this when I need more file types processed. I'll probably convert the existing OCR sfn into middleware. Thank you for a great project!
What were you searching in the docs?
Hi,
I wanted to add the resources from Lakechain to my own CDK app and realized that the constructs rely on custom-compiled middleware.
Is it possible to use Lakechain constructs in this way? Or is this monorepo not designed for such use cases?
Thank you