Skip to content

A repository to detect whether a PDF is written in the XML Forms Architecture

License

Notifications You must be signed in to change notification settings

fartbagxp/pdf-xfa-detection

Repository files navigation

Overview

This is a research repository on the direction of PDF documents and extracting various PDF metadata for information.

The research can be found here.

Source of Information

VA Lighthouse API

Running locally

  • Install NodeJS 12+.
  • Install all the 3rd party libraries npm install.
  • Write out the environment variables in a .env file (follow .sample.env).
  • Run npm run download to download all the PDF files locally.
  • Run npm run parse to parse the metadata and output the summary locally.

Helpful Links

About

A repository to detect whether a PDF is written in the XML Forms Architecture

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published