file2md

A web-based file converter that transforms various file formats into Markdown. Conversions are handled by Microsoft's MarkItDown library.

Supported Formats

  • PDF (.pdf)
  • PowerPoint (.pptx)
  • Word (.docx)
  • Excel (.xlsx)
  • Images (EXIF metadata and OCR)
  • Audio (EXIF metadata and speech transcription)
  • HTML (special handling of Wikipedia, etc.)
  • Various other text-based formats (csv, json, xml, etc.)

Running the Application

docker run -p 5000:5000 maheshrijal/file2md

or, from the GitHub Container Registry:

docker run -p 5000:5000 ghcr.io/maheshrijal/file2md

Access the Application

Open your browser and go to:

http://localhost:5000
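
If the page does not load straight away, the container may still be starting. The following is a minimal sketch, not part of file2md, of a reachability check using only the Python standard library. The helper name `is_reachable` is hypothetical, and it assumes the default `5000:5000` port mapping from the commands above.

```python
import urllib.error
import urllib.request


def is_reachable(url: str = "http://localhost:5000", timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at `url`, whatever the status code.

    Hypothetical helper for illustration; the default URL assumes the
    5000:5000 port mapping used in the docker run commands.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server responded, just not with a 2xx status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False


if __name__ == "__main__":
    print("file2md is up" if is_reachable() else "file2md is not reachable yet")
```

A `False` result most likely means the container is not running or the host port differs from the one in the URL.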