DocIIW

Repository for the paper "Intrinsic Decomposition of Document Images In-the-Wild" (BMVC '20)

Quick Links: PDF | arXiv | Talk | Supplementary

Updates

Sep 5th, 2020: Initial data is released (90K images).
Mar 20th, 2021: Evaluation images are released.
Nov 8th, 2022: Training Code and models.
Coming Soon: Training details.

Doc3DShade

Doc3DShade extends Doc3D with realistic lighting and shading. Follows a similar synthetic rendering procedure using captured document 3D shapes but final image generation step combines real shading of different types of paper materials under numerous illumination conditions.
Following figure illustrates the image generation pipeline:

Following figure shows a side-by-side comparison of images in Doc3DShade and Doc3D:

Data Download Instructions

Doc3Dshade contains 90K images, 80K used for training and 10K for validation. Split used in the paper: train, val

Download the input images from img.zip .
Download the white-balanced images from wbl.zip .
Download synthetic textures from alb.zip .

Training Instructions

Upcoming

Pre-trained Models

All models: GDrive Link
WBNet: GDrive Link
SMTNet: GDrive Link
SMTNet(w/ adversarial loss): GDrive Link

Evaluation Images and Results

Real test images are given in: /testimgs/real
Shading removed real test images:
- Basic: GDrive Link
Shading removed DocUNet [1] images are available at:
- Basic: GDrive Link
- With adversarial loss: GDrive Link
Shading removed and unwarped [2] DocUNet [1] images are available at:
- Basic: GDrive Link
- With adversarial loss: GDrive Link

Citation:

If you use the dataset, please consider citing our work-

@inproceedings{DasDocIIW20,
  author    = {Sagnik Das, Hassan Ahmed Sial, Ke Ma, Ramon Baldrich, Maria Vanrell and Dimitris Samaras},
  title     = {Intrinsic Decomposition of Document Images In-the-Wild},
  booktitle = {31st British Machine Vision Conference 2020, {BMVC} 2020, Manchester, UK, September 7-10, 2020},
  publisher = {{BMVA} Press},
  year      = {2020},
}

References:

[1] DocUNet: https://www3.cs.stonybrook.edu/~cvl/docunet.html

[2] DewarpNet: https://sagniklp.github.io/dewarpnet-webpage/

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
assets		assets
checkpoints		checkpoints
loaders		loaders
models		models
testimgs		testimgs
LICENSE.txt		LICENSE.txt
README.md		README.md
loss.py		loss.py
test.py		test.py
test_shdmat.py		test_shdmat.py
train.py		train.py
trainmattex.py		trainmattex.py
trainmattexadvshd.py		trainmattexadvshd.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocIIW

Updates

Doc3DShade

Data Download Instructions

Training Instructions

Pre-trained Models

Evaluation Images and Results

Citation:

References:

About

Releases

Packages

Languages

License

cvlab-stonybrook/DocIIW

Folders and files

Latest commit

History

Repository files navigation

DocIIW

Updates

Doc3DShade

Data Download Instructions

Training Instructions

Pre-trained Models

Evaluation Images and Results

Citation:

References:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages