OpenCV Document Scanner

A demo API for document detection and extraction using OpenCV, served by FastAPI over WebSocket.

A simple demo IPython notebook is available in demo.ipynb.

Usage

TL;DR

Deployment

Launch (and tear down) the Docker containers for the FastAPI server using the following commands:

docker-compose -f docker-compose.yml -f docker-compose.prod.yml up -d --build
docker-compose down --remove-orphans

Request: The client shall send a JSON request containing these fields:
- image: A string containing image content encoded in Base64.
- doc_type: An integer (optional, defaults to 0). Indicates document type. This is ignored for now.
Response: The client shall receive a JSON response containing these fields:
- det_success: A boolean. Indicates whether document detection succeeded or not.
- doc: Extracted document image encoded in Base64.
- doc_points: A list of floats. The corner points of the detected document.
- doc_vis: Visualisation of the document image after some pre-processing, encoded in Base64.
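As an illustration of the message format above, here is a minimal client-side sketch using only the standard library. The helper names `build_request` and `parse_response` are hypothetical and not part of the project. The sketch also assumes that `image` carries the bytes of an encoded image file (e.g. PNG or JPEG), which the field descriptions do not state explicitly.

```python
import base64
import json


def build_request(image_bytes: bytes, doc_type: int = 0) -> str:
    """Serialise an image into the JSON request described above."""
    payload = {
        "image": base64.b64encode(image_bytes).decode("ascii"),
        "doc_type": doc_type,  # optional; currently ignored by the server
    }
    return json.dumps(payload)


def parse_response(message: str) -> dict:
    """Decode the JSON response; Base64 image fields become raw bytes."""
    data = json.loads(message)
    result = {
        "det_success": bool(data["det_success"]),
        # Passed through unchanged: the exact layout of the points is not assumed.
        "doc_points": data.get("doc_points", []),
    }
    for key in ("doc", "doc_vis"):
        value = data.get(key)
        result[key] = base64.b64decode(value) if value else None
    return result
```

The string returned by `build_request` is what would be sent over the WebSocket connection; `parse_response` is applied to the text message received in reply.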
Environment variables can be set in the .env file in the root directory.
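For boolean settings such as DET_SIFT_FEATURE (see Explanation), a flag might be read from the environment roughly as follows. This is only a sketch: the helper `env_flag` is hypothetical, and the set of spellings treated as "true" is an assumption rather than the project's actual parsing rule.

```python
import os


def env_flag(name: str, default: bool = False) -> bool:
    """Interpret an environment variable as a boolean flag."""
    raw = os.environ.get(name)
    if raw is None:
        return default
    # Assumed truthy spellings; anything else counts as False.
    return raw.strip().lower() in {"1", "true", "yes", "on"}


# Falls back to False (simple mode) when the variable is unset.
use_sift_features = env_flag("DET_SIFT_FEATURE")
```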

Debug / Visualisation

After the server is launched, open http://localhost:5000/ in a browser to reach a test page.

Development

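# Start the development containers in the background
docker-compose -f docker-compose.yml -f docker-compose.dev.yml up -d
# List containers
docker-compose ps
# Open a shell inside the scanner container
docker-compose exec scanner bash
# Stop and remove the containers
docker-compose down --remove-orphans

# Without docker-compose: build the image, then run it with the source directory mounted
docker build -t scanner/python:3.7.10 .
# Windows (cmd.exe)
docker run -it --ipc=host -v %cd%:/master/scanner -p 5000:5000 --rm scanner/python:3.7.10
# Linux / macOS
docker run -it --ipc=host -v $(pwd):/master/scanner -p 5000:5000 --rm scanner/python:3.7.10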

Explanation

User <-- WebSocket --> FastAPI <---> Document Detector

The client communicates with the FastAPI server over the WebSocket protocol. The client sends an image containing the document to be extracted (the query image), which is then processed by the document detector.
The document detector can operate in one of two modes: simple and features. The mode is selected by setting the DET_SIFT_FEATURE environment variable.
- Simple (DET_SIFT_FEATURE = False, default): Otsu thresholding is performed on the hue image after some pre-processing. The document corner points are then estimated via a contour operation. This mode is faster, but relies heavily on having a clean (single-colour) background with high contrast (a colour different from the document).
- Features (DET_SIFT_FEATURE = True): Local features (SIFT by default) are extracted from a reference document image and the query image. These image features / keypoints are then matched using either a brute-force matcher (default) or a FLANN-based matcher. The matched keypoints are then used to estimate a homography matrix using either LMEDS (default) or RANSAC. The homography matrix is then used to compute the document corner points. This mode is slower, but should be more flexible and less reliant on having a clean background.
After the document corner points are obtained, a perspective transform is performed to extract the document from the query image.

Limitations / Known Issues

FastAPI / Uvicorn

The WebSocket connection is dropped when uploading a Base64-encoded file larger than 1 MB.
- Workaround: use Hypercorn, which has a default message size limit of 16 MB.
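Because Base64 encodes every 3 bytes as 4 characters, an image hits the limit well before its raw size reaches 1 MB. The sketch below estimates whether a payload fits. The helper names are hypothetical, the limit is taken as 1 MiB, and the 64-byte allowance for the surrounding JSON is an assumption.

```python
WS_LIMIT_BYTES = 1024 * 1024  # assumed: the 1 MB limit quoted above, taken as 1 MiB


def base64_length(raw_size: int) -> int:
    """Length in characters of the padded Base64 encoding of raw_size bytes."""
    return 4 * ((raw_size + 2) // 3)


def fits_in_message(raw_size: int, limit: int = WS_LIMIT_BYTES, overhead: int = 64) -> bool:
    """True if the encoded image plus a small JSON overhead stays within the limit."""
    return base64_length(raw_size) + overhead <= limit
```

Under these assumptions, raw images above roughly 768 KB would need to be resized or recompressed before sending.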

Installing `docker-compose` on Linux

# Download the current stable release of Docker Compose
sudo curl -L "https://github.com/docker/compose/releases/download/1.28.5/docker-compose-$(uname -s)-$(uname -m)" -o /usr/local/bin/docker-compose
# Apply executable permissions to the binary
sudo chmod +x /usr/local/bin/docker-compose
# Create a symbolic link
sudo ln -s /usr/local/bin/docker-compose /usr/bin/docker-compose
# Check version
docker-compose --version
# Command completion (optional)
sudo curl -L https://raw.githubusercontent.com/docker/compose/1.28.5/contrib/completion/bash/docker-compose -o /etc/bash_completion.d/docker-compose

Repository: jiahuei/document-scanner-opencv-ws