2022-02-05 14:42:00 +01:00

Table Parsing

This repository implements computer-vision-based approaches for detecting and parsing visual features in PDF pages, such as tables or previously applied redactions.

Installation

git clone ssh://git@git.iqser.com:2222/rr/table_parsing.git
cd table_parsing

python3 -m venv env
source env/bin/activate

pip install -e .
pip install -r requirements.txt

Usage

# Parse tables on the second page of a PDF (page indices are zero-based)
python scripts/annotate.py <path to pdf> 1 --type table

# Detect redactions (black filled rectangles) on the first page of a PDF
python scripts/annotate.py <path to pdf> 0 --type redaction
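
To illustrate what "black filled rectangles" means in practice, here is a minimal pure-Python sketch of the idea. It is not the code used by scripts/annotate.py, whose implementation is not shown in this README. The function name find_filled_rectangles, its parameters, and the page representation (a grayscale image as nested lists of 0-255 values) are all assumptions made for this example. The sketch groups dark pixels into connected regions and keeps a region only if it almost completely fills its bounding box.

```python
from collections import deque


def find_filled_rectangles(gray, dark_threshold=50, min_area=4, min_fill=0.95):
    """Return inclusive (top, left, bottom, right) boxes of dark, nearly solid rectangles.

    Hypothetical helper for illustration only; thresholds are assumed defaults.
    """
    height = len(gray)
    width = len(gray[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if seen[y][x] or gray[y][x] > dark_threshold:
                continue
            # Flood-fill one 4-connected component of dark pixels.
            queue = deque([(y, x)])
            seen[y][x] = True
            top, left, bottom, right = y, x, y, x
            pixel_count = 0
            while queue:
                cy, cx = queue.popleft()
                pixel_count += 1
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx]
                            and gray[ny][nx] <= dark_threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # A redaction-like region covers (almost) its whole bounding box.
            box_area = (bottom - top + 1) * (right - left + 1)
            if box_area >= min_area and pixel_count / box_area >= min_fill:
                boxes.append((top, left, bottom, right))
    return boxes


# Synthetic 10 x 12 white page with one solid black block and one L-shaped stroke.
page = [[255] * 12 for _ in range(10)]
for row in range(2, 5):
    for col in range(3, 9):
        page[row][col] = 0
for row, col in ((6, 9), (7, 9), (8, 9), (8, 10), (8, 11)):
    page[row][col] = 0

print(find_filled_rectangles(page))  # [(2, 3, 4, 8)]
```

The L-shaped stroke is rejected because it covers only 5 of the 9 pixels in its bounding box. On real page renderings, an image-processing library would typically replace the hand-written flood fill.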