Matthias Bisping 8dc2685d9a renaming
2022-02-05 14:44:32 +01:00
2022-02-05 14:44:32 +01:00
2022-02-05 14:44:32 +01:00
2022-01-21 11:18:53 +01:00
2022-02-05 14:43:01 +01:00
2022-02-04 12:49:51 +01:00
2022-01-20 15:54:02 +01:00

Table Parsing

This repository implements computer vision based approaches for detecting and parsing visual features such as tables or previous redactions in PDFs.

Installation

git clone ssh://git@git.iqser.com:2222/rr/table_parsing.git
cd table_parsing

python -m venv env
source env/bin/activate

pip install -e .
pip install -r requirements.txt

Usage

# Parse tables on second page of a PDF
python scripts/annotate.py <path to pdf> 1 --type table

# Detect redactions (black filled rectangles) on first page of a PDF
python scripts/annotate.py <path to pdf> 0 --type redaction
Description
Analysis container service for visual (CV-based) document parsing
Readme 58 MiB
2025-01-16 09:31:10 +01:00
Languages
Python 91.1%
Shell 3%
Makefile 2.4%
Dockerfile 2.3%
Nix 1.2%