23 Commits

Author SHA1 Message Date
cdietrich
dcab1e8616 black 2022-09-30 09:59:31 +02:00
Julius Unverfehrt
ce9e92876c Pull request #16: Add table parsing fixtures
Merge in RR/cv-analysis from add_table_parsing_fixtures to master

Squashed commit of the following:

commit cfc89b421b61082c8e92e1971c9d0bf4490fa07e
Merge: a7ecb05 73c66a8
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Mon Jul 11 12:19:01 2022 +0200

    Merge branch 'master' of ssh://git.iqser.com:2222/rr/cv-analysis into add_table_parsing_fixtures

commit a7ecb05b7d8327f0c7429180f63a380b61b06bc3
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Mon Jul 11 12:02:07 2022 +0200

    refactor

commit 466f217e5a9ee5c54fd38c6acd28d54fc38ff9bb
Author: llocarnini <lillian.locarnini@iqser.com>
Date:   Mon Jul 11 10:24:14 2022 +0200

    deleted unused imports and unused lines of code

commit c58955c8658d0631cdd1c24c8556d399e3fd9990
Author: llocarnini <lillian.locarnini@iqser.com>
Date:   Mon Jul 11 10:16:01 2022 +0200

    black reformatted files

commit f8bcb10a00ff7f0da49b80c1609b17997411985a
Author: llocarnini <lillian.locarnini@iqser.com>
Date:   Tue Jul 5 15:15:00 2022 +0200

    reformat files

commit 432e8a569fd70bd0745ce0549c2bfd2f2e907763
Author: llocarnini <lillian.locarnini@iqser.com>
Date:   Tue Jul 5 15:08:22 2022 +0200

    added better test for generic pages with table WIP as thicker lines create inconsistent results.
    added test for patchy tables which does not work yet

commit 2aac9ebf5c76bd963f8c136fe5dd4c2d7681b469
Author: llocarnini <lillian.locarnini@iqser.com>
Date:   Mon Jul 4 16:56:29 2022 +0200

    added new fixtures for table_parsing_test.py

commit 37606cac0301b13e99be2c16d95867477f29e7c4
Author: llocarnini <lillian.locarnini@iqser.com>
Date:   Fri Jul 1 16:02:44 2022 +0200

    added separate file for table parsing fixtures, where fixtures for generic tables were added. WIP tests for generic table fixtures
2022-07-11 12:25:16 +02:00
Julius Unverfehrt
a0abae195c update dependencies 2022-06-23 16:30:53 +02:00
llocarnini
179ad20165 minor changes, refactoring and testfiles added 2022-05-17 09:17:24 +02:00
Isaac Riley
0b96980cc5 keyword 'show' to fix annotation script without causing problems for non-script usage 2022-04-11 09:44:47 +02:00
Isaac Riley
af898a37ac fixed naming errors 2022-03-23 13:55:30 +01:00
Isaac Riley
8730b34018 change name from vidocp to cv-analysis 2022-03-23 13:46:57 +01:00
Isaac Riley
7d22db92cf added table tests for use with sonar 2022-03-22 12:54:10 +01:00
Isaac Riley
a089fa5e42 first working version with new API 2022-03-14 21:26:49 +01:00
Isaac Riley
64a42b5be3 fix build problem involving build_venv/ 2022-03-09 15:47:20 +01:00
Isaac Riley
56bfba1227 merge test branch, fix conflicts 2022-03-02 11:07:34 +01:00
llocarnini
496957051c added two tests for table_parsing.py
-testing number of parsed rectangles
-testing range of table coordinates (where to find a table)
2022-02-28 16:12:30 +01:00
Isaac Riley
fc4789101f merge hugo into main; deskew already merged 2022-02-25 11:20:40 +01:00
llocarnini
6fb34735a2 Merge branch 'uncommon-tables' into hugo
# Conflicts:
#	.gitignore
2022-02-25 10:09:38 +01:00
Isaac Riley
596aa2d5a3 prepare for merge; minor edits 2022-02-25 09:46:20 +01:00
Isaac Riley
08cce36940 add deskew.py to utils, also demo 2022-02-21 14:53:09 +01:00
llocarnini
57ca47f38d different approaches to isolate line components of tables in scanned pdf files. 2022-02-16 12:37:17 +01:00
llocarnini
885fc22f9d added changes to parse scanned pdfs 2022-02-11 15:59:54 +01:00
llocarnini
ee613f3e78 add to gitignore 2022-02-08 15:35:18 +01:00
llocarnini
a68f89af03 added files to git ignore 2022-01-27 00:20:16 +01:00
llocarnini
cf5851b652 changes for repush 2022-01-26 11:50:13 +01:00
llocarnini
6f346a6cad changes so no single rectangle is parsed as table in table_parsig.py 2022-01-26 09:42:59 +01:00
llocarnini
1cf8508dc3 changes of parameters in table parsing: l15 line_min_width = 5 so no cell is missing in tables, l37 bigger min. rectangle so no text will be detected as table 2022-01-24 16:55:29 +01:00