113 Commits

Author SHA1 Message Date
Isaac Riley
dac6d47dc2 uncomment testing code in sonar script 2022-03-22 13:04:11 +01:00
Isaac Riley
7d22db92cf added table tests for use with sonar master_14 2022-03-22 12:54:10 +01:00
Isaac Riley
635fb84811 post-monitoring debug, especially of deskewing and skew check master_13 2022-03-17 21:51:15 +01:00
Isaac Riley
fa479adfb0 manually added tests from test branch to avoid major conflicts master_12 2022-03-15 12:17:09 +01:00
Isaac Riley
a089fa5e42 first working version with new API master_11 2022-03-14 21:26:49 +01:00
Isaac Riley
8cd8c1b1f0 try yet another time because sonar hates me master_10 master_9 2022-03-09 16:03:27 +01:00
Isaac Riley
468c3353f4 fix again for sonar 2022-03-09 15:57:41 +01:00
Isaac Riley
1eb13f8d53 fix requirements for sonar 2022-03-09 15:55:03 +01:00
Isaac Riley
64a42b5be3 fix build problem involving build_venv/ 2022-03-09 15:47:20 +01:00
Christoph  Schabert
f24235b2f4 Pull request #10: Containerization
Merge in RR/vidocp from containerization to master

* commit '8b9621e798ed60d380eac5d54b8fe56b39324904':
  first fully working containerization; still needs environment variables; review request data format
  got container runningasdfa
  fully containerize app; first full attempt and still needs testing
  add new files for containerization; still some work to do, but want to merge in tests first
2022-03-09 15:28:26 +01:00
Isaac Riley
8b9621e798 first fully working containerization; still needs environment variables; review request data format 2022-03-08 10:01:25 +01:00
Isaac Riley
7784993d1f got container runningasdfa 2022-03-03 16:30:20 +01:00
Isaac Riley
66616c64c5 fully containerize app; first full attempt and still needs testing 2022-03-03 15:02:56 +01:00
Isaac Riley
bdd2146330 Merge branch 'master' into containerization 2022-03-02 11:52:35 +01:00
Isaac Riley
56bfba1227 merge test branch, fix conflicts 2022-03-02 11:07:34 +01:00
Isaac Riley
dc4c205435 add new files for containerization; still some work to do, but want to merge in tests first 2022-03-02 09:38:56 +01:00
Isaac Riley
ff84734ee8 add minor edits 2022-03-02 07:43:02 +01:00
llocarnini
496957051c added two tests for table_parsing.py
-testing number of parsed rectangles
-testing range of table coordinates (where to find a table)
2022-02-28 16:12:30 +01:00
Isaac Riley
4c06206445 merge minor changes before branching 2022-02-28 08:07:08 +01:00
Isaac Riley
44d4eb5a98 format and add functions in post_processing.py missing from merge 2022-02-25 12:34:34 +01:00
Isaac Riley
fc4789101f merge hugo into main; deskew already merged 2022-02-25 11:20:40 +01:00
llocarnini
6fb34735a2 Merge branch 'uncommon-tables' into hugo
# Conflicts:
#	.gitignore
2022-02-25 10:09:38 +01:00
Isaac Riley
596aa2d5a3 prepare for merge; minor edits 2022-02-25 09:46:20 +01:00
llocarnini
dcab29becb deleted not needed function 2022-02-25 08:59:19 +01:00
llocarnini
5b46367175 no new changes 2022-02-25 08:13:47 +01:00
Isaac Riley
2180ff924a make full demo 2022-02-23 13:41:57 +01:00
Isaac Riley
a09fe65ed2 minor changes to demo script 2022-02-22 16:04:11 +01:00
Isaac Riley
8ff5147ee4 change default deskew function from hough-line-based to pixel-histogram-based; use scipy.ndimage.rotation 2022-02-22 10:18:41 +01:00
Isaac Riley
59e082379c fix angle detection to make more sensitive to small angles; format with black 2022-02-21 16:52:24 +01:00
llocarnini
21c97298b4 adding modul for image rotation after rollback because of a gitpull fail 2022-02-21 15:21:42 +01:00
llocarnini
b19a9e35c8 added function to rotate pdfs which were scanned at an angle. Not working yet 2022-02-21 14:58:58 +01:00
Isaac Riley
08cce36940 add deskew.py to utils, also demo 2022-02-21 14:53:09 +01:00
llocarnini
2a68e1b221 kernel size for morphology ex set bit higher, so less non-table structure are detected. Reduced the kernel size of the directional motion blurr and increased the treshold a little bit so narrow cells wont be split up.
Problem with the cell filtering for certain scanned pdfs detected.
2022-02-18 16:36:25 +01:00
llocarnini
723c6606e1 kernel size for morphology ex set bit higher, so less non-table structure are detected. Reduced the kernel size of the directional motion blurr and increased the treshold a little bit so narrow cells wont be split up.
Problem with the cell filtering for certain scanned pdfs detected.
2022-02-18 16:35:50 +01:00
llocarnini
d70781f4aa changed tolerance in adjacent1 function in postprocessing.y from 2 to 4
added function so vertical and horizontal components do not overlap the layout box of the table
2022-02-17 16:45:55 +01:00
llocarnini
57ca47f38d different approaches to isolate line components of tables in scanned pdf files. 2022-02-16 12:37:17 +01:00
llocarnini
c2faf7d00b adjusted isolation of vertical and horizontal components to be more robust to scanned pages; work in progress 2022-02-14 11:04:04 +01:00
llocarnini
885fc22f9d added changes to parse scanned pdfs 2022-02-11 15:59:54 +01:00
llocarnini
07907d45dd some changes to fix some minor bugs in table_parsing.py and post_processing.py 2022-02-10 10:56:03 +01:00
llocarnini
4964c8f5a1 some changes to fix some minor bugs in table_parsing.py and post_processing.py 2022-02-10 10:22:22 +01:00
llocarnini
ee613f3e78 add to gitignore 2022-02-08 15:35:18 +01:00
Matthias Bisping
f7d3e39692 nix dolles 2022-02-08 15:05:12 +01:00
Matthias Bisping
87cecadb44 applied black 2022-02-06 21:27:39 +01:00
Matthias Bisping
295666c28f added todo comments 2022-02-06 21:25:01 +01:00
Matthias Bisping
90b8613bf8 filtering non-tables by bounding rect check WIP 2022-02-06 21:03:40 +01:00
Matthias Bisping
36284f9a78 removed obsolete lines 2022-02-06 20:01:00 +01:00
Matthias Bisping
0fc6cf8008 fixed bug in adjaceny test 2022-02-06 20:00:38 +01:00
Matthias Bisping
106b333dca filtering for connected cells... but does not quite work yet 2022-02-06 16:44:07 +01:00
llocarnini
27246f533a Merge branch 'master' of ssh://git.iqser.com:2222/rr/table_parsing into uncommon-tables
 Conflicts:
	scripts/annotate.py
	vidocp/table_parsig.py
2022-02-06 15:10:32 +01:00
Matthias Bisping
c9b2f6bf29 Pull request #9: Refactoring
Merge in RR/vidocp from refactoring to master

Squashed commit of the following:

commit 36a62a13e51148d2420cb12930e84d78629db6b0
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:54:53 2022 +0100

    refactoring

commit e652da1fa88a048f9a5211b4e8c0b96074fb5849
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:53:17 2022 +0100

    refactoring

commit d9567da428c81f9cd7971a657281df0a90166810
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:47:18 2022 +0100

    refactoring

commit 9d30009dceec0357db6499bfaffae8ce97718ee0
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:45:53 2022 +0100

    refactoring

commit e8863d67aaaff138fb088c4e496a91b6354cc059
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:42:45 2022 +0100

    refactoring

commit 89a99d3586db4fbafa743a45bdd02eaf0c1f341f
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:39:49 2022 +0100

    refactoring

commit aa66b6865b00b0490b9e7695a6bae386e6f96723
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:31:21 2022 +0100

    refactoring

commit 98d77cb522a08821c3a13ae2cffbe7239c654762
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:27:55 2022 +0100

    refactoring

commit fed3a7e4f1b8b7ca4e14f9e495459c26490fb50b
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:26:16 2022 +0100

    refactoring

commit 504cafbd5d4bba183d9943b36c60548aae34e402
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:25:44 2022 +0100

    renaming

commit c9780a57e5a048529d36958ba678eddb11759cef
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:24:41 2022 +0100

    removed obsolete import

commit d555e86475e82024f8e1a5fc5b0ac70faa091ee1
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Sun Feb 6 14:24:04 2022 +0100

    refactored figure detection once
2022-02-06 14:55:38 +01:00