58 Commits

Author SHA1 Message Date
Matthias Bisping
a5cd3d6ec9 [WIP] recursive random table 2023-01-18 13:11:15 +01:00
Matthias Bisping
893622a73e [WIP] recursive random table 2023-01-18 11:45:19 +01:00
Matthias Bisping
4d11a157e5 Cache font selection 2023-01-18 09:39:04 +01:00
Matthias Bisping
4c10d521e2 [WIP] random font selection 2023-01-17 14:58:54 +01:00
Matthias Bisping
0f6cbec1d5 Refactoring 2023-01-17 13:43:12 +01:00
Matthias Bisping
54484d9ad0 [WIP] random table segments: Table via tabulate and text -> image 2023-01-17 13:23:53 +01:00
Matthias Bisping
ca190721d6 [WIP] random table segments & refactoring 2023-01-17 13:17:33 +01:00
Matthias Bisping
5611314ff3 [WIP] random table segments 2023-01-17 11:42:11 +01:00
Matthias Bisping
4ecfe16df5 Constrain possible random layouts 2023-01-17 11:12:25 +01:00
Matthias Bisping
38c0614396 Assign box type by box aspect ratio 2023-01-17 10:59:53 +01:00
Matthias Bisping
64565f9cb0 Complete first iteraton of random plot generation 2023-01-17 10:55:09 +01:00
Matthias Bisping
232c6bed4b Refactoring: Rename 2023-01-17 09:54:50 +01:00
Matthias Bisping
8d34873d1c [WIP] random plot segments 2023-01-16 19:33:46 +01:00
Matthias Bisping
78a951a319 [WIP] random plot segments 2023-01-16 18:42:34 +01:00
Matthias Bisping
8d57d2043d [WIP] random text segments 2023-01-16 18:18:22 +01:00
Matthias Bisping
41fdda4955 [WIP] random text segments 2023-01-16 17:55:20 +01:00
Matthias Bisping
4dfdd579a2 [WIP] random text segments 2023-01-16 17:41:30 +01:00
Matthias Bisping
e831ab1382 [WIP] random text segments 2023-01-16 17:17:50 +01:00
Matthias Bisping
6fead2d9b9 [WIP] random text segments 2023-01-16 16:34:18 +01:00
Matthias Bisping
1012988475 Remove obsolete code 2023-01-16 13:35:59 +01:00
Matthias Bisping
5bc1550eae Complete page partitioning into empty boxes
Completed logic for partitioning page into content boxes. Next step is
to fill content boxes with random content.
2023-01-16 13:32:38 +01:00
Matthias Bisping
29741fc5da [WIP] random content box generation 2023-01-16 12:07:56 +01:00
Matthias Bisping
4772e3037c Remove obsolete code 2023-01-16 11:16:27 +01:00
Matthias Bisping
dd6ab94aa2 [WIP] Replace texture generation with loadig textures from files 2023-01-16 10:59:13 +01:00
Matthias Bisping
eaca8725de Balance colors of base textures
Make base textures more similar in color balance
2023-01-16 10:19:05 +01:00
Matthias Bisping
4af202f098 Add base paper textures 2023-01-16 10:09:34 +01:00
Matthias Bisping
4578413748 Improve page texture logic 2023-01-11 14:05:39 +01:00
Matthias Bisping
d5d67cb064 Fix image format (RGB/A, float/uint8, [0, 1/255]) issues 2023-01-11 12:17:07 +01:00
Matthias Bisping
d8542762e6 [WIP] Add augmentation pipeline to page generation 2023-01-10 17:13:39 +01:00
Matthias Bisping
caef416077 Tweak page generation 2023-01-10 16:37:35 +01:00
Matthias Bisping
a8708ffc56 [WIP] page generation for tests 2023-01-10 16:31:02 +01:00
Matthias Bisping
619f67f1fd Refactoring
Various
2023-01-09 16:51:58 +01:00
Matthias Bisping
94e9210faf Refactoring
Various
2023-01-09 11:21:43 +01:00
Matthias Bisping
dfd87cb4b0 Refactoring 2023-01-04 18:29:52 +01:00
Matthias Bisping
cd5457840b Refactoring
Various
2023-01-04 18:13:54 +01:00
Matthias Bisping
eee2f0e256 Refactoring
Rename module
2023-01-04 17:40:43 +01:00
Matthias Bisping
9d2f166fbf Refactoring
Various
2023-01-04 17:36:06 +01:00
Matthias Bisping
97fb4b645d Refactoring
Remove more code that is not adhering to separation of concerns from Rectangle class
2023-01-04 16:49:44 +01:00
Matthias Bisping
04f100386d Add dvc-data-pull fixture
This allows the tests that depend on input files to run without the need
to manuallt dvc-pull them beforehand.
2023-01-03 10:01:03 +01:00
Julius Unverfehrt
d14960da08 Pull request #35: RED-5527 update pdf2img with skip_pages_without_images flag, add ENV for this setting for table parsing
Merge in RR/cv-analysis from RED-5527 to master

Squashed commit of the following:

commit 1748095ce45f4a76e9d3d8547d9fc70f7deae2fb
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Nov 8 15:15:16 2022 +0100

    RED-5527 update pdf2img with skip_pages_without_images flag, add ENV for this setting for table parsing
2022-11-08 15:16:53 +01:00
Julius Unverfehrt
b26253120c Pull request #33: Fix response coords
Merge in RR/cv-analysis from fix-response-coords to master

Squashed commit of the following:

commit 0c6178a564b48abc43f129f81d93091a277fc64a
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Thu Oct 6 14:53:02 2022 +0200

    update tests

commit 46ad8737593df976555e4f60db8dc7947784d46d
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Thu Oct 6 14:40:25 2022 +0200

    rename script

commit f541311d0aae22d5b76ba3c2580aada662812557
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Thu Oct 6 14:40:11 2022 +0200

    response now returns natural page index, update pdf2image to correct response coordinates
2022-10-06 14:56:28 +02:00
Julius Unverfehrt
309ae0d57b Pull request #27: Image service compat
Merge in RR/cv-analysis from image-service-compat to master

Squashed commit of the following:

commit 397d12a96a6b78de762f7b3a80a72427f5f51e97
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 16 16:14:40 2022 +0200

    update pdf2image, adjust response format for table-parsing & figure-detection

commit f2061bda8d25d64de974e97f36148dea29af50d9
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Mon Aug 15 08:56:39 2022 +0200

    add script to save figure detection data that can be used for image-service pipeline script
2022-08-16 17:04:05 +02:00
Julius Unverfehrt
20267f2715 remove nested list for figure detection response 2022-08-12 12:34:21 +02:00
Julius Unverfehrt
954c2f17a5 Pull request #26: adjust response for table parsing to be per page based
Merge in RR/cv-analysis from adjust-signature-to-per-page to master

Squashed commit of the following:

commit 1142a350d537453a7ac35d97bfa00bd2a64c4871
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Fri Aug 12 11:09:04 2022 +0200

    adjust response for table parsing to be per page based
2022-08-12 11:46:24 +02:00
Julius Unverfehrt
ea25b57dd9 update pdf2image module 2022-08-10 14:17:57 +02:00
Julius Unverfehrt
59a0a61708 Pull request #25: Pdf2image
Merge in RR/cv-analysis from pdf2image to master

Squashed commit of the following:

commit 1353f54d2dceb0a79b1f81bfa2c035f5a454275a
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Wed Aug 10 09:07:31 2022 +0200

    add deRotation and transformation vie rectanglePlus

commit 51459dbf57a86e3eac66ec0da02de40dc1b68796
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 9 08:53:50 2022 +0200

    add derotation and to pdf coords transformation to cv-analysis output

commit 733991e2f5a4664205b2f7cc756cebcbc9ee3930
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Mon Aug 8 15:15:13 2022 +0200

    update pipline with detrotation logic WIP
2022-08-10 09:17:59 +02:00
Julius Unverfehrt
016abe46de Pull request #23: Add pdf2image module
Merge in RR/cv-analysis from add-pdf2image-module to master

Squashed commit of the following:

commit 13355e2dd006fae9ee05c2d00acbbc8b38fd1e8e
Merge: eaf4627 edbda58
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 13:35:27 2022 +0200

    Merge branch 'master' of ssh://git.iqser.com:2222/rr/cv-analysis into add-pdf2image-module

commit eaf462768787642889d496203034d017c4ec959b
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 13:26:58 2022 +0200

    update build scripts

commit d429c713f4e5e74afca81c2354e8125bf389b865
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 13:11:07 2022 +0200

    purge target

commit 349b81c5db724bf70d6f31b58ded2b5414216bfe
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 13:07:58 2022 +0200

    Revert "extinguish target"

    This reverts commit d2bd4cefde0648d2487839b0344509b984435273.

commit d2bd4cefde0648d2487839b0344509b984435273
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 12:57:50 2022 +0200

    extinguish target

commit 5f6cc713db31e3e16c8e7f13a59804c86b5d77d7
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 11:58:52 2022 +0200

    refactor

commit 576019378a39b580b816d9eb7957774f1faf48b9
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 11:52:04 2022 +0200

    add test for adjustesd server analysis pipeline logic

commit bdf0121929d6941cbba565055f37df7970925c79
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 11:30:17 2022 +0200

    update analysis pipline logic to use imported pdf2image

commit f7cef98d5e6d7b95517bbd047dd3e958acebb3d8
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Aug 2 11:04:34 2022 +0200

    add pdf2image as git submodule
2022-08-02 13:36:50 +02:00
Isaac Riley
9d98945ff9 Pull request #20: New pyinfra
Merge in RR/cv-analysis from new_pyinfra to master

Squashed commit of the following:

commit f7a01a90aad1c402ac537de5bdf15df628ad54df
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Wed Jul 27 10:40:59 2022 +0200

    fix typo

commit ff4d549fac5b612c2d391ae85823c5eca1e91916
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Wed Jul 27 10:34:04 2022 +0200

    adjust build scripts for new pyinfra

commit ecd70f60d46406d8b6cc7f36a1533d706c917ca8
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Wed Jul 27 09:42:55 2022 +0200

    simplify logging by using default configurations

commit 20193c14c940eed2b0a7a72058167e26064119d0
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Tue Jul 26 17:16:57 2022 +0200

    tidy-up, refactor config logic to not dependent on external files

commit d8069cd4d404a570bb04a04278161669d1c83332
Author: Isaac Riley <Isaac.Riley@iqser.com>
Date:   Tue Jul 26 15:14:59 2022 +0200

    update pyinfra

commit c3bc11037cca9baf016043ab997c566f5b4a2586
Author: Isaac Riley <Isaac.Riley@iqser.com>
Date:   Tue Jul 26 15:09:14 2022 +0200

    repair tests

commit 6f4e4f2863ee16ae056c1d432f663858c5f10221
Author: Isaac Riley <Isaac.Riley@iqser.com>
Date:   Tue Jul 26 14:52:38 2022 +0200

    updated server logic to work with new pyinfra; update scripts for pyinfra as submodule

commit 2a18dba81de5ee84d0bdf0e77f478693e8d8aef4
Author: Isaac Riley <Isaac.Riley@iqser.com>
Date:   Tue Jul 26 14:10:41 2022 +0200

    formatting

commit d87ce9328de9aa2341228af9b24473d5e583504e
Author: Isaac Riley <Isaac.Riley@iqser.com>
Date:   Tue Jul 26 14:10:11 2022 +0200

    make server logic compatible with new pyinfra
2022-07-27 10:50:10 +02:00
Isaac Riley
1618909d8e Merge branch 'master' of ssh://git.iqser.com:2222/rr/cv-analysis 2022-07-26 13:13:29 +02:00
Julius Unverfehrt
a871fa3bd3 Pull request #19: Refactor evaluate
Merge in RR/cv-analysis from refactor-evaluate to master

Squashed commit of the following:

commit cde03a492452610322f8b7d3eb804a51afb76d81
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Fri Jul 22 12:37:36 2022 +0200

    add optional show analysis metadata dict

commit fb8bb9e2afa7767f2560f865516295be65f97f20
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Fri Jul 22 12:13:18 2022 +0200

    add script to evaluate runtime per page for all cv-analysis operations for multiple PDFs

commit 721e823e2ec38aae3fea51d01e2135fc8f228d94
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Fri Jul 22 10:30:31 2022 +0200

    refactor

commit a453753cfa477e162e5902ce191ded61cb678337
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Fri Jul 22 10:19:24 2022 +0200

    add logic to transform result coordinates accordingly to page rotation, update annotation script to use this logic

commit 71c09758d0fb763a2c38c6871e1d9bf51f2e7c41
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Thu Jul 21 15:57:49 2022 +0200

    introduce pipeline for image conversion, analysis and result formatting

commit aef252a41b9658dd0c4f55aa2d9f84de933586e0
Author: Julius Unverfehrt <julius.unverfehrt@iqser.com>
Date:   Thu Jul 21 15:57:38 2022 +0200

    introduce pipeline for image conversion, analysis and result formatting
2022-07-22 15:11:40 +02:00