36 Commits

Author SHA1 Message Date
Matthias Bisping
75748a1d82 refactoring 2022-04-25 11:19:26 +02:00
Matthias Bisping
26ef5fce8a Pull request #8: Pipeline refactoring
Merge in RR/image-prediction from pipeline_refactoring to tdd_refactoring

Squashed commit of the following:

commit 6989fcb3313007b7eecf4bba39077fcde6924a9a
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Mon Apr 25 09:49:49 2022 +0200

    removed obsolete module

commit 7428aeee37b11c31cffa597c85b018ba71e79a1d
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Mon Apr 25 09:45:45 2022 +0200

    refactoring

commit 0dcd3894154fdf34bd3ba4ef816362434474f472
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Mon Apr 25 08:57:21 2022 +0200

    refactoring; removed obsolete extractor-classifier

commit 1078aa81144f4219149b3fcacdae8b09c4b905c0
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Fri Apr 22 17:18:10 2022 +0200

    removed obsolete imports

commit 71f61fc5fc915da3941cf5ed5d9cc90fccc49031
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Fri Apr 22 17:16:25 2022 +0200

    comment changed

commit b582726cd1de233edb55c5a76c91e99f9dd3bd13
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Fri Apr 22 17:12:11 2022 +0200

    refactoring

commit 8abc9010048078868b235d6793ac6c8b20abb985
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Thu Apr 21 21:25:47 2022 +0200

    formatting

commit 2c87c419fe3185a25c27139e7fcf79f60971ad24
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Thu Apr 21 21:24:05 2022 +0200

    formatting

commit 50b161192db43a84464125c6d79650225e1010d6
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Thu Apr 21 21:20:18 2022 +0200

    refactoring

commit 9a1446cccfa070852a5d9c0bdbc36037b82541fc
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Thu Apr 21 21:04:57 2022 +0200

    refactoring

commit 6c10b55ff8e61412cb2fe5a5625e660ecaf1d7d1
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Thu Apr 21 19:48:05 2022 +0200

    refactoring

commit 72e785e3e31c132ab352119e9921725f91fac9e2
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Thu Apr 21 19:43:39 2022 +0200

    refactoring

commit f036ee55e6747daf31e3929bdc2d93dc5f2a56ca
Author: Matthias Bisping <matthias.bisping@iqser.com>
Date:   Wed Apr 20 18:30:41 2022 +0200

    refactoring pipeline WIP
2022-04-25 10:08:49 +02:00
Matthias Bisping
b79bfd7db7 containerized tests 2022-04-19 17:58:19 +02:00
Matthias Bisping
1fcec06e91 batch size and progressbar message forwarding 2022-04-19 15:09:14 +02:00
Matthias Bisping
5df3d1e85b return -> yield from 2022-04-19 14:29:02 +02:00
Matthias Bisping
2f37d20f68 removed obsolete imports 2022-04-19 14:21:26 +02:00
Matthias Bisping
7632fa8d7e corrected image set comparison; refactoring 2022-04-19 14:20:54 +02:00
Matthias Bisping
03e7b00cfd refactoring 2022-04-14 12:20:05 +02:00
Matthias Bisping
7aee00cb49 alpha channel querying improved 2022-04-13 17:31:33 +02:00
Matthias Bisping
f5881f2229 formatting 2022-04-13 13:06:20 +02:00
Matthias Bisping
62bfedfea8 alpha channel test fix 2022-04-13 12:06:55 +02:00
Matthias Bisping
1d88876ab1 alpha channel info WIP 2022-04-12 18:44:04 +02:00
Matthias Bisping
bbafad5561 refactoring in preparationfor alpha channel info 2022-04-12 18:22:38 +02:00
Matthias Bisping
e82a81f5c8 refactoring 2022-04-12 16:34:00 +02:00
Matthias Bisping
35c5b15e32 tolerance forwarding through pipeline constructor; box validation; tiny box filtering 2022-04-12 16:29:20 +02:00
Matthias Bisping
3d335783dc topological sorting of definitions by caller hierarchy 2022-04-11 16:08:54 +02:00
Matthias Bisping
bb79f9dd55 applied black 2022-04-11 13:57:32 +02:00
Matthias Bisping
585cdf5c70 integrated stitching into parsable pdf extractor 2022-04-11 13:57:10 +02:00
Matthias Bisping
d80af336eb refactoring 2022-04-11 13:28:39 +02:00
Matthias Bisping
37ee086b5d applied black 2022-04-05 17:55:38 +02:00
Matthias Bisping
2c908162f1 refactoring 2022-04-05 16:31:57 +02:00
Matthias Bisping
4756b8c9bd refactoring 2022-04-05 13:03:22 +02:00
Matthias Bisping
e0885c545a added page range paramter to extractor 2022-04-05 13:03:17 +02:00
Matthias Bisping
ce69f7d160 removed obsolete imports 2022-04-04 21:50:10 +02:00
Matthias Bisping
8f61c4cba2 doc.extract_image(xref) can yield None; hence added filtering for None images 2022-04-04 21:49:45 +02:00
Matthias Bisping
5c23898280 added log messages to all pipelien components; converting pipelien output to list for REST transport; refactoring; added e2e test (flask + pipeline)... but hangs 2022-04-02 02:44:30 +02:00
Matthias Bisping
91dd467142 applied black 2022-03-30 19:38:15 +02:00
Matthias Bisping
258c1ab02d testing laberl mappers for raising of excpetions when encountering unexpected input formats 2022-03-30 18:15:45 +02:00
Matthias Bisping
45a07c620a fixed chaining bug that lead to greedy evaluation 2022-03-30 00:53:34 +02:00
Matthias Bisping
ade318c7b7 made classifier accept tupls of images in addition to np.arrays; added pipeline (wip) 2022-03-29 22:00:34 +02:00
Matthias Bisping
7340fb6dda replaced string keys for metadata fields with enum members 2022-03-29 20:29:44 +02:00
Matthias Bisping
e818b05472 applied black 2022-03-28 16:39:34 +02:00
Matthias Bisping
b818ee4724 fixed misaligned metadata and images 2022-03-28 16:38:46 +02:00
Julius Unverfehrt
9461be29d5 add ParsablePDFImageExtractor test 2022-03-28 15:42:54 +02:00
Matthias Bisping
643ab99bd3 added parsable pdf image extractor 2022-03-28 11:27:05 +02:00
Matthias Bisping
a5147c9a58 added image extractor interface and mock 2022-03-27 23:05:27 +02:00