135 Commits

Author SHA1 Message Date
Matthias Bisping
585cdf5c70 integrated stitching into parsable pdf extractor 2022-04-11 13:57:10 +02:00
Matthias Bisping
04cf0245ed formatting 2022-04-11 13:38:09 +02:00
Matthias Bisping
3530ef72c5 docstring update 2022-04-11 13:37:46 +02:00
Matthias Bisping
d80af336eb refactoring 2022-04-11 13:28:39 +02:00
Matthias Bisping
bcf6dc5c47 generalized split mapper 2022-04-11 13:03:02 +02:00
Matthias Bisping
f4c0547405 refactoring: replaced split mapper with dataclass 2022-04-11 12:16:42 +02:00
Matthias Bisping
1bea5fb9a8 refactoring 2022-04-11 10:29:13 +02:00
Matthias Bisping
57440f5106 refactoring 2022-04-11 09:53:32 +02:00
Matthias Bisping
710783a2f8 merging algorithm explanation adjusted 2022-04-11 09:28:00 +02:00
Matthias Bisping
887b8339a2 renaming 2022-04-08 14:17:05 +02:00
Matthias Bisping
43cb0fffed refactoring 2022-04-08 14:13:03 +02:00
Matthias Bisping
6e7645e319 topological sorting of definitions by caller hierarchy 2022-04-08 14:04:48 +02:00
Matthias Bisping
3b18fc6158 refactoring 2022-04-08 13:56:57 +02:00
Matthias Bisping
1b10445f91 refactoring 2022-04-08 12:01:20 +02:00
Matthias Bisping
5967149c49 refactoring 2022-04-07 21:49:55 +02:00
Matthias Bisping
303970db51 refactoring 2022-04-07 21:44:04 +02:00
Matthias Bisping
51793d19e9 refactoring 2022-04-07 21:39:01 +02:00
Matthias Bisping
37ee086b5d applied black 2022-04-05 17:55:38 +02:00
Matthias Bisping
2c908162f1 refactoring 2022-04-05 16:31:57 +02:00
Matthias Bisping
4756b8c9bd refactoring 2022-04-05 13:03:22 +02:00
Matthias Bisping
e0885c545a added page range paramter to extractor 2022-04-05 13:03:17 +02:00
Matthias Bisping
fdb7ebe618 logging change 2022-04-04 23:37:49 +02:00
Matthias Bisping
ce69f7d160 removed obsolete imports 2022-04-04 21:50:10 +02:00
Matthias Bisping
8f61c4cba2 doc.extract_image(xref) can yield None; hence added filtering for None images 2022-04-04 21:49:45 +02:00
Matthias Bisping
692e72b3b2 refactoring 2022-04-04 18:29:17 +02:00
Matthias Bisping
38869d52c6 refactoring 2022-04-04 18:17:49 +02:00
Matthias Bisping
ab382646b7 applied black 2022-04-03 04:47:49 +02:00
Matthias Bisping
d134884553 misc 2022-04-03 04:35:44 +02:00
Matthias Bisping
2d0545c928 refactoring 2022-04-03 04:31:50 +02:00
Matthias Bisping
65a4a8e34e refactoring 2022-04-03 04:25:10 +02:00
Matthias Bisping
39c111fd42 integrated PDFNet coordinate transformer into pipeline 2022-04-03 04:08:00 +02:00
Matthias Bisping
0376223c9d coordinate transformers refac 2022-04-03 04:00:15 +02:00
Matthias Bisping
bf85ef357c coordinate transformers version 1 completed 2022-04-03 03:51:31 +02:00
Matthias Bisping
f6a7a14a20 pdfnet coordinate transformer wip 2022-04-03 03:19:46 +02:00
Matthias Bisping
f44e6f4fd7 coordinate transformer, added Fitz transformer 2022-04-03 02:15:41 +02:00
Matthias Bisping
9663cec12d coordinate transformer wip 2022-04-03 01:54:51 +02:00
Matthias Bisping
1cf6ab256c muting logger in tests 2022-04-02 18:34:13 +02:00
Matthias Bisping
a89e374c67 removed obsolete code 2022-04-02 03:41:55 +02:00
Matthias Bisping
0861e22542 fixed pipeline not working with flask... model was loaded in external process, probably; known issue 2022-04-02 03:38:44 +02:00
Matthias Bisping
7827869af4 fixed logger's logging level 2022-04-02 02:58:30 +02:00
Matthias Bisping
613bba8cfc ... 2022-04-02 02:45:21 +02:00
Matthias Bisping
5c23898280 added log messages to all pipelien components; converting pipelien output to list for REST transport; refactoring; added e2e test (flask + pipeline)... but hangs 2022-04-02 02:44:30 +02:00
Matthias Bisping
e8d0299e46 refactoring 2022-04-02 01:27:30 +02:00
Matthias Bisping
cb00aed62c refactoring 2022-04-02 01:23:57 +02:00
Matthias Bisping
1501653673 coverage increased for flask tests 2022-04-02 00:16:01 +02:00
Matthias Bisping
b4b929b65f added mocked server tests with flask testing uitilities 2022-04-01 21:55:59 +02:00
Matthias Bisping
3d1c251e10 removed redundant TF env var export 2022-04-01 21:35:10 +02:00
Matthias Bisping
c80549d5d3 refactoring: model wrapper to base class and derived class for efficient net 2022-04-01 21:32:18 +02:00
Matthias Bisping
070749880e removed obsolete code 2022-04-01 21:13:15 +02:00
Matthias Bisping
94783c54f2 eliminated redai dependency; updated requirement versions 2022-04-01 21:10:41 +02:00