147 Commits

Author SHA1 Message Date
Matthias Bisping
62bfedfea8 alpha channel test fix 2022-04-13 12:06:55 +02:00
Matthias Bisping
1d88876ab1 alpha channel info WIP 2022-04-12 18:44:04 +02:00
Matthias Bisping
bbafad5561 refactoring in preparationfor alpha channel info 2022-04-12 18:22:38 +02:00
Matthias Bisping
f17a232009 tests for box validation 2022-04-12 16:54:40 +02:00
Matthias Bisping
e82a81f5c8 refactoring 2022-04-12 16:34:00 +02:00
Matthias Bisping
35c5b15e32 tolerance forwarding through pipeline constructor; box validation; tiny box filtering 2022-04-12 16:29:20 +02:00
Matthias Bisping
698e647c6f applied black 2022-04-12 15:06:18 +02:00
Matthias Bisping
d8f86d14a5 fuzzy stitching completed 2022-04-12 15:04:32 +02:00
Matthias Bisping
bb7c1be630 fuzzy stitching WIP: mostly works, but sometimes fails. run test_image_stitcher_with_gaps to debug 2022-04-11 19:20:47 +02:00
Matthias Bisping
79cd31850d fuzzy stitching WIP: added tolerance to stitching; added fuzzification function; added tests for grouping and (fuzzy and exact) 2022-04-11 16:47:47 +02:00
Matthias Bisping
3d335783dc topological sorting of definitions by caller hierarchy 2022-04-11 16:08:54 +02:00
Matthias Bisping
bb79f9dd55 applied black 2022-04-11 13:57:32 +02:00
Matthias Bisping
585cdf5c70 integrated stitching into parsable pdf extractor 2022-04-11 13:57:10 +02:00
Matthias Bisping
04cf0245ed formatting 2022-04-11 13:38:09 +02:00
Matthias Bisping
3530ef72c5 docstring update 2022-04-11 13:37:46 +02:00
Matthias Bisping
d80af336eb refactoring 2022-04-11 13:28:39 +02:00
Matthias Bisping
bcf6dc5c47 generalized split mapper 2022-04-11 13:03:02 +02:00
Matthias Bisping
f4c0547405 refactoring: replaced split mapper with dataclass 2022-04-11 12:16:42 +02:00
Matthias Bisping
1bea5fb9a8 refactoring 2022-04-11 10:29:13 +02:00
Matthias Bisping
57440f5106 refactoring 2022-04-11 09:53:32 +02:00
Matthias Bisping
710783a2f8 merging algorithm explanation adjusted 2022-04-11 09:28:00 +02:00
Matthias Bisping
887b8339a2 renaming 2022-04-08 14:17:05 +02:00
Matthias Bisping
43cb0fffed refactoring 2022-04-08 14:13:03 +02:00
Matthias Bisping
6e7645e319 topological sorting of definitions by caller hierarchy 2022-04-08 14:04:48 +02:00
Matthias Bisping
3b18fc6158 refactoring 2022-04-08 13:56:57 +02:00
Matthias Bisping
1b10445f91 refactoring 2022-04-08 12:01:20 +02:00
Matthias Bisping
5967149c49 refactoring 2022-04-07 21:49:55 +02:00
Matthias Bisping
303970db51 refactoring 2022-04-07 21:44:04 +02:00
Matthias Bisping
51793d19e9 refactoring 2022-04-07 21:39:01 +02:00
Matthias Bisping
37ee086b5d applied black 2022-04-05 17:55:38 +02:00
Matthias Bisping
2c908162f1 refactoring 2022-04-05 16:31:57 +02:00
Matthias Bisping
4756b8c9bd refactoring 2022-04-05 13:03:22 +02:00
Matthias Bisping
e0885c545a added page range paramter to extractor 2022-04-05 13:03:17 +02:00
Matthias Bisping
fdb7ebe618 logging change 2022-04-04 23:37:49 +02:00
Matthias Bisping
ce69f7d160 removed obsolete imports 2022-04-04 21:50:10 +02:00
Matthias Bisping
8f61c4cba2 doc.extract_image(xref) can yield None; hence added filtering for None images 2022-04-04 21:49:45 +02:00
Matthias Bisping
692e72b3b2 refactoring 2022-04-04 18:29:17 +02:00
Matthias Bisping
38869d52c6 refactoring 2022-04-04 18:17:49 +02:00
Matthias Bisping
ab382646b7 applied black 2022-04-03 04:47:49 +02:00
Matthias Bisping
d134884553 misc 2022-04-03 04:35:44 +02:00
Matthias Bisping
2d0545c928 refactoring 2022-04-03 04:31:50 +02:00
Matthias Bisping
65a4a8e34e refactoring 2022-04-03 04:25:10 +02:00
Matthias Bisping
39c111fd42 integrated PDFNet coordinate transformer into pipeline 2022-04-03 04:08:00 +02:00
Matthias Bisping
0376223c9d coordinate transformers refac 2022-04-03 04:00:15 +02:00
Matthias Bisping
bf85ef357c coordinate transformers version 1 completed 2022-04-03 03:51:31 +02:00
Matthias Bisping
f6a7a14a20 pdfnet coordinate transformer wip 2022-04-03 03:19:46 +02:00
Matthias Bisping
f44e6f4fd7 coordinate transformer, added Fitz transformer 2022-04-03 02:15:41 +02:00
Matthias Bisping
9663cec12d coordinate transformer wip 2022-04-03 01:54:51 +02:00
Matthias Bisping
1cf6ab256c muting logger in tests 2022-04-02 18:34:13 +02:00
Matthias Bisping
a89e374c67 removed obsolete code 2022-04-02 03:41:55 +02:00