379 Commits

Author SHA1 Message Date
maverickstuder
1856fed640 RED-7074: Design Subsection section tree structure algorithm
* improved merging of headlines as well as splitting logic so that more headlines are detected correctly
2024-05-14 17:41:44 +02:00
maverickstuder
2fcaeb3d8c RED-7074: Design Subsection section tree structure algorithm
* added supersection and changed logic so that each normal section only contains leaf nodes
* added SectionIdentifier logic for headline splitting and merging
* fixed many edge cases which resulted in error state files
2024-05-14 10:51:05 +02:00
maverickstuder
4e07ba4ff1 RED-7074: Design Subsection section tree structure algorithm
* import optimized
2024-05-08 14:16:29 +02:00
maverickstuder
cfb6f0acfa RED-7074: Design Subsection section tree structure algorithm
* lots of refactoring to splitting logic for text blocks which resulted in some empty blocks to be created which can then not be localized (i.e. by containsBlock)
2024-05-08 14:15:27 +02:00
maverickstuder
a9338262c5 RED-7074: Design Subsection section tree structure algorithm
* fix for boundary error
2024-05-07 15:51:54 +02:00
maverickstuder
d2dc369df3 RED-7074: Design Subsection section tree structure algorithm
* temp
2024-05-07 14:25:54 +02:00
maverickstuder
f7aeb9a406 RED-7074: Design Subsection section tree structure algorithm
* refactoring
2024-05-02 10:36:36 +02:00
maverickstuder
9bf2f5c56c Merge remote-tracking branch 'origin/RED-7074' into RED-7074
# Conflicts:
#	layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/LayoutParsingPipeline.java
#	layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/model/ClassificationDocument.java
#	layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/model/outline/OutlineValidationService.java
#	layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/model/outline/TableOfContentItem.java
#	layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/model/outline/TableOfContents.java
#	layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/services/classification/RedactManagerClassificationService.java
#	layoutparser-service/layoutparser-service-server/src/test/java/com/knecon/fforesight/service/layoutparser/server/graph/ViewerDocumentTest.java
#	layoutparser-service/layoutparser-service-server/src/test/resources/files/new/UTT-Books-53.pdf
2024-04-30 14:44:26 +02:00
maverickstuder
c071a133e6 RED-7074: Design Subsection section tree structure algorithm
* added toc enrichment logic and changed section computation to build upon created toc
2024-04-30 14:41:17 +02:00
maverickstuder
9f9ea68706 RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-29 15:00:49 +02:00
maverickstuder
85e3cf0ecc RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-29 15:00:49 +02:00
maverickstuder
17756f5977 RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-29 15:00:48 +02:00
maverickstuder
59d9d6c3e6 RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-29 15:00:34 +02:00
maverickstuder
c888746761 RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-29 15:00:34 +02:00
maverickstuder
7279d0a870 RED-7074: Design Subsection section tree structure algorithm
* first draft
2024-04-29 15:00:34 +02:00
maverickstuder
c84a199f9d RED-7074: Design Subsection section tree structure algorithm
* first draft
2024-04-29 15:00:32 +02:00
Kilian Schüttler
6fb1a0bef3 Merge branch 'RED-8992' into 'main'
RED-8992 - Enable to add annotation on header with line breaks

See merge request fforesight/layout-parser!143
0.121.0
2024-04-25 13:03:40 +02:00
Corina Olariu
4e7c3f584b RED-8992 - Enable to add annotation on header with line breaks
- don't reorder textblocks classified as headers and footers
- add unit test
2024-04-25 11:23:10 +03:00
Yannik Hampe
84bdb4d1ed Merge branch 'RED-8701' into 'main'
RED-8701 - Move files to customer data repositories

See merge request fforesight/layout-parser!137
0.120.0
2024-04-25 09:06:35 +02:00
Dominique Eifländer
75ab4df592 Merge branch 'RED-8932' into 'main'
RED-8932 Fixed not merged headline with identifier

See merge request fforesight/layout-parser!141
0.119.0
2024-04-24 11:55:01 +02:00
Dominique Eifländer
8442e60055 RED-8932 Fixed not merged headline with identifier 2024-04-24 11:45:38 +02:00
Corina Olariu
0ef67fc07b RED-8701 - Move files to customer data repositories
- update junit tests and syngenta submodule
2024-04-23 14:54:56 +03:00
Corina Olariu
ea02f31a84 Merge branch 'main' into RED-8701
# Conflicts:
#	layoutparser-service/layoutparser-service-server/src/test/java/com/knecon/fforesight/service/layoutparser/server/graph/ViewerDocumentTest.java
2024-04-23 14:20:00 +03:00
Dominique Eifländer
58acbab85f Merge branch 'RED-8826' into 'main'
Red 8826

See merge request fforesight/layout-parser!138
0.116.0
2024-04-23 13:12:51 +02:00
Kilian Schüttler
d38d023485 Merge branch 'RED-7384' into 'main'
Red 7384

See merge request fforesight/layout-parser!140
0.115.0
2024-04-23 12:13:21 +02:00
Kilian Schüttler
c1afe9b11f Red 7384 2024-04-23 12:13:19 +02:00
Corina Olariu
bdcb9aeda4 RED-8701 - Move files to customer data repositories
- update junit tests
2024-04-23 11:49:29 +03:00
Corina Olariu
6a86036a78 Merge branch 'main' into RED-8701 2024-04-23 11:46:59 +03:00
Corina Olariu
a358d7565e RED-8701 - Move files to customer data repositories
- update junit tests
2024-04-23 11:12:57 +03:00
Corina Olariu
069a6c0b49 RED-8701 - Move files to customer data repositories
- update syngenta submodule
2024-04-23 10:44:23 +03:00
Dominique Eifländer
683f7f1fb8 RED-8826: Do not classify textblocks in graphics as headlines 2024-04-23 09:28:28 +02:00
Corina Olariu
7eab3a4088 RED-8701 - Move files to customer data repositories
- remove customer files from project
2024-04-22 14:57:51 +03:00
Corina Olariu
970fc99ed1 RED-8701 - Move files to customer data repositories
- update junit test
2024-04-22 14:14:47 +03:00
Corina Olariu
48c54f63a0 RED-8701 - Move files to customer data repositories
- update submodules
2024-04-22 13:57:39 +03:00
Corina Olariu
20e4e5ddff RED-8701 - Move files to customer data repositories
- update unit tests with the new path to submodules for customer files
2024-04-22 13:37:27 +03:00
Dominique Eifländer
b53930328a RED-8826: Implemented graphics detection 2024-04-19 15:05:17 +02:00
maverickstuder
09148960cf RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-19 11:31:34 +02:00
Dominique Eifländer
c947d552d2 Merge branch 'RED-8995-fp' into 'main'
RED-8995: unclassified text might be missing from document data

See merge request fforesight/layout-parser!135
0.114.0
2024-04-19 09:21:50 +02:00
Corina Olariu
6b1b5eab84 RED-8701 - Move files to customer data repositories
- add syngenta submodule
2024-04-18 20:33:00 +03:00
Corina Olariu
cc9816c8cb RED-8701 - Move files to customer data repositories
- use git lfs to store customer files
2024-04-18 20:31:35 +03:00
maverickstuder
77ee8dd5bd RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-18 17:52:33 +02:00
Kilian Schuettler
f256f9b30f RED-8995: unclassified text might be missing from document data
* treat TablePageBlock.OTHER like PARAGRAPH (no special treatment)
2024-04-18 17:42:34 +02:00
Yannik Hampe
6167e3fb57 Merge branch 'RED-8402' into 'main'
RED-8402: Header and footer are not indexed / searched

See merge request fforesight/layout-parser!134
0.113.0
2024-04-18 15:08:00 +02:00
yhampe
a78fb0244a Merge remote-tracking branch 'origin/RED-8402' into RED-8402 2024-04-18 14:39:10 +02:00
yhampe
8099a00bb6 RED-8402: Header and footer are not indexed / searched
added unit test and file
2024-04-18 14:39:01 +02:00
yhampe
9bb0468b2b RED-8402: Header and footer are not indexed / searched
added unit test and file
2024-04-18 14:36:25 +02:00
maverickstuder
e9d1bdc94f RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-17 14:31:48 +02:00
maverickstuder
894355c7cd RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-16 12:35:26 +02:00
maverickstuder
ca35feeb63 RED-7074: Design Subsection section tree structure algorithm
* first draft: further implementations
2024-04-15 16:43:40 +02:00
maverickstuder
a32a43fc62 RED-7074: Design Subsection section tree structure algorithm
* first draft
2024-04-10 12:28:42 +02:00