Kilian Schuettler
1916e626df
RED-8825: general improvements
...
* classify rulings as underline/striketrough
* improve performance of CleanRulings.lineBetween
* use lineBetween where possible
* wip, still todo:
- Header/Footer by Ruling for all rotations
- actually the ticket, optimizing layoutparsing for documine
2024-04-29 17:15:19 +02:00
Kilian Schuettler
e4663ac8db
RED-8825: added split by ruling into every step of docstrum
2024-04-29 15:54:56 +02:00
Kilian Schuettler
3dd215288a
RED-8825: improve layoutparsing
...
* added improved debugging capabilities to viewer-doc
* refactored coordinates (wip)
* refactored line intersection algorithm
* removed cropbox correction from pdfbox text positions
2024-04-29 15:54:53 +02:00
Corina Olariu
4e7c3f584b
RED-8992 - Enable to add annotation on header with line breaks
...
- don't reorder textblocks classified as headers and footers
- add unit test
2024-04-25 11:23:10 +03:00
Yannik Hampe
84bdb4d1ed
Merge branch 'RED-8701' into 'main'
...
RED-8701 - Move files to customer data repositories
See merge request fforesight/layout-parser!137
2024-04-25 09:06:35 +02:00
Dominique Eifländer
8442e60055
RED-8932 Fixed not merged headline with identifier
2024-04-24 11:45:38 +02:00
Corina Olariu
0ef67fc07b
RED-8701 - Move files to customer data repositories
...
- update junit tests and syngenta submodule
2024-04-23 14:54:56 +03:00
Corina Olariu
bdcb9aeda4
RED-8701 - Move files to customer data repositories
...
- update junit tests
2024-04-23 11:49:29 +03:00
Corina Olariu
6a86036a78
Merge branch 'main' into RED-8701
2024-04-23 11:46:59 +03:00
Corina Olariu
a358d7565e
RED-8701 - Move files to customer data repositories
...
- update junit tests
2024-04-23 11:12:57 +03:00
Corina Olariu
069a6c0b49
RED-8701 - Move files to customer data repositories
...
- update syngenta submodule
2024-04-23 10:44:23 +03:00
Corina Olariu
7eab3a4088
RED-8701 - Move files to customer data repositories
...
- remove customer files from project
2024-04-22 14:57:51 +03:00
Corina Olariu
970fc99ed1
RED-8701 - Move files to customer data repositories
...
- update junit test
2024-04-22 14:14:47 +03:00
Corina Olariu
48c54f63a0
RED-8701 - Move files to customer data repositories
...
- update submodules
2024-04-22 13:57:39 +03:00
Corina Olariu
20e4e5ddff
RED-8701 - Move files to customer data repositories
...
- update unit tests with the new path to submodules for customer files
2024-04-22 13:37:27 +03:00
Dominique Eifländer
b53930328a
RED-8826: Implemented graphics detection
2024-04-19 15:05:17 +02:00
Corina Olariu
cc9816c8cb
RED-8701 - Move files to customer data repositories
...
- use git lfs to store customer files
2024-04-18 20:31:35 +03:00
yhampe
8099a00bb6
RED-8402: Header and footer are not indexed / searched
...
added unit test and file
2024-04-18 14:39:01 +02:00
Corina Olariu
319268c53d
RED-8747 - Entities not merged properly - fp
...
- update test
2024-04-09 12:24:19 +03:00
Corina Olariu
014eba9fc3
RED-8747 - Entities not merged properly - fp
...
- fix typo
- add validate table test
2024-04-09 12:14:57 +03:00
Corina Olariu
f185b13f2b
RED-8747 - Entities not merged properly - fp
...
- use the rullings from the found tables instead of all rullings as splitting rullings in the blockification service
2024-04-08 09:42:32 +03:00
Dominique Eifländer
8e7e588d26
RED-8627: Fixed scrambled text after sorting
2024-03-19 10:58:36 +01:00
Dominique Eifländer
1d765a6baa
RED-7141: Fixed more overlap problems
2024-03-14 16:30:52 +01:00
Dominique Eifländer
27aa418029
RED-7141: Fixed overlapping blocks
2024-03-13 16:14:55 +01:00
Dominique Eifländer
92fd1a72de
RED-7141: Readded lost mergeLinesInZones
2024-03-12 13:42:40 +01:00
maverickstuder
16be2467fd
RED-8715: Improve NearestNeighbor Algorithm in LayoutParser
...
* replaced the old algorithm with an algorithm based on a kd-tree
2024-03-11 14:42:28 +01:00
Timo Bejan
dfc23955d7
Linespacing claryfind
2024-03-11 11:30:51 +02:00
Dominique Eifländer
d6e3d6fe22
Clarifynd
2024-03-11 11:24:58 +02:00
Timo Bejan
65ab7a1912
CLARI-30 - forward analysis headers
2024-03-08 16:47:27 +02:00
Timo Bejan
56c07a4491
CLARI-30 - identifier fix for clarifynd
2024-03-08 16:23:27 +02:00
Dominique Eifländer
d659fe7234
RED-7141: Performance improvments
2024-03-08 10:00:52 +01:00
Timo Bejan
05523585c0
orchestrator/persistence service should control queues
2024-03-06 16:55:44 +02:00
Timo Bejan
4ced572949
orchestrator/persistence service should control queues
2024-03-06 16:53:10 +02:00
Dominique Eifländer
79239b751d
RED-7141: Implemented docstrum layout parsing
2024-03-06 11:18:40 +01:00
Maverick Studer
74f55a5cbf
RED-8550: Faulty table recognition and text duplication leads to huge sections
2024-02-28 16:13:56 +01:00
Maverick Studer
1d64028158
RED-8550: Faulty table recognition and text duplication leads to huge sections
2024-02-21 13:54:30 +01:00
yhampe
cc77d19500
RED-8481: Use visual layout parsing to detect signatures
...
addressed review comments
2024-02-15 13:01:30 +01:00
yhampe
bdf1161c91
RED-8481: Use visual layout parsing to detect signatures
...
addressed review comments
2024-02-15 12:12:23 +01:00
yhampe
b4a225144d
RED-8481: Use visual layout parsing to detect signatures
...
working on failing tests
2024-02-15 10:16:07 +01:00
yhampe
fbd0196719
RED-8481: Use visual layout parsing to detect signatures
...
implemented visuallayoutparsingresult
2024-02-14 12:16:37 +01:00
Kilian Schuettler
015984891f
RED-8156: refactor ViewerDocumentService as a dependency for ocr-service
...
* fix pmd
2024-02-06 17:17:26 +01:00
Kilian Schuettler
66fcb62833
RED-8156: refactor ViewerDocumentService as a dependency for ocr-service
...
* fix pmd
2024-02-06 17:09:21 +01:00
Kilian Schuettler
23eb0c40a3
RED-8156: refactor ViewerDocumentService as a dependency for ocr-service
...
* various improvements to experimental parsing steps
* added embed fonts functionality to viewer doc
2024-02-06 16:59:51 +01:00
Dominique Eifländer
e4f3557b36
RED-8171: Traces do not stop at @Async
2024-02-02 13:22:57 +01:00
Timo Bejan
88855de2da
Red 8085
2024-01-29 10:31:36 +01:00
Dominique Eifländer
b779c72041
RED-1137: Do not observe actuator endpoints
2023-12-20 14:05:00 +01:00
Kilian Schüttler
ba1c7c07ab
RED-7384: fixes for migration
2023-12-20 12:40:00 +01:00
Dominique Eifländer
da2cdc288e
RED-5223: Use tracing-commons from fforesight
2023-12-13 15:31:26 +01:00
Dominique Eifländer
711548d1a7
hotfix: removed dlq from response queue to be equal to persistence-service
2023-12-13 09:47:27 +01:00
Dominique Eifländer
750ccf4ce2
RED-5223: Enabled tracing, upgrade spring, use logstash-logback-encoder for json logs
2023-12-11 15:06:23 +01:00