Maverick Studer
8a80abfff1
RED-9010: remove redaction log
2024-09-19 11:34:32 +02:00
Kilian Schüttler
469da38952
Red 9974: improce headline classification, fix font size calculation
2024-09-16 14:06:48 +02:00
Kilian Schüttler
393103e074
RED-9975: improve SuperSection handling
2024-09-11 13:38:09 +02:00
maverickstuder
4a06059258
Update tenant-commons for dlq fix
2024-09-03 13:15:08 +02:00
Kilian Schüttler
8e14b74da2
Red 9975: fix outline detection
2024-09-02 09:02:36 +02:00
Kilian Schüttler
c5178ea5c2
RED-9964: don't merge tables on non-consecutive pages
2024-08-30 14:00:48 +02:00
Maverick Studer
933054b332
Tenants retry logic and queue renames
2024-08-29 13:46:54 +02:00
Maverick Studer
3b33405cbf
RED-9331: Explore possibilities for fair upload / analysis processing per tenant
2024-08-27 09:27:37 +02:00
Kilian Schüttler
69bcd4f68d
hotfix reading order
2024-08-09 11:49:12 +02:00
Timo Bejan
cdc2081785
CLARI-140 - case issue
2024-08-08 22:40:11 +03:00
Timo Bejan
5b6a706c28
CLAR-139 - fixed outline error for unparsable object
2024-08-08 16:20:14 +03:00
Maverick Studer
8c052c38d7
CLARI: document-data-markdown
2024-07-18 17:19:43 +02:00
Kilian Schüttler
2726fc3fe1
RED-8800: adjust coordinates in BE to ignore cropbox
2024-07-15 17:45:13 +02:00
Kilian Schüttler
ec0dd032c9
RED-9353: refactor PDFTronViewerDocumentService
2024-07-15 12:54:17 +02:00
Andrei Isvoran
65b1f7d179
RED-9496 - Implement graceful shutdown
2024-07-04 14:21:20 +03:00
Kilian Schuettler
e920eb5a78
CLARI-003: add treeId to StructureObject
2024-07-01 13:56:16 +02:00
Kilian Schüttler
66d3433e04
RED-9353: use azure ocr service
2024-07-01 11:13:26 +02:00
Yannik Hampe
39f527a57c
Merge branch 'main' into 'RED-3813'
...
# Conflicts:
# layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/LayoutParsingPipeline.java
2024-06-26 09:10:59 +02:00
yhampe
5c2844fe31
RED-3813: Recategorize same image as experimental feature
...
fixed failing test
2024-06-26 09:08:37 +02:00
Corina Olariu
5f5a6258c5
Merge branch 'main' into RED-9206
2024-06-05 13:34:14 +03:00
Maverick Studer
5d33ad570e
RED-7074: Design Subsection section tree structure algorithm
2024-06-05 12:28:00 +02:00
Corina Olariu
fd698a78fc
RED-9206 - Sections are no longer correctly separated from each other in the test file
...
- introduce new layout parsing type: REDACT_MANAGER_WITHOUT_DUPLICATE_PARAGRAPH to include changes from REDACT_MANAGER apart from duplicate paragraph.
- updated junit tests
-
2024-06-04 20:55:37 +03:00
Maverick Studer
fc06dba2ce
RED-7074: Design Subsection section tree structure algorithm
2024-06-04 15:07:40 +02:00
Maverick Studer
efb1a748af
RED-7074: Design Subsection section tree structure algorithm
2024-05-28 14:48:21 +02:00
Maverick Studer
48b7a22e2b
RED-7074: Design Subsection section tree structure algorithm
2024-05-24 13:30:25 +02:00
Kilian Schuettler
bcd1eb9afa
RED-8825: general layoutparsing improvements
...
* added test for table line classification
2024-05-03 00:13:48 +02:00
Kilian Schuettler
60acbac53f
RED-8825: general layoutparsing improvements
...
* fixing a bunch of coordinates
2024-05-03 00:06:29 +02:00
Kilian Schuettler
b6f0a21886
RED-8825: general layoutparsing improvements
...
* refactor all coordinates
2024-05-02 21:01:25 +02:00
Kilian Schuettler
ae46c5f1ca
RED-8825: general layoutparsing improvements
...
* fix tests
2024-04-30 11:55:18 +02:00
Kilian Schuettler
15ea385f4d
RED-8825: general improvements
...
* some more refactoring
* fixed text ruling classification for vertical text
* shrunk min graphics size
2024-04-30 10:44:32 +02:00
Kilian Schuettler
08be18db2d
RED-8825: general improvements
...
* some more refactoring
2024-04-29 20:09:53 +02:00
Kilian Schuettler
1916e626df
RED-8825: general improvements
...
* classify rulings as underline/striketrough
* improve performance of CleanRulings.lineBetween
* use lineBetween where possible
* wip, still todo:
- Header/Footer by Ruling for all rotations
- actually the ticket, optimizing layoutparsing for documine
2024-04-29 17:15:19 +02:00
Kilian Schuettler
e4663ac8db
RED-8825: added split by ruling into every step of docstrum
2024-04-29 15:54:56 +02:00
Kilian Schuettler
3dd215288a
RED-8825: improve layoutparsing
...
* added improved debugging capabilities to viewer-doc
* refactored coordinates (wip)
* refactored line intersection algorithm
* removed cropbox correction from pdfbox text positions
2024-04-29 15:54:53 +02:00
Corina Olariu
4e7c3f584b
RED-8992 - Enable to add annotation on header with line breaks
...
- don't reorder textblocks classified as headers and footers
- add unit test
2024-04-25 11:23:10 +03:00
Yannik Hampe
84bdb4d1ed
Merge branch 'RED-8701' into 'main'
...
RED-8701 - Move files to customer data repositories
See merge request fforesight/layout-parser!137
2024-04-25 09:06:35 +02:00
Dominique Eifländer
8442e60055
RED-8932 Fixed not merged headline with identifier
2024-04-24 11:45:38 +02:00
Corina Olariu
0ef67fc07b
RED-8701 - Move files to customer data repositories
...
- update junit tests and syngenta submodule
2024-04-23 14:54:56 +03:00
Corina Olariu
bdcb9aeda4
RED-8701 - Move files to customer data repositories
...
- update junit tests
2024-04-23 11:49:29 +03:00
Corina Olariu
6a86036a78
Merge branch 'main' into RED-8701
2024-04-23 11:46:59 +03:00
Corina Olariu
a358d7565e
RED-8701 - Move files to customer data repositories
...
- update junit tests
2024-04-23 11:12:57 +03:00
Corina Olariu
069a6c0b49
RED-8701 - Move files to customer data repositories
...
- update syngenta submodule
2024-04-23 10:44:23 +03:00
Corina Olariu
7eab3a4088
RED-8701 - Move files to customer data repositories
...
- remove customer files from project
2024-04-22 14:57:51 +03:00
Corina Olariu
970fc99ed1
RED-8701 - Move files to customer data repositories
...
- update junit test
2024-04-22 14:14:47 +03:00
Corina Olariu
48c54f63a0
RED-8701 - Move files to customer data repositories
...
- update submodules
2024-04-22 13:57:39 +03:00
Corina Olariu
20e4e5ddff
RED-8701 - Move files to customer data repositories
...
- update unit tests with the new path to submodules for customer files
2024-04-22 13:37:27 +03:00
Dominique Eifländer
b53930328a
RED-8826: Implemented graphics detection
2024-04-19 15:05:17 +02:00
Corina Olariu
cc9816c8cb
RED-8701 - Move files to customer data repositories
...
- use git lfs to store customer files
2024-04-18 20:31:35 +03:00
yhampe
8099a00bb6
RED-8402: Header and footer are not indexed / searched
...
added unit test and file
2024-04-18 14:39:01 +02:00
Corina Olariu
319268c53d
RED-8747 - Entities not merged properly - fp
...
- update test
2024-04-09 12:24:19 +03:00