Kilian Schuettler
8005c1f25f
RED-9139: move document to module in redaction-service
...
* add feature version
2024-11-14 16:39:48 +01:00
Kilian Schuettler
84b054a4cc
RED-9139: move document to module in redaction-service
...
* add feature version
2024-11-14 16:39:48 +01:00
Kilian Schuettler
905b65a5fa
RED-9139: move document to module in redaction-service
...
* add feature version
2024-11-14 16:39:48 +01:00
Kilian Schuettler
7617c1f308
RED-9139: move document to module in redaction-service
...
* add feature version
2024-11-14 16:39:48 +01:00
Kilian Schuettler
96acefed78
RED-9139: move document to module in redaction-service
...
* add TableOfContents node
2024-11-14 16:39:48 +01:00
Kilian Schuettler
7f472ccc52
RED-9139: move document to module in redaction-service
...
* add TableOfContents node
2024-11-14 16:39:48 +01:00
Kilian Schuettler
6e04c15f3d
RED-9139: add new TableOfContents Node
...
* rename previous TableOfContent to SectionTree
* added protobuf compile script
2024-11-14 16:39:48 +01:00
Kilian Schüttler
7ee1f9e360
RED-9139: more robust TOC detection
2024-11-13 10:54:39 +01:00
Kilian Schüttler
af05218e37
RED-10127: rename TextPositionSequence to Word
2024-10-18 12:20:15 +02:00
Kilian Schüttler
7b073eb4f3
RED-10127: add list classification
2024-10-10 10:50:10 +02:00
Kilian Schüttler
6c7442ac6d
RED-10127: improve headline detection
2024-10-09 08:48:48 +02:00
Maverick Studer
fe2ed1807e
RED-9123: Improve performance of re-analysis (Spike)
2024-10-07 12:28:10 +02:00
Maverick Studer
8a80abfff1
RED-9010: remove redaction log
2024-09-19 11:34:32 +02:00
Kilian Schüttler
469da38952
Red 9974: improce headline classification, fix font size calculation
2024-09-16 14:06:48 +02:00
Kilian Schüttler
393103e074
RED-9975: improve SuperSection handling
2024-09-11 13:38:09 +02:00
maverickstuder
4a06059258
Update tenant-commons for dlq fix
2024-09-03 13:15:08 +02:00
Kilian Schüttler
8e14b74da2
Red 9975: fix outline detection
2024-09-02 09:02:36 +02:00
Kilian Schüttler
c5178ea5c2
RED-9964: don't merge tables on non-consecutive pages
2024-08-30 14:00:48 +02:00
Maverick Studer
933054b332
Tenants retry logic and queue renames
2024-08-29 13:46:54 +02:00
Maverick Studer
3b33405cbf
RED-9331: Explore possibilities for fair upload / analysis processing per tenant
2024-08-27 09:27:37 +02:00
Kilian Schüttler
69bcd4f68d
hotfix reading order
2024-08-09 11:49:12 +02:00
Timo Bejan
cdc2081785
CLARI-140 - case issue
2024-08-08 22:40:11 +03:00
Timo Bejan
5b6a706c28
CLAR-139 - fixed outline error for unparsable object
2024-08-08 16:20:14 +03:00
Maverick Studer
8c052c38d7
CLARI: document-data-markdown
2024-07-18 17:19:43 +02:00
Kilian Schüttler
2726fc3fe1
RED-8800: adjust coordinates in BE to ignore cropbox
2024-07-15 17:45:13 +02:00
Kilian Schüttler
ec0dd032c9
RED-9353: refactor PDFTronViewerDocumentService
2024-07-15 12:54:17 +02:00
Andrei Isvoran
65b1f7d179
RED-9496 - Implement graceful shutdown
2024-07-04 14:21:20 +03:00
Kilian Schuettler
e920eb5a78
CLARI-003: add treeId to StructureObject
2024-07-01 13:56:16 +02:00
Kilian Schüttler
66d3433e04
RED-9353: use azure ocr service
2024-07-01 11:13:26 +02:00
Yannik Hampe
39f527a57c
Merge branch 'main' into 'RED-3813'
...
# Conflicts:
# layoutparser-service/layoutparser-service-processor/src/main/java/com/knecon/fforesight/service/layoutparser/processor/LayoutParsingPipeline.java
2024-06-26 09:10:59 +02:00
yhampe
5c2844fe31
RED-3813: Recategorize same image as experimental feature
...
fixed failing test
2024-06-26 09:08:37 +02:00
Corina Olariu
5f5a6258c5
Merge branch 'main' into RED-9206
2024-06-05 13:34:14 +03:00
Maverick Studer
5d33ad570e
RED-7074: Design Subsection section tree structure algorithm
2024-06-05 12:28:00 +02:00
Corina Olariu
fd698a78fc
RED-9206 - Sections are no longer correctly separated from each other in the test file
...
- introduce new layout parsing type: REDACT_MANAGER_WITHOUT_DUPLICATE_PARAGRAPH to include changes from REDACT_MANAGER apart from duplicate paragraph.
- updated junit tests
-
2024-06-04 20:55:37 +03:00
Maverick Studer
fc06dba2ce
RED-7074: Design Subsection section tree structure algorithm
2024-06-04 15:07:40 +02:00
Maverick Studer
efb1a748af
RED-7074: Design Subsection section tree structure algorithm
2024-05-28 14:48:21 +02:00
Maverick Studer
48b7a22e2b
RED-7074: Design Subsection section tree structure algorithm
2024-05-24 13:30:25 +02:00
Kilian Schuettler
bcd1eb9afa
RED-8825: general layoutparsing improvements
...
* added test for table line classification
2024-05-03 00:13:48 +02:00
Kilian Schuettler
60acbac53f
RED-8825: general layoutparsing improvements
...
* fixing a bunch of coordinates
2024-05-03 00:06:29 +02:00
Kilian Schuettler
b6f0a21886
RED-8825: general layoutparsing improvements
...
* refactor all coordinates
2024-05-02 21:01:25 +02:00
Kilian Schuettler
ae46c5f1ca
RED-8825: general layoutparsing improvements
...
* fix tests
2024-04-30 11:55:18 +02:00
Kilian Schuettler
15ea385f4d
RED-8825: general improvements
...
* some more refactoring
* fixed text ruling classification for vertical text
* shrunk min graphics size
2024-04-30 10:44:32 +02:00
Kilian Schuettler
08be18db2d
RED-8825: general improvements
...
* some more refactoring
2024-04-29 20:09:53 +02:00
Kilian Schuettler
1916e626df
RED-8825: general improvements
...
* classify rulings as underline/striketrough
* improve performance of CleanRulings.lineBetween
* use lineBetween where possible
* wip, still todo:
- Header/Footer by Ruling for all rotations
- actually the ticket, optimizing layoutparsing for documine
2024-04-29 17:15:19 +02:00
Kilian Schuettler
e4663ac8db
RED-8825: added split by ruling into every step of docstrum
2024-04-29 15:54:56 +02:00
Kilian Schuettler
3dd215288a
RED-8825: improve layoutparsing
...
* added improved debugging capabilities to viewer-doc
* refactored coordinates (wip)
* refactored line intersection algorithm
* removed cropbox correction from pdfbox text positions
2024-04-29 15:54:53 +02:00
Corina Olariu
4e7c3f584b
RED-8992 - Enable to add annotation on header with line breaks
...
- don't reorder textblocks classified as headers and footers
- add unit test
2024-04-25 11:23:10 +03:00
Yannik Hampe
84bdb4d1ed
Merge branch 'RED-8701' into 'main'
...
RED-8701 - Move files to customer data repositories
See merge request fforesight/layout-parser!137
2024-04-25 09:06:35 +02:00
Dominique Eifländer
8442e60055
RED-8932 Fixed not merged headline with identifier
2024-04-24 11:45:38 +02:00
Corina Olariu
0ef67fc07b
RED-8701 - Move files to customer data repositories
...
- update junit tests and syngenta submodule
2024-04-23 14:54:56 +03:00