Kilian Schuettler
abb249e966
RED-8825: general layoutparsing improvements
...
* fix checkstyle
2024-05-03 00:15:31 +02:00
Kilian Schuettler
bcd1eb9afa
RED-8825: general layoutparsing improvements
...
* added test for table line classification
2024-05-03 00:13:48 +02:00
Kilian Schuettler
60acbac53f
RED-8825: general layoutparsing improvements
...
* fixing a bunch of coordinates
2024-05-03 00:06:29 +02:00
Kilian Schuettler
a3decd292d
RED-8825: general layoutparsing improvements
...
* fix RulingCleaningService
2024-05-02 23:00:22 +02:00
Kilian Schuettler
b6f0a21886
RED-8825: general layoutparsing improvements
...
* refactor all coordinates
2024-05-02 21:01:25 +02:00
Kilian Schuettler
d61cac8b4f
RED-8825: general layoutparsing improvements
...
* fix tests
2024-04-30 16:06:22 +02:00
Kilian Schuettler
ae46c5f1ca
RED-8825: general layoutparsing improvements
...
* fix tests
2024-04-30 11:55:18 +02:00
Kilian Schuettler
f0a70a5242
RED-8825: general improvements
...
* some more refactoring
* fixed text ruling classification for vertical text
* shrunk min graphics size
2024-04-30 11:09:23 +02:00
Kilian Schuettler
15ea385f4d
RED-8825: general improvements
...
* some more refactoring
* fixed text ruling classification for vertical text
* shrunk min graphics size
2024-04-30 10:44:32 +02:00
Kilian Schuettler
08be18db2d
RED-8825: general improvements
...
* some more refactoring
2024-04-29 20:09:53 +02:00
Kilian Schuettler
64209255cb
RED-8825: general improvements
...
* classify rulings as underline/striketrough
* improve performance of CleanRulings.lineBetween
* use lineBetween where possible
* wip, still todo:
- Header/Footer by Ruling for all rotations
- actually the ticket, optimizing layoutparsing for documine
2024-04-29 17:24:15 +02:00
Kilian Schuettler
4761d2e1a2
RED-8825: general improvements
...
* classify rulings as underline/striketrough
* improve performance of CleanRulings.lineBetween
* use lineBetween where possible
* wip, still todo:
- Header/Footer by Ruling for all rotations
- actually the ticket, optimizing layoutparsing for documine
2024-04-29 17:22:33 +02:00
Kilian Schuettler
1916e626df
RED-8825: general improvements
...
* classify rulings as underline/striketrough
* improve performance of CleanRulings.lineBetween
* use lineBetween where possible
* wip, still todo:
- Header/Footer by Ruling for all rotations
- actually the ticket, optimizing layoutparsing for documine
2024-04-29 17:15:19 +02:00
Kilian Schuettler
e4663ac8db
RED-8825: added split by ruling into every step of docstrum
2024-04-29 15:54:56 +02:00
Kilian Schuettler
6a691183dc
RED-8825: improve layoutparsing
...
* added improved debugging capabilities to viewer-doc
* refactored coordinates (wip)
* refactored line intersection algorithm
* removed cropbox correction from pdfbox text positions
2024-04-29 15:54:56 +02:00
Kilian Schuettler
3dd215288a
RED-8825: improve layoutparsing
...
* added improved debugging capabilities to viewer-doc
* refactored coordinates (wip)
* refactored line intersection algorithm
* removed cropbox correction from pdfbox text positions
2024-04-29 15:54:53 +02:00
Kilian Schüttler
6fb1a0bef3
Merge branch 'RED-8992' into 'main'
...
RED-8992 - Enable to add annotation on header with line breaks
See merge request fforesight/layout-parser!143
0.121.0
2024-04-25 13:03:40 +02:00
Corina Olariu
4e7c3f584b
RED-8992 - Enable to add annotation on header with line breaks
...
- don't reorder textblocks classified as headers and footers
- add unit test
2024-04-25 11:23:10 +03:00
Yannik Hampe
84bdb4d1ed
Merge branch 'RED-8701' into 'main'
...
RED-8701 - Move files to customer data repositories
See merge request fforesight/layout-parser!137
0.120.0
2024-04-25 09:06:35 +02:00
Dominique Eifländer
75ab4df592
Merge branch 'RED-8932' into 'main'
...
RED-8932 Fixed not merged headline with identifier
See merge request fforesight/layout-parser!141
0.119.0
2024-04-24 11:55:01 +02:00
Dominique Eifländer
8442e60055
RED-8932 Fixed not merged headline with identifier
2024-04-24 11:45:38 +02:00
Corina Olariu
0ef67fc07b
RED-8701 - Move files to customer data repositories
...
- update junit tests and syngenta submodule
2024-04-23 14:54:56 +03:00
Corina Olariu
ea02f31a84
Merge branch 'main' into RED-8701
...
# Conflicts:
# layoutparser-service/layoutparser-service-server/src/test/java/com/knecon/fforesight/service/layoutparser/server/graph/ViewerDocumentTest.java
2024-04-23 14:20:00 +03:00
Dominique Eifländer
58acbab85f
Merge branch 'RED-8826' into 'main'
...
Red 8826
See merge request fforesight/layout-parser!138
0.116.0
2024-04-23 13:12:51 +02:00
Kilian Schüttler
d38d023485
Merge branch 'RED-7384' into 'main'
...
Red 7384
See merge request fforesight/layout-parser!140
0.115.0
2024-04-23 12:13:21 +02:00
Kilian Schüttler
c1afe9b11f
Red 7384
2024-04-23 12:13:19 +02:00
Corina Olariu
bdcb9aeda4
RED-8701 - Move files to customer data repositories
...
- update junit tests
2024-04-23 11:49:29 +03:00
Corina Olariu
6a86036a78
Merge branch 'main' into RED-8701
2024-04-23 11:46:59 +03:00
Corina Olariu
a358d7565e
RED-8701 - Move files to customer data repositories
...
- update junit tests
2024-04-23 11:12:57 +03:00
Corina Olariu
069a6c0b49
RED-8701 - Move files to customer data repositories
...
- update syngenta submodule
2024-04-23 10:44:23 +03:00
Dominique Eifländer
683f7f1fb8
RED-8826: Do not classify textblocks in graphics as headlines
2024-04-23 09:28:28 +02:00
Corina Olariu
7eab3a4088
RED-8701 - Move files to customer data repositories
...
- remove customer files from project
2024-04-22 14:57:51 +03:00
Corina Olariu
970fc99ed1
RED-8701 - Move files to customer data repositories
...
- update junit test
2024-04-22 14:14:47 +03:00
Corina Olariu
48c54f63a0
RED-8701 - Move files to customer data repositories
...
- update submodules
2024-04-22 13:57:39 +03:00
Corina Olariu
20e4e5ddff
RED-8701 - Move files to customer data repositories
...
- update unit tests with the new path to submodules for customer files
2024-04-22 13:37:27 +03:00
Dominique Eifländer
b53930328a
RED-8826: Implemented graphics detection
2024-04-19 15:05:17 +02:00
Dominique Eifländer
c947d552d2
Merge branch 'RED-8995-fp' into 'main'
...
RED-8995: unclassified text might be missing from document data
See merge request fforesight/layout-parser!135
0.114.0
2024-04-19 09:21:50 +02:00
Corina Olariu
6b1b5eab84
RED-8701 - Move files to customer data repositories
...
- add syngenta submodule
2024-04-18 20:33:00 +03:00
Corina Olariu
cc9816c8cb
RED-8701 - Move files to customer data repositories
...
- use git lfs to store customer files
2024-04-18 20:31:35 +03:00
Kilian Schuettler
f256f9b30f
RED-8995: unclassified text might be missing from document data
...
* treat TablePageBlock.OTHER like PARAGRAPH (no special treatment)
2024-04-18 17:42:34 +02:00
Yannik Hampe
6167e3fb57
Merge branch 'RED-8402' into 'main'
...
RED-8402: Header and footer are not indexed / searched
See merge request fforesight/layout-parser!134
0.113.0
2024-04-18 15:08:00 +02:00
yhampe
a78fb0244a
Merge remote-tracking branch 'origin/RED-8402' into RED-8402
2024-04-18 14:39:10 +02:00
yhampe
8099a00bb6
RED-8402: Header and footer are not indexed / searched
...
added unit test and file
2024-04-18 14:39:01 +02:00
yhampe
9bb0468b2b
RED-8402: Header and footer are not indexed / searched
...
added unit test and file
2024-04-18 14:36:25 +02:00
Kilian Schüttler
c4d9c5df02
Merge branch 'RED-8747-fp' into 'main'
...
RED-8747 - Entities not merged properly - fp
See merge request fforesight/layout-parser!131
0.112.0
2024-04-09 16:30:02 +02:00
Corina Olariu
976f408237
RED-8747 - Entities not merged properly - fp
...
- rework the extraction of rulings from the table cells
2024-04-09 14:38:48 +03:00
Corina Olariu
319268c53d
RED-8747 - Entities not merged properly - fp
...
- update test
2024-04-09 12:24:19 +03:00
Corina Olariu
014eba9fc3
RED-8747 - Entities not merged properly - fp
...
- fix typo
- add validate table test
2024-04-09 12:14:57 +03:00
Yannik Hampe
9bd8419770
Merge branch 'RED-8402' into 'main'
...
RED-8402: Header and footer are not indexed / searched
See merge request fforesight/layout-parser!128
0.110.0
2024-04-08 12:28:06 +02:00
yhampe
c13ff7fbf6
RED-8402: Header and footer are not indexed / searched
...
checkstyle
added review comments
2024-04-08 12:17:49 +02:00