507 Commits

Author SHA1 Message Date
Kilian Schuettler
fcbc005102 RED-10127: Paragraphs with multiple table, appendix, figure can't be headlines 2024-12-06 14:49:35 +01:00
Kilian Schuettler
2fdc53429c fix accidental push to main 0.159.19 2024-11-28 12:57:13 +01:00
Kilian Schüttler
3b30732352 RED-9139: more robust TOC detection
(cherry picked from commit 7ee1f9e360d1cdd1bf85f9441e27fe1ed0e4ce7e)
2024-11-28 12:52:55 +01:00
Kilian Schüttler
e01c0a8d3b Merge branch 'RED-10270-bp' into 'release/0.159.x'
RED-10270: fix NumberFormatException

See merge request fforesight/layout-parser!250
0.159.18
2024-10-24 17:14:51 +02:00
Kilian Schüttler
5ef5d5509b RED-10270: fix NumberFormatException 2024-10-24 17:14:51 +02:00
Kilian Schüttler
ab70536d06 Merge branch 'RED-10204' into 'release/0.159.x'
RED-10204: backport of NPE hotfix and rename TextPositionSequence to Word

See merge request fforesight/layout-parser!247
0.159.17
2024-10-24 10:05:00 +02:00
Kilian Schüttler
5e091402c7 RED-10204: backport of NPE hotfix and rename TextPositionSequence to Word 2024-10-24 10:04:59 +02:00
Kilian Schüttler
03f5acd417 Merge branch 'feature/RED-10127-bp' into 'release/0.159.x'
RED-10127: add more units

See merge request fforesight/layout-parser!241
0.159.16
2024-10-15 09:57:11 +02:00
Kilian Schuettler
8ca41cf340 RED-10127: add more units 2024-10-15 09:47:07 +02:00
Kilian Schüttler
cee6c74d73 Merge branch 'feature/RED-10127-bp' into 'release/0.159.x'
RED-10127: improve list classification

See merge request fforesight/layout-parser!239
2024-10-14 17:31:24 +02:00
Kilian Schuettler
d8394d9a78 RED-10127: improve list classification
* add one more format to list identification
* add 'ppb' to known units
* special case for headlines continuing with 14C after the identifier (quite often in some specific files)
2024-10-14 17:22:19 +02:00
Kilian Schüttler
d3c4413ece Merge branch 'feature/RED-10127-bp' into 'release/0.159.x'
RED-10127: add list classification

See merge request fforesight/layout-parser!238
0.159.15
2024-10-10 10:50:18 +02:00
Kilian Schüttler
d614aed96a RED-10127: add list classification 2024-10-10 10:50:17 +02:00
Kilian Schüttler
63953ecf2d Merge branch 'feature/RED-10127-bp' into 'release/0.159.x'
RED-10127: improve headline detection

See merge request fforesight/layout-parser!236
0.159.14
2024-10-09 09:56:05 +02:00
Kilian Schüttler
8c28a46817 RED-10127: improve headline detection 2024-10-09 09:56:04 +02:00
Maverick Studer
072ad3bf23 Merge branch 'RED-10126-bp' into 'release/0.159.x'
RM-187: Footers are recognized in the middle of the page

See merge request fforesight/layout-parser!234
0.159.13
2024-10-08 14:27:55 +02:00
Maverick Studer
8a11d838b9 RM-187: Footers are recognized in the middle of the page 2024-10-08 14:27:55 +02:00
Dominique Eifländer
ed37b4bedf Merge branch 'RED-9975-4.2' into 'release/0.159.x'
RED-9975: Fixed missing section numbers in layout grid

See merge request fforesight/layout-parser!229
0.159.12
2024-09-18 11:26:10 +02:00
Dominique Eifländer
dda5a2c719 RED-9975: Fixed missing section numbers in layout grid 2024-09-18 11:20:15 +02:00
Dominique Eifländer
0f641670f7 Merge branch 'RED-9974-4.2' into 'release/0.159.x'
Red 9974 4.2

See merge request fforesight/layout-parser!228
0.159.11
2024-09-16 14:06:40 +02:00
Dominique Eifländer
b08c102f76 RED-9974: Disabled failing test because of different header/footers 2024-09-16 13:32:44 +02:00
Dominique Eifländer
6acc85266c RED-9974: Ignore enoughChars when section identifierer regex matches for documine old 2024-09-16 12:16:11 +02:00
Dominique Eifländer
a4d6d2326e RED-9974: Do not rewrite outline as pdftron crashes in some cases 2024-09-16 10:50:24 +02:00
Dominique Eifländer
a337fdf684 RED-9974: Ignore pmd errors that only occur on build server 2024-09-16 10:18:27 +02:00
Kilian Schuettler
95e6fdecd7 RED-9974: wip 2024-09-16 09:46:41 +02:00
Kilian Schuettler
1337c56591 RED-9974: wip 2024-09-16 09:46:31 +02:00
Kilian Schuettler
31bf4ba8c8 hotfix: viewerDocService doesn't remove existing marked content 2024-09-16 09:46:16 +02:00
Kilian Schüttler
f034c5bfa0 Merge branch 'RED-9975-bp' into 'release/0.159.x'
RED-9975: improve SuperSection handling

See merge request fforesight/layout-parser!224
0.159.10
2024-09-11 13:38:04 +02:00
Kilian Schüttler
41ba531734 RED-9975: improve SuperSection handling 2024-09-11 13:38:04 +02:00
Dominique Eifländer
c392813402 Merge branch 'RED-9976-4.2' into 'release/0.159.x'
RED-9976: Removed sorting that scrambles text in PDFTextStripper

See merge request fforesight/layout-parser!221
0.159.9
2024-09-10 13:02:22 +02:00
Dominique Eifländer
4a624f9642 RED-9976: Removed sorting that scrambles text in PDFTextStripper 2024-09-10 12:48:28 +02:00
Kilian Schüttler
f6c60aa5eb Merge branch 'hotfix-bp' into 'release/0.159.x'
hotfix: unmerge super large tables

See merge request fforesight/layout-parser!219
0.159.8
2024-09-05 15:05:11 +02:00
Kilian Schuettler
90a1187921 hotfix: unmerge super large tables 2024-09-05 14:50:35 +02:00
Kilian Schuettler
09c18c110a hotfix: unmerge super large tables 2024-09-05 14:26:45 +02:00
Kilian Schüttler
9012162542 Merge branch 'hotfix-bp' into 'release/0.159.x'
hotfix: add Java advanced imaging

See merge request fforesight/layout-parser!216
2024-09-04 15:44:02 +02:00
Kilian Schuettler
49604cd96e hotfix: add Java advanced imaging 2024-09-04 15:19:43 +02:00
Kilian Schüttler
943a6b6536 Merge branch 'RED-9964-bp' into 'release/0.159.x'
RED-9964: fix errors with images

See merge request fforesight/layout-parser!213
0.159.7
2024-09-04 09:17:19 +02:00
Kilian Schuettler
302d8b884f RED-9964: fix errors with images 2024-09-03 16:38:17 +02:00
Dominique Eifländer
a50b047cbb Merge branch 'RED-9988-4.2' into 'release/0.159.x'
RED-9988: Fixed NPE when image representation is not present

See merge request fforesight/layout-parser!209
0.159.6
2024-09-02 09:26:16 +02:00
Dominique Eifländer
8de9d8309f RED-9988: Fixed NPE when image representation is not present 2024-09-02 09:18:38 +02:00
Kilian Schüttler
3b12242355 Merge branch 'RED-9975-bp' into 'release/0.159.x'
Red 9975: fix outline detection

See merge request fforesight/layout-parser!208
0.159.5
2024-08-30 17:48:02 +02:00
Kilian Schüttler
e8605f4956 Red 9975: fix outline detection 2024-08-30 17:48:02 +02:00
Kilian Schüttler
f4a5b5fcbf Merge branch 'RED-9975-bp' into 'release/0.159.x'
Red 9975: add outline debug layer

See merge request fforesight/layout-parser!207
0.159.4
2024-08-30 14:18:09 +02:00
Kilian Schüttler
8496b48cde Red 9975: add outline debug layer 2024-08-30 14:18:09 +02:00
Kilian Schüttler
de266dcfe5 Merge branch 'RED-9964' into 'release/0.159.x'
Red 9964: don't merge tables on non-consecutive pages or with tables in between

See merge request fforesight/layout-parser!204
0.159.3
2024-08-30 14:00:50 +02:00
Kilian Schüttler
10e525f0de Red 9964: don't merge tables on non-consecutive pages or with tables in between 2024-08-30 14:00:50 +02:00
Dominique Eifländer
e0e5e35b30 Merge branch 'RED-9974-4.2' into 'release/0.159.x'
RED-9974: Improved headline detection for documine old

See merge request fforesight/layout-parser!203
0.159.2
2024-08-30 10:52:31 +02:00
Dominique Eifländer
e1d8d1ea3b RED-9974: Improved headline detection for documine old 2024-08-30 10:35:24 +02:00
Kilian Schüttler
1546c05dd8 Merge branch 'RED-9975-bp' into 'release/0.159.x'
activate outline detection

See merge request fforesight/layout-parser!200
0.159.1
2024-08-29 14:26:14 +02:00
Kilian Schuettler
7c88c30ca7 RED-9975: activate outline detection 2024-08-29 14:17:20 +02:00