Dominique Eifländer
92fd1a72de
RED-7141: Re-added lost mergeLinesInZones
2024-03-12 13:42:40 +01:00
Dominique Eifländer
0d3d25e7d7
Merge branch 'RED-7141-hotfix' into 'main'
...
RED-7141: Align backend text sorting with Webviewer sorting
See merge request fforesight/layout-parser!115
2024-03-12 11:15:41 +01:00
maverickstuder
956fbff872
RED-7141: Align backend text sorting with Webviewer sorting
...
* hotfix for tables not being detected due to wrong x-y-sorting
2024-03-12 11:06:53 +01:00
maverickstuder
16be2467fd
RED-8715: Improve NearestNeighbor Algorithm in LayoutParser
...
* replaced the old algorithm with an algorithm based on a kd-tree
2024-03-11 14:42:28 +01:00
Timo Bejan
dfc23955d7
Linespacing clarifynd
2024-03-11 11:30:51 +02:00
Dominique Eifländer
d6e3d6fe22
Clarifynd
2024-03-11 11:24:58 +02:00
Timo Bejan
56c07a4491
CLARI-30 - identifier fix for clarifynd
2024-03-08 16:23:27 +02:00
Dominique Eifländer
0ad0cd45d6
RED-7141: Moved docstrum to root level of processor package
2024-03-08 14:20:28 +01:00
Dominique Eifländer
d659fe7234
RED-7141: Performance improvements
2024-03-08 10:00:52 +01:00
Dominique Eifländer
cb9127b4f3
RED-7141: Fixed pr finding and improved speed
2024-03-07 16:51:48 +01:00
Dominique Eifländer
79239b751d
RED-7141: Implemented docstrum layout parsing
2024-03-06 11:18:40 +01:00
yhampe
a6ba501fa8
RED-8481: Use visual layout parsing to detect signatures
...
fixed some null pointer errors
2024-02-29 09:22:27 +01:00
Maverick Studer
74f55a5cbf
RED-8550: Faulty table recognition and text duplication leads to huge sections
2024-02-28 16:13:56 +01:00
Kilian Schuettler
f4d789311c
hotfix: double viewerdoc writes in rare cases lead to some contentstreams not being written
2024-02-26 12:24:15 +01:00
yhampe
477f6af886
RED-8481: Use visual layout parsing to detect signatures
...
added a new layer for visual parsing results
checkstyle
2024-02-23 14:02:53 +01:00
yhampe
2c171b6a9e
RED-8481: Use visual layout parsing to detect signatures
...
added a new layer for visual parsing results
codestyle
2024-02-23 13:55:11 +01:00
yhampe
71477dabde
RED-8481: Use visual layout parsing to detect signatures
...
added a new layer for visual parsing results
codestyle
2024-02-23 12:46:51 +01:00
yhampe
a927cbd9dc
RED-8481: Use visual layout parsing to detect signatures
...
added a new layer for visual parsing results
fixed tests
2024-02-23 12:38:05 +01:00
yhampe
a1521877d7
RED-8481: Use visual layout parsing to detect signatures
...
added a new layer for visual parsing results
added a source label to image properties to enable rules
2024-02-23 12:20:11 +01:00
Maverick Studer
1d64028158
RED-8550: Faulty table recognition and text duplication leads to huge sections
2024-02-21 13:54:30 +01:00
yhampe
cc77d19500
RED-8481: Use visual layout parsing to detect signatures
...
addressed review comments
2024-02-15 13:01:30 +01:00
yhampe
fa048b2fe0
RED-8481: Use visual layout parsing to detect signatures
...
addressed review comments
2024-02-15 12:19:26 +01:00
yhampe
bdf1161c91
RED-8481: Use visual layout parsing to detect signatures
...
addressed review comments
2024-02-15 12:12:23 +01:00
yhampe
b4a225144d
RED-8481: Use visual layout parsing to detect signatures
...
working on failing tests
2024-02-15 10:16:07 +01:00
yhampe
903b1c1fd4
RED-8481: Use visual layout parsing to detect signatures
...
fixed failing tests because of null pointer
2024-02-15 09:27:07 +01:00
yhampe
c3e7582ee3
RED-8481: Use visual layout parsing to detect signatures
...
fixed failing tests because of null pointer
2024-02-14 12:33:36 +01:00
yhampe
cfc5db45cd
RED-8481: Use visual layout parsing to detect signatures
...
fixed failing tests because of null pointer
2024-02-14 12:24:32 +01:00
yhampe
fbd0196719
RED-8481: Use visual layout parsing to detect signatures
...
implemented visuallayoutparsingresult
2024-02-14 12:16:37 +01:00
Kilian Schuettler
23eb0c40a3
RED-8156: refactor ViewerDocumentService as a dependency for ocr-service
...
* various improvements to experimental parsing steps
* added embed fonts functionality to viewer doc
2024-02-06 16:59:51 +01:00
Dominique Eifländer
e4f3557b36
RED-8171: Traces do not stop at @Async
2024-02-02 13:22:57 +01:00
Timo Bejan
88855de2da
RED-8085
2024-01-29 10:31:36 +01:00
Kilian Schüttler
ba1c7c07ab
RED-7384: fixes for migration
2023-12-20 12:40:00 +01:00
Dominique Eifländer
dacc2f7f43
DM-589: Filter out wrongly detected cells caused by borders from rotation during scanning
2023-11-20 15:54:02 +01:00
yhampe
207d9dec97
* added back in if statement
...
* removed unneeded comment
2023-11-16 12:40:49 +01:00
yhampe
1316a067fe
* removed double checking of cell height
2023-11-16 08:51:12 +01:00
yhampe
e203210ade
* removed unneeded properties
2023-11-16 08:23:58 +01:00
Dominique Eifländer
a6ba66b1aa
TAAS-103: Fixed values in wrong cells
2023-11-15 13:36:46 +01:00
yhampe
c3e69b2cdf
* fixed bug with incorrect empty cell count by adding threshold to cell.contains
2023-11-15 10:44:47 +01:00
yhampe
f69331e7d8
* renamed page to firstPage in DocumentStructure and Table
2023-11-07 10:21:19 +01:00
yhampe
01493dc033
TAAS-103: Table Detection and rotated text
...
* added page property to DocumentStructure to be able to get page of found tables
* added a method to TableExtractionService to get the table area
* added calculateMinCharWidthAndMaxCharHeightInsideTable to LayoutParsingPipeline to calculate the values based upon table area
* refactored PDFLinesTextStripper for better readability
* removed textMatrix from RedTextPosition as it is no longer needed
2023-11-07 08:47:28 +01:00
Corina Olariu
0e0a811f9d
RED-7806 - Specific customer document cannot be processed
...
- add brackets
2023-10-25 11:36:54 +03:00
Corina Olariu
efa3d75479
RED-7806 - Specific customer document cannot be processed
...
- check for font name null before using to avoid the NPE
2023-10-25 09:16:47 +03:00
Corina Olariu
3bab61c446
RED-7434 - Remove Section Grid entirely
...
- remove sectionGrid relation (including SectionGridCreatorService)
- update junit tests
2023-10-20 09:09:22 +03:00
Dominique Eifländer
567cbc178b
hotfix: Fixed parsing for specific TAAS document
2023-10-17 15:52:19 +02:00
Corina Olariu
3839de215c
RED-7607 - Rotating pages leads to lost annotations (RM & DM)
...
- rollback to getDir().getDegrees()
2023-10-04 15:27:13 +03:00
Corina Olariu
b4d68594f1
RED-7607 - Rotating pages leads to lost annotations (RM & DM)
...
- use rotation instead of getDir().getDegrees()
2023-10-04 14:22:15 +03:00
Corina Olariu
99ed331a1e
RED-7607 - Rotating pages leads to lost annotations (RM & DM)
...
- use getXDirAdj instead of getX
- add fontSizeCounter for landscape pages also
2023-10-04 14:13:38 +03:00
Corina Olariu
f2c0991987
RED-7607 - Rotating pages leads to lost annotations (RM & DM)
...
- fix PMD findings
2023-10-04 14:09:46 +03:00
Kilian Schuettler
5792ff4a93
TAAS-104: merge visually intersecting Paragraphs
...
* fix build
2023-09-05 16:54:23 +02:00
Kilian Schuettler
621c3f269d
TAAS-104: merge visually intersecting Paragraphs
2023-09-05 16:09:05 +02:00