Corina Olariu
e400a04644
Pull request #22 : RED-6072
...
Merge in RED/ocr-service from RED-6072 to master
* commit '961129b917e92b4dda2085215d3f15e205ec35a3':
RED-6072 - As Operation I want to see why files are in an ERROR state - update the ocr failed with error information - in case of exception the reason is caught and added tot the message received from the queue
RED-6072 - As Operation I want to see why files are in an ERROR state update from iqser to knecon
2023-05-25 14:59:04 +02:00
deiflaender
ee94f1c711
RED-5567: Upgraded to lastest pdftron version
2023-05-25 14:14:59 +02:00
devplant
961129b917
RED-6072 - As Operation I want to see why files are in an ERROR state
...
- update the ocr failed with error information
- in case of exception the reason is caught and added tot the message received from the queue
2023-05-25 14:29:40 +03:00
devplant
828b4f53c8
RED-5694 - Upgrade spring-boot to 3.0
...
- add -DknownExploitedEnabled=false
- import StorageAutoConfiguration to Application
2023-04-20 16:14:28 +03:00
devplant
f72001c4ac
RED-5694 - Upgrade spring-boot to 3.0
...
- remove commented code
2023-04-06 13:36:43 +03:00
devplant
81fd35f9c0
RED-5694 - Upgrade spring-boot to 3.0
...
- remove versions already defined in spring boot
2023-04-06 13:35:42 +03:00
devplant
17b1932926
RED-5694 - Upgrade spring-boot to 3.0
...
- remove unused import
2023-04-06 11:57:36 +03:00
devplant
c00e9290bc
RED-5694 - Upgrade spring-boot to 3.0
...
- update platform-dependency
- update other dependencies versions to the latest
2023-04-06 11:52:44 +03:00
Thomas Beyer
036203c24a
RED-4875 - delete commented out classes
2023-03-21 18:20:26 +01:00
Thomas Beyer
6ac71f6d94
RED-4875 - set version of pdftron-common-logics to newest (release)
2023-03-21 17:00:41 +01:00
Thomas Beyer
dd12611fdc
RED-4875 - removed duration log
2023-03-21 13:18:41 +01:00
Thomas Beyer
b0c4c25bec
RED-4875 - update version of pdftron-commons to newest
2023-03-21 12:28:57 +01:00
Thomas Beyer
5efa0e96a8
RED-4875 - update version of pdftron logic commons to newest
2023-03-20 11:25:38 +01:00
Thomas Beyer
fd92419895
RED-4875 - set version of common pdftron logics to newest and move PdfTextExtraction to this new repo
2023-03-20 10:01:33 +01:00
Thomas Beyer
142e8cf957
RED-4875 - update version of pdftron-logic-commons to newest
2023-03-17 17:25:52 +01:00
Thomas Beyer
143538fa40
RED-4875 - call logic of new repo pdftron-logic-commons instead of local one
2023-03-17 10:33:48 +01:00
Timo Bejan
8ab4092046
Pull request #18 : RED-6162 - bumped version
...
Merge in RED/ocr-service from RED-6162 to master
* commit 'c86cabf70a9ec90687c3f123768f2293da49c4ff':
RED-6162 - bumped version
RED-6162 - bumped version
2023-03-10 21:45:38 +01:00
Timo Bejan
c86cabf70a
RED-6162 - bumped version
2023-03-10 22:43:35 +02:00
deiflaender
e5df0ec658
RED-4645: Multitenancy for storage
2023-03-10 16:31:34 +01:00
Timo Bejan
0d9365d020
RED-6162 - bumped version
2023-03-10 15:59:25 +02:00
Timo Bejan
5156c560f2
RED-6162 - test issue on bamboo
2023-03-10 08:45:48 +02:00
Timo Bejan
c2a7e32789
RED-6162 - test issue on bamboo
2023-03-10 08:45:48 +02:00
Timo Bejan
da5a1cc042
RED-6162 - persistence update - reverse dependency cleanup
2023-03-10 08:45:48 +02:00
Kilian Schuettler
2d503c74a6
RED-6321: OCR not working correctly with 3.6.0
...
*added end() statements to formWriters to write their changes to the PDF contentStream
*moved replaceOriginalPageWithOcrPage outside the main OCR Loop since it caused exponential RAM Usage in some cases
2023-03-06 10:08:32 +01:00
Kilian Schuettler
7427258349
RED-6280: Performance Test Issue with OCR-Service
...
*removed init/terminate calls again
*manual memory cleanup at every opportunity
2023-02-28 10:21:21 +01:00
Kilian Schuettler
caff5580dd
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*refactored Tests with inheritance
*called PDFNet init/terminate in tests
*don't call init on startup
2023-02-22 11:58:07 +01:00
Kilian Schuettler
430ad45a67
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*called PDFNet.initialize and terminate before and after message receive
*updated comments
*renamed some variables
2023-02-22 10:32:55 +01:00
Kilian Schuettler
0bc4fea2a5
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*removed unnecessary getXObject() call, since it fails for inline_images
2023-02-13 17:55:02 +01:00
Kilian Schuettler
37f1e03ebc
RED-6126: performance-test
...
*refactor to improve cleanness
*closed inputStream
2023-02-10 14:49:10 +01:00
Kilian Schuettler
b3fa14b342
RED-6126: performance-test
...
*fixed NullPointerException
*fixed StackOverFlowError by ignoring very small images and moving to while loop instead of recursion
2023-02-10 12:27:16 +01:00
Kilian Schuettler
7065d098f3
RED-6126: performance-test
...
*fixed time calculation
2023-02-09 16:31:42 +01:00
Kilian Schuettler
8db0b712f7
RED-6126: performance-test
...
*improved error logging
2023-02-09 13:57:21 +01:00
Kilian Schuettler
6ccf3f80fc
RED-6126: performance-test
...
*re-enabled overlap detection
*re-creating helper document for every page instead of reusing and adding/removing pages
2023-02-09 11:22:39 +01:00
Kilian Schuettler
e705f869fd
RED-6126: Performance Tests
...
*moved to streams for pdf file transfer
*disabled overlap detection
2023-02-09 11:09:52 +01:00
Timo Bejan
7c71d8ad04
RED-4609 - added ocr metric, enabled prometheus, added test for metric
2023-02-08 16:46:51 +02:00
Kilian Schuettler
00cfe9e449
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*reformatted InvisibleElementRemovalService with new Code Style
2023-02-07 12:52:09 +01:00
Kilian Schuettler
d0d6bf70a4
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*updated some comments
*very slight refactor
2023-02-07 12:09:04 +01:00
Kilian Schuettler
a415224db5
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*complete refactor of the OCRService
*moved image position retrieval to new class instead of image service
*added new tests for image rotation
2023-02-07 12:05:24 +01:00
Kilian Schuettler
355887c865
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*removed private configuration
2023-02-03 13:16:56 +01:00
Kilian Schuettler
ab566a11a9
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*formatted one line
2023-02-03 13:03:47 +01:00
Kilian Schuettler
edd044395e
RED-6126: In the OCRService, OCR Text is not applied to Document
...
*reverted application of OCR Text to Document to old state
*refactored OCR Service slightly
*added meaningful test cases
2023-02-03 13:01:01 +01:00
Kilian Schuettler
a96260f77f
RED-6019: Remove hidden text when processing OCR
...
*moved InvisibleElementRemovalDto to private inner record of InvisibleElementRemovalService
*added comments for color choices
2023-02-02 13:01:58 +01:00
Kilian Schuettler
12fbdbee50
RED-6019: Remove hidden text when processing OCR
...
*moved to release version of platform-dependencies
*restored annotationProcessors
2023-02-02 10:53:19 +01:00
Kilian Schuettler
99a0cb51d0
RED-6019: Remove hidden text when processing OCR
...
*code refactor
*upgrade to java 17
2023-02-02 10:27:01 +01:00
Kilian Schuettler
fd7ec6e7aa
RED-6019: Remove hidden text when processing OCR
...
handled cases:
Text which is transparent or is set to not render
Elements outside of clipping path
Elements that have been painted over by visible and filled Paths
unhandled cases:
Elements covered by widely stroked path
Elements same color as background
Any Text set to clipping with its many interactions with other elements
2023-01-30 16:13:51 +01:00
deiflaender
7a4c5c2f89
RED-5911: Reverted to old ocr logic that uses ContentReplacer/TextExtractor to remove text behind images
2023-01-17 12:15:34 +01:00
Philipp Schramm
0e8dfed441
RED-5911 Bugfix for removed texts within tables
2023-01-04 12:31:51 +01:00
deiflaender
998d69ba48
RED-4556: Implemented ocr standalone service
2022-12-05 12:15:55 +01:00