25 Commits

Author SHA1 Message Date
Kilian Schuettler
6ccf3f80fc RED-6126: performance-test
*re-enabled overlap detection
*re-creating helper document for every page instead of reusing and adding/removing pages
performance-test
2023-02-09 11:22:39 +01:00
Kilian Schuettler
e705f869fd RED-6126: Performance Tests
*moved to streams for pdf file transfer
*disabled overlap detection
2023-02-09 11:09:52 +01:00
Timo Bejan
efaa291e43 Pull request #6: RED-4609 - added ocr metric, enabled prometheus, added test for metric
Merge in RED/ocr-service from RED-4609 to master

* commit '7c71d8ad041f839c21ec26023ee8eaef670a4924':
  RED-4609 - added ocr metric, enabled prometheus, added test for metric
1.7.0
2023-02-09 10:57:37 +01:00
Timo Bejan
7c71d8ad04 RED-4609 - added ocr metric, enabled prometheus, added test for metric RED-4609_2 2023-02-08 16:46:51 +02:00
Kilian Schuettler
b0a658213d Pull request #5: RED-6126
Merge in RED/ocr-service from RED-6126 to master

* commit '00cfe9e44948c153857ad59442dbc9349e1d4555':
  RED-6126: In the OCRService, OCR Text is not applied to Document *reformatted InvisibleElementRemovalService with new Code Style
  RED-6126: In the OCRService, OCR Text is not applied to Document *updated some comments *very slight refactor
  RED-6126: In the OCRService, OCR Text is not applied to Document *complete refactor of the OCRService *moved image position retrieval to new class instead of image service *added new tests for image rotation
  RED-6126: In the OCRService, OCR Text is not applied to Document *removed private configuration
  RED-6126: In the OCRService, OCR Text is not applied to Document *formatted one line
  RED-6126: In the OCRService, OCR Text is not applied to Document *reverted application of OCR Text to Document to old state *refactored OCR Service slightly *added meaningful test cases
1.6.0
2023-02-07 13:35:32 +01:00
Kilian Schuettler
00cfe9e449 RED-6126: In the OCRService, OCR Text is not applied to Document
*reformatted InvisibleElementRemovalService with new Code Style
RED-6126_7
2023-02-07 12:52:09 +01:00
Kilian Schuettler
d0d6bf70a4 RED-6126: In the OCRService, OCR Text is not applied to Document
*updated some comments
*very slight refactor
RED-6126_6
2023-02-07 12:09:04 +01:00
Kilian Schuettler
a415224db5 RED-6126: In the OCRService, OCR Text is not applied to Document
*complete refactor of the OCRService
*moved image position retrieval to new class instead of image service
*added new tests for image rotation
RED-6126_5
2023-02-07 12:05:24 +01:00
Kilian Schuettler
355887c865 RED-6126: In the OCRService, OCR Text is not applied to Document
*removed private configuration
RED-6126_4
2023-02-03 13:16:56 +01:00
Kilian Schuettler
ab566a11a9 RED-6126: In the OCRService, OCR Text is not applied to Document
*formatted one line
RED-6126_3
2023-02-03 13:03:47 +01:00
Kilian Schuettler
edd044395e RED-6126: In the OCRService, OCR Text is not applied to Document
*reverted application of OCR Text to Document to old state
*refactored OCR Service slightly
*added meaningful test cases
RED-6126_2
2023-02-03 13:01:01 +01:00
Kilian Schuettler
b37ec5afc9 Pull request #4: RED-6019: Remove hidden text when processing OCR
Merge in RED/ocr-service from RED-6019 to master

* commit 'a96260f77fd5b546a5d27d84f34861742f13ddff':
  RED-6019: Remove hidden text when processing OCR *moved InvisibleElementRemovalDto to private inner record of InvisibleElementRemovalService *added comments for color choices
  RED-6019: Remove hidden text when processing OCR *moved to release version of platform-dependencies *restored annotationProcessors
  RED-6019: Remove hidden text when processing OCR *code refactor *upgrade to java 17
  RED-6019: Remove hidden text when processing OCR handled cases:      Text which is transparent or is set to not render      Elements outside of clipping path      Elements that have been painted over by visible and filled Paths unhandled cases:      Elements covered by widely stroked path      Elements same color as background      Any Text set to clipping with its many interactions with other elements
1.5.0
2023-02-02 13:05:03 +01:00
Kilian Schuettler
a96260f77f RED-6019: Remove hidden text when processing OCR
*moved InvisibleElementRemovalDto to private inner record of InvisibleElementRemovalService
*added comments for color choices
RED-6019_5
2023-02-02 13:01:58 +01:00
Kilian Schuettler
12fbdbee50 RED-6019: Remove hidden text when processing OCR
*moved to release version of platform-dependencies
*restored annotationProcessors
RED-6019_4
2023-02-02 10:53:19 +01:00
Kilian Schuettler
99a0cb51d0 RED-6019: Remove hidden text when processing OCR
*code refactor
*upgrade to java 17
RED-6019_3
2023-02-02 10:27:01 +01:00
Kilian Schuettler
fd7ec6e7aa RED-6019: Remove hidden text when processing OCR
handled cases:
     Text which is transparent or is set to not render
     Elements outside of clipping path
     Elements that have been painted over by visible and filled Paths
unhandled cases:
     Elements covered by widely stroked path
     Elements same color as background
     Any Text set to clipping with its many interactions with other elements
RED-6019_2
2023-01-30 16:13:51 +01:00
Dominique Eiflaender
265fac8099 Pull request #3: RED-5911: Reverted to old ocr logic that uses ContentReplacer/TextExtractor to remove text behind images
Merge in RED/ocr-service from RED-5911 to master

* commit '7a4c5c2f898e83623a66ef29ab9ed696e2057e24':
  RED-5911: Reverted to old ocr logic that uses ContentReplacer/TextExtractor to remove text behind images
1.4.0
2023-01-17 12:42:10 +01:00
deiflaender
7a4c5c2f89 RED-5911: Reverted to old ocr logic that uses ContentReplacer/TextExtractor to remove text behind images cVersion1 2023-01-17 12:15:34 +01:00
Philipp Schramm
e535861da8 Pull request #2: RED-5911 Bugfix for removed texts within tables
Merge in RED/ocr-service from RED-5911 to master

* commit '0e8dfed4410b28a6316f8e328fc166339852565f':
  RED-5911 Bugfix for removed texts within tables
1.3.0
2023-01-04 15:02:15 +01:00
Philipp Schramm
0e8dfed441 RED-5911 Bugfix for removed texts within tables RED-5911_4 2023-01-04 12:31:51 +01:00
Dominique Eiflaender
2fe0eb024a Pull request #1: RSS-146: Install Ghostscript to imporve ocr quality
Merge in RED/ocr-service from RSS-146 to master

* commit '37548ddbf89c0ca5e25d18b19af9400ef5e608c0':
  RSS-146: Install Ghostscript to imporve ocr quality
1.2.0 RED-5911_2
2022-12-14 17:41:01 +01:00
deiflaender
37548ddbf8 RSS-146: Install Ghostscript to imporve ocr quality RSS-146_2 2022-12-14 16:11:00 +01:00
Christoph Schabert
e73824ed09 build-java.sh edited online with Bitbucket 1.0.0 1.1.0 2022-12-05 12:29:42 +01:00
deiflaender
998d69ba48 RED-4556: Implemented ocr standalone service 0.1.0 2022-12-05 12:15:55 +01:00
deiflaender
5a3dcde3ad RSS-256: Added .gitignore 2022-12-05 09:05:11 +01:00