259 Commits

Author SHA1 Message Date
Dominique Eifländer
d87cbdeaeb Ignore images that are contained in others 2021-05-20 09:20:22 +02:00
Dominique Eifländer
13974d1373 RED-1390: Do not classify images that are nearly as big as the page 2021-05-19 11:46:08 +02:00
Dominique Eifländer
816f44edf0 RED-1506: Fixed unpocess images in strange formats 2021-05-19 10:18:11 +02:00
Timo Bejan
be92e3c036 Pull request #155: RED-785
Merge in RED/redaction-service from RED-785 to master

* commit '513501abf476dc7f08446bb3035da888d2fd30a2':
  save legal basis version on redaction log
  adjusted tests
  legal basis is now stored in redaction log
2021-05-14 14:07:56 +02:00
Timo
2334a6cb5b save legal basis version on redaction log 2021-05-14 15:07:07 +03:00
Timo
513501abf4 save legal basis version on redaction log 2021-05-14 15:00:22 +03:00
Timo
6a660c6a73 adjusted tests 2021-05-14 14:50:06 +03:00
Timo
8b898a9038 legal basis is now stored in redaction log 2021-05-14 14:49:21 +03:00
Dominique Eifländer
bdf40f5e4e RED-1477: Fixed duplicate images 2021-05-12 11:28:17 +02:00
Dominique Eifländer
184f55535a Fixed dossier dictionary renalaysis 2021-04-30 13:52:46 +02:00
Dominique Eifländer
b7ae6cf390 Added isDossierDictionaryEntry to manualRedactions and redactionLog 2021-04-29 12:29:50 +02:00
Dominique Eifländer
48beb984ca RED-1334: Enabled to use dictionaries per dossier 2021-04-28 15:11:11 +02:00
Timo
1ddcafbdd2 cleaned up image code 2021-04-26 19:01:01 +03:00
Timo Bejan
e1b8ca4956 Pull request #149: moved changelog to redaction service, removed filename from changelog
Merge in RED/redaction-service from improved-redaction-performance to master

* commit 'c05fdb48545b829ae82502c6d6ad486f2210f8f2':
  moved changelog to redaction service, removed filename from changelog
2021-04-26 14:52:44 +02:00
Timo
c05fdb4854 moved changelog to redaction service, removed filename from changelog 2021-04-26 15:48:17 +03:00
Clarissa.Dietrich
047ac9bf84 RED-1378: fix: merging images condition 2021-04-26 10:50:35 +02:00
Clarissa.Dietrich
4d1ad3b5a5 RED-1351: merge images into one 2021-04-22 17:10:12 +02:00
Timo
44ff9cb775 fixed changelog storage 2021-04-21 20:41:18 +03:00
Timo
735cfdd6c4 reworked api 2021-04-21 18:53:29 +03:00
Timo
3af1211932 fixed reanalyse condition 2021-04-21 18:16:39 +03:00
Timo
c1d5be8f63 fixed reanalyse condition 2021-04-21 18:13:35 +03:00
Timo
567bb00377 removed long test, fixed NPE 2021-04-21 18:13:35 +03:00
Timo
f6d0b2d0db temporary ignore this 2021-04-21 17:34:01 +03:00
Timo
43217e6abe added missing config yml 2021-04-21 17:28:01 +03:00
Timo
8182b1a0fa Updated code to respond to file-status service on successful/failed analysis 2021-04-21 17:15:49 +03:00
Timo
ab3b30b282 prepared redaction service for queue handling 2021-04-21 14:11:22 +03:00
Timo
0d63f9372f added reanalyse if possible flag to analyse request 2021-04-21 09:16:36 +03:00
Timo
5b24c3a52c fixed some test issues 2021-04-20 10:56:19 +03:00
Timo
385dff63ce fixed pmd 2021-04-20 10:37:53 +03:00
Timo
e48e4e1797 updated redrect 2021-04-20 10:32:54 +03:00
Timo
ba28a3e0d3 code format, dependecy and test update, logging for reanalysis 2021-04-20 10:26:27 +03:00
Timo
1d4708ad13 reworked reanalysis and text storage 2021-04-20 09:51:50 +03:00
Timo
5c2596e268 Serialization of text 2021-04-19 13:08:32 +03:00
Timo
42fcea85d3 set image type on error 2021-04-18 11:31:33 +03:00
Timo
a34d2fb675 proper error handling for image clasification 2021-04-18 11:30:53 +03:00
Timo
ed5686dc51 Re-worked cache issues 2021-04-17 09:55:46 +03:00
Timo
8112f2035a close final PDDocument 2021-04-17 09:18:01 +03:00
Timo
169ab20351 fixed in-memory storage issues 2021-04-17 09:10:40 +03:00
Timo
8060e3a29f "fixed" memory issues by calling GC manually, removing soft reference cache and disposing images properly 2021-04-16 23:13:12 +03:00
Timo
4749858e80 attempted fix for image clasification 2021-04-16 20:53:09 +03:00
Timo
93d75e2f1c added versions to analyze result 2021-04-16 15:52:17 +03:00
Timo
5cb4ea287c Reworked re-analysis and analysis to use memory model / directly read/store files, and dumped pd doc wherever possible 2021-04-16 14:50:04 +03:00
Dominique Eifländer
c157a80630 Fixed endless loop on documents that contains no text 2021-04-16 09:42:29 +02:00
Dominique Eifländer
55ba351362 RED-1260: Enabled to add rules and manuel redaction actions for images 2021-04-15 12:51:29 +02:00
Dominique Eifländer
ae28555bf4 RED-1260: First steps for image classification 2021-04-09 13:37:00 +02:00
Dominique Eifländer
2558b3cab8 Integrate image classification 2021-04-09 11:44:12 +02:00
Dominique Eifländer
fc70f972da Do not remove images on reanalysis 2021-04-08 15:47:11 +02:00
Dominique Eifländer
e7c24487c7 Do not remove images at reanalysis 2021-04-08 15:22:25 +02:00
Dominique Eifländer
8375c04829 RED-1276: Fixed annotation position problem for 270° rotated pages 2021-04-08 10:30:51 +02:00
Dominique Eifländer
0638877d0a RED-1061: Upgraded to newest spring boot/cloud and reenabled actuator metrics 2021-03-15 13:22:14 +01:00