Dominique Eifländer
fc2ac03691
Fixed table extraction problems
2021-02-03 14:34:29 +01:00
Dominique Eifländer
35f3582d08
Expand CBI Authors with firstname initials
2021-02-02 12:20:40 +01:00
Dominique Eifländer
d8e444280b
Fixed position problem for rotated images
2021-02-01 16:04:45 +01:00
Dominique Eifländer
39ca191b9c
Ignore too small images
2021-02-01 15:14:46 +01:00
Dominique Eifländer
acddfafa5b
Annotate images
2021-02-01 13:52:18 +01:00
Dominique Eifländer
1ed9941259
RED-1019: Fixed returning numberOfPages
2021-01-29 13:17:25 +01:00
Timo
1091d6a886
fixed issue affecting service on restarts
2021-01-28 11:05:01 +02:00
Dominique Eifländer
d739a4f2f5
RED-1010: Split redaction endpoint into analysis and annotation endpoints
2021-01-27 15:19:38 +01:00
Dominique Eifländer
62b960f2ea
RED-1004: Fixed position problems when mediabox is bigger than cropbox
2021-01-27 11:34:17 +01:00
Dominique Eifländer
a76095c5d6
Always check dictionary rank when overriding annotations
2021-01-26 12:04:26 +01:00
Dominique Eifländer
43a3d76f1c
Added possibility to redact all Author tables
2021-01-25 15:59:08 +01:00
Dominique Eifländer
7dfaf604e7
Fixed recognizing rules and exceptions for redactAndRecommend, fixed wrong order of rules (published information)
2021-01-25 11:20:35 +01:00
Dominique Eifländer
531eeebae1
Recognize rules and exceptions for redactAndRecommend, do not redact published information authors in new ruleset
2021-01-25 09:46:09 +01:00
Dominique Eifländer
33795527fd
RED-934: Added rule to redact purity; et al. authors will be redacted where they are found
2021-01-20 13:18:37 +01:00
Dominique Eifländer
89f642ba90
RED-937: Handle rules version per RuleSetId
2021-01-13 12:52:59 +01:00
Dominique Eifländer
9104db6fa4
Clean recommendation values starting with :
2021-01-12 13:41:19 +01:00
Dominique Eifländer
f3b0eb2911
Recommend CBI_address in Test Organism/Animals sections
2021-01-12 11:30:40 +01:00
Dominique Eifländer
d10c0a7900
Revert idRemoval fix, removed duplicate code
2021-01-08 13:46:55 +01:00
Timo
e3a960d086
idRemoval fix
2021-01-08 11:29:47 +02:00
Dominique Eifländer
e23ed69e04
Avoid IndexOutOfBoundsException if dictionary entry has blank at end
2021-01-07 16:20:51 +01:00
Dominique Eifländer
633fb403e0
Fixed RegEx for et al. recommendations
2021-01-07 13:07:21 +01:00
Timo Bejan
e58b4ff6c1
Pull request #95 : Feature/ruleset integration
...
Merge in RED/redaction-service from feature/ruleset-integration to master
* commit '2c4350b8f369c00177781edc4567df1f2806a2fe':
Rules Tester
rule update fix
2021-01-06 18:47:57 +01:00
Timo
2c4350b8f3
Rules Tester
2021-01-06 19:37:39 +02:00
Timo
07ffeab3ae
rule update fix
2021-01-06 19:30:40 +02:00
Timo Bejan
f087d4afdb
Pull request #94 : dev mode features, exception generalisation
...
Merge in RED/redaction-service from feature/ruleset-integration to master
* commit 'c2669ab56843f3ec335cfb1cd7d1e59dc828fb98':
fixed tests
fixed tests
dev mode features, exception generalisation
2021-01-06 16:23:56 +01:00
Timo
c2669ab568
fixed tests
2021-01-06 17:04:07 +02:00
Timo
a824aa20a5
fixed tests
2021-01-06 17:02:43 +02:00
Timo
6412cf37d9
dev mode features, exception generalisation
2021-01-06 16:41:16 +02:00
Dominique Eifländer
de725a630c
RED-727: Added possibility to redact/addRecommendations by regEx in rules. Added email regEx and et al. author recommendation regEx
2021-01-06 14:53:49 +01:00
Timo
09069d11ad
RedactionLog now stores ruleSetId
2021-01-06 10:11:16 +02:00
Timo
5aba4b69ba
RuleSetId integration and drools update
2021-01-06 01:45:38 +02:00
Dominique Eifländer
e8256c49dc
Fixed annotating cell with more than one TextBlock (Mismatch between EntityPositionSequence and found Entity)
2021-01-05 14:53:09 +01:00
Dominique Eifländer
599c7bd6e4
Tables with only 2 columns are treated as one text
2021-01-05 12:23:24 +01:00
Dominique Eifländer
609018a051
Fixed false positive dictionary problems
2021-01-04 16:34:55 +01:00
Dominique Eifländer
704e6a4b5a
Find annotations also in Header cells
2021-01-04 11:53:03 +01:00
Timo
0bc5abb29d
fixed text-after and text-before spacing
2021-01-03 12:40:21 +02:00
Dominique Eifländer
79b57e85cd
Handle '\u00A0' character the same way as ' '
2020-12-23 10:57:58 +01:00
Dominique Eifländer
000b145e71
Fixed 'Comparison method violates its general contract' by using QuickSort from PDFBox
2020-12-22 16:04:29 +01:00
Dominique Eifländer
caf6277de9
RED-882: Added textBefore and textAfter to redaction log
2020-12-18 14:31:27 +01:00
deiflaender
bfa363a3d2
RED-871: Fixed endless processing on document with corrupted contentStream
2020-12-11 11:26:48 +01:00
deiflaender
50ec16601c
Fixed table offset bug
2020-12-10 19:33:58 +01:00
deiflaender
e43bd1b711
RED-864: Added isDictionaryEntry to redactionLog. Fixed order of dictionary types
2020-12-10 12:37:51 +01:00
deiflaender
44613ee117
Made dictionaries thread-safe
2020-12-09 17:09:11 +01:00
deiflaender
608ea4bbcc
RED-824: Add author recommendations based on vertebrate study tables
2020-12-07 15:53:20 +01:00
deiflaender
c90eee23c4
Fixed duplicate Textblock in Tables
2020-12-03 15:17:33 +01:00
deiflaender
4ef6e0e2ef
RED-740: Improved section recognition
2020-11-27 15:39:31 +01:00
Timo
3c4d6dd2f2
fixed awt auto *
2020-11-26 20:20:43 +02:00
Timo
0e645ab273
fixed Tests & fallback
2020-11-26 20:12:36 +02:00
Timo
cc1a3c9e49
removed import
2020-11-26 18:56:02 +02:00
Timo
536d4689f3
Added rank of dictionary to processing entities in redaction service, simplified code
2020-11-26 18:52:44 +02:00