robert-bor
|
bcde097070
|
Issue #4 Trie.tokenize() available. It returns a list of tokens. A token can be either a fragment (unmatched text) or a match. If it is a match, the original emit can be queried.
|
2014-02-01 22:01:15 +01:00 |
|
robert-bor
|
ae20429936
|
Issue #3 added case insensitivity when matching keywords
|
2014-02-01 21:04:53 +01:00 |
|
robert-bor
|
cb44a6bff2
|
Issue #2 implemented whole word matching
|
2014-02-01 20:35:38 +01:00 |
|
robert-bor
|
4c8ea8ba57
|
Issue #1 fixed bug in compareTo method of Interval. Problem was the compareTo only worked on start position, whereas it should also work on end position.
|
2014-01-31 16:16:21 +01:00 |
|
robert-bor
|
1785a554f3
|
Issue #1 remove overlapping intervals. Resolution rule: longer matches over smaller ones, left-most over right-most
|
2014-01-31 14:56:11 +01:00 |
|
robert-bor
|
922da26965
|
Emit now also contains the start position of the found keyword
|
2014-01-30 09:36:53 +01:00 |
|
robert-bor
|
d140afc0da
|
first setup for the Aho-Corasick algorithm
|
2014-01-29 21:27:46 +01:00 |
|