robert-bor
|
bcde097070
|
Issue #4 Trie.tokenize() available. It returns a list of tokens. A token can be either a fragment (unmatched text) or a match. If it is a match, the original emit can be queried.
|
2014-02-01 22:01:15 +01:00 |
|
robert-bor
|
ae20429936
|
Issue #3 added case insensitivity when matching keywords
|
2014-02-01 21:04:53 +01:00 |
|
robert-bor
|
cb44a6bff2
|
Issue #2 implemented whole word matching
|
2014-02-01 20:35:38 +01:00 |
|
robert-bor
|
47383f1dbc
|
Issue #1 added documentation for removing overlapping matches
|
2014-01-31 16:26:36 +01:00 |
|
robert-bor
|
4c8ea8ba57
|
Issue #1 fixed bug in compareTo method of Interval. Problem was the compareTo only worked on start position, whereas it should also work on end position.
|
2014-01-31 16:16:21 +01:00 |
|
robert-bor
|
1785a554f3
|
Issue #1 remove overlapping intervals. Resolution rule: longer matches over smaller ones, left-most over right-most
|
2014-01-31 14:56:11 +01:00 |
|
robert-bor
|
922da26965
|
Emit now also contains the start position of the found keyword
|
2014-01-30 09:36:53 +01:00 |
|
robert-bor
|
b689165eb9
|
FTP link does not work. Fixed
|
2014-01-30 09:12:33 +01:00 |
|
robert-bor
|
d9c81428c0
|
updated docs
|
2014-01-30 08:22:02 +01:00 |
|
robert-bor
|
cab2404293
|
v0.1.0
|
2014-01-30 06:49:49 +01:00 |
|
robert-bor
|
10b59654b7
|
More explanation on the usage
|
2014-01-29 21:58:22 +01:00 |
|
robert-bor
|
e1da9bd274
|
added explanation on the algorithm
|
2014-01-29 21:51:14 +01:00 |
|
robert-bor
|
d140afc0da
|
first setup for the Aho-Corasick algorithm
|
2014-01-29 21:27:46 +01:00 |
|
Robert Bor
|
b6e257146a
|
Initial commit
|
2014-01-29 12:17:12 -08:00 |
|