14 Commits

Author SHA1 Message Date
robert-bor
bcde097070 Issue #4 Trie.tokenize() available. It returns a list of tokens. A token can be either a fragment (unmatched text) or a match. If it is a match, the original emit can be queried. 2014-02-01 22:01:15 +01:00
robert-bor
ae20429936 Issue #3 added case insensitivity when matching keywords 2014-02-01 21:04:53 +01:00
robert-bor
cb44a6bff2 Issue #2 implemented whole word matching 2014-02-01 20:35:38 +01:00
robert-bor
47383f1dbc Issue #1 added documentation for removing overlapping matches 2014-01-31 16:26:36 +01:00
robert-bor
4c8ea8ba57 Issue #1 fixed bug in compareTo method of Interval. Problem was the compareTo only worked on start position, whereas it should also work on end position. 2014-01-31 16:16:21 +01:00
robert-bor
1785a554f3 Issue #1 remove overlapping intervals. Resolution rule: longer matches over smaller ones, left-most over right-most 2014-01-31 14:56:11 +01:00
robert-bor
922da26965 Emit now also contains the start position of the found keyword 2014-01-30 09:36:53 +01:00
robert-bor
b689165eb9 FTP link does not work. Fixed 2014-01-30 09:12:33 +01:00
robert-bor
d9c81428c0 updated docs 2014-01-30 08:22:02 +01:00
robert-bor
cab2404293 v0.1.0 v0.1.0 2014-01-30 06:49:49 +01:00
robert-bor
10b59654b7 More explanation on the usage 2014-01-29 21:58:22 +01:00
robert-bor
e1da9bd274 added explanation on the algorithm 2014-01-29 21:51:14 +01:00
robert-bor
d140afc0da first setup for the Aho-Corasick algorithm 2014-01-29 21:27:46 +01:00
Robert Bor
b6e257146a Initial commit 2014-01-29 12:17:12 -08:00