robert-bor
b5aaa51fdd
Optimize imports
...
Reformatted code (Java convention; tab is 4 spaces)
2016-11-30 09:10:21 +01:00
robert-bor
90d4645d49
Merge branch 'jdk1.7' of https://github.com/robert-bor/aho-corasick into jdk1.7
2016-11-30 09:06:52 +01:00
djarvis
503a0f1c76
Updated source base to leverage JDK 1.7 syntax. Added more final modifiers. Eliminated parameter modification inside method. Some formatting. Changed TrieBuilder to offer CharSequence instead of String; revised Trie accordingly. Removed some duplication. NetBeans automatically translated the code to use static imports (as per JDK 1.7 syntax).
2016-11-30 09:06:14 +01:00
Robert Bor
a89a6bac95
Merge pull request #46 from robert-bor/feature/badges-and-quality
...
Dev tooling enabled
2016-11-30 08:47:08 +01:00
djarvis
2b5c2d654d
Updated source base to leverage JDK 1.7 syntax. Added more final modifiers. Eliminated parameter modification inside method. Some formatting. Changed TrieBuilder to offer CharSequence instead of String; revised Trie accordingly. Removed some duplication. NetBeans automatically translated the code to use static imports (as per JDK 1.7 syntax).
2016-11-29 18:38:00 -08:00
robert-bor
a88ad48e05
Added Jacoco plugin
2016-11-29 20:00:59 +01:00
robert-bor
8ae9636201
4 spaces for code
...
Badges for Travis, Codacy, Codecov, Maven and Javadoc
Added Travis CI build instructions
2016-11-29 19:54:23 +01:00
Robert Bor
2f1ec8d041
Merge pull request #45 from robert-bor/simplifications
...
Added final modifier. Added helper methods for adding keywords using …
2016-11-29 19:15:54 +01:00
djarvis
f6a7103f5f
Added final modifier. Added helper methods for adding keywords using arrays and collections. Added test for large character strings. Simplified code for adding keywords. Renamed a few methods for consistency. Some code formatting. Updated unit tests with constant arrays, as a first step to reducing the duplication in the unit tests; migrated away from deprecated methods.
2016-11-28 21:20:57 -08:00
Robert Bor
8c422583b5
Merge pull request #44 from DaveJarvis/patch-1
...
Added source code comments.
2016-11-28 07:58:29 +01:00
Dave Jarvis
69781c0ae8
Added source code comments.
...
Added source code comments that should be useful for developers looking for more details.
2016-11-27 17:38:33 -08:00
robert-bor
5203efbbcb
Extra explanation on containsMatch
2015-09-23 20:56:24 +02:00
robert-bor
a46177415f
Updated README.md documentation
2015-09-23 08:39:13 +02:00
robert-bor
e365689391
v0.3.0
v0.3.0
2015-09-22 22:27:30 +02:00
robert-bor
dc27d6e3e9
pull #17 changes adopted to implement a whole word check on the entire keyword, including whitespaces.
2015-09-22 22:22:20 +02:00
robert-bor
76ae8222ea
issue #12 adopted the suggestion by yim1990 with a small change, so that the keyword emit is lowercased as well
2015-09-22 22:10:19 +02:00
robert-bor
e2c5334234
pull #14 implemented pull request by rripken for containsMatch and firstMatch
2015-09-22 22:02:30 +02:00
robert-bor
4633b1ba2a
Merge branch 'rripken-master' into feature/footprint-reduction
...
Conflicts:
src/main/java/org/ahocorasick/trie/Trie.java
src/test/java/org/ahocorasick/trie/TrieTest.java
2015-09-22 20:38:29 +02:00
robert-bor
30f003c5ae
Issue #18 fixed link to broken PDF, now points to http://cr.yp.to/bib/1975/aho.pdf
2015-09-22 20:22:24 +02:00
robert-bor
c18e030459
Merge branch 'SubOptimal-contrib' into feature/footprint-reduction
2015-09-22 20:18:52 +02:00
robert-bor
023c253c93
Issue #16 #20 #21 adopted pull request from remen which makes sure the failure states are constructed as part of the trie construction. This prevents the NPE which the referenced issues are complaining about.
2015-09-22 20:14:48 +02:00
robert-bor
fcefdfdaf9
Merge branch 'remen-master' into feature/footprint-reduction
...
Conflicts:
src/main/java/org/ahocorasick/trie/Trie.java
2015-09-22 20:06:04 +02:00
robert-bor
b85f8fc08f
Issue #22 added possibility to stop processing on generating at least one emit
2015-09-22 19:31:05 +02:00
robert-bor
4399e42b99
Issue #23 apply CharSequence to top-level parseText as well
2015-09-22 06:25:50 +02:00
robert-bor
055e13c298
Issue #23 removed the ParseConfiguration, rely on CharSequence instead
2015-09-21 22:03:59 +02:00
robert-bor
88799fb3da
Issue #23 added callback handler concept which omits the custom setting up of a list, but instead places direct calls to the handler. The handler are only supported on the lowest level of aho-corasick, ie no overlap, whole words and token support
...
Also added the possibility to pass a reader to the same level as above.
2015-09-21 21:09:26 +02:00
Petter Remen
9bce51e001
Issue #16 Use builder pattern to create Trie
...
Previously, there was a race condition in Trie#parseText since
it called constructFailureStates on first run without synchronization.
See https://github.com/robert-bor/aho-corasick/issues/16
This commit fixes this by using the builder pattern in order to
create a fully initialized Trie.
N.B. This changes the API
2015-07-03 12:29:31 +02:00
Frank Dietrich
285a74c37f
fix broken link to the white paper
2015-04-30 01:10:47 +02:00
ryan
d1478c7480
HashMap has better performance in my test cases.
2014-10-06 13:34:03 -07:00
ryan
a46e7dfe1d
Fixed formatting changes.
2014-10-06 11:02:01 -07:00
ryan
df503bae43
Added method and tests for a faster path to return the first match.
2014-10-06 10:52:35 -07:00
robert-bor
25eeef5168
v0.2.4 with bugfix #10
v0.2.4
2014-08-27 08:44:06 +02:00
robert-bor
2b125d2689
Issue #10 make sure that State emits a specific match only once
2014-08-27 08:42:46 +02:00
robert-bor
c96c57399a
update README.md
2014-08-26 10:11:05 +02:00
robert-bor
c572d234e1
v0.2.3 bugfix
v0.2.3
2014-08-26 10:05:18 +02:00
robert-bor
e8b5be0497
Issue #8 fixed Unicode issue by converting characters individually, not the entire search text
2014-08-26 09:50:15 +02:00
robert-bor
7431c74a7f
Issue #7 bugfix release v0.2.2
v0.2.2
2014-02-15 11:50:43 +01:00
robert-bor
d7421ead0f
Issue #7 Ignore keywords in the trie that are null or empty
2014-02-15 11:44:54 +01:00
robert-bor
a4fcfe8f20
Updated README for v0.2.1
2014-02-08 13:24:36 +01:00
robert-bor
4bd568836f
Releasing v0.2.1 bug release for issue #5
v0.2.1
2014-02-08 12:46:48 +01:00
robert-bor
31117d6a6e
Solved issue #5 by introducing a proper boundary check for words that are at the end of a String
2014-02-08 12:45:55 +01:00
robert-bor
1656e862df
v0.2.0
v0.2.0
2014-02-01 22:02:29 +01:00
robert-bor
bcde097070
Issue #4 Trie.tokenize() available. It returns a list of tokens. A token can be either a fragment (unmatched text) or a match. If it is a match, the original emit can be queried.
2014-02-01 22:01:15 +01:00
robert-bor
ae20429936
Issue #3 added case insensitivity when matching keywords
2014-02-01 21:04:53 +01:00
robert-bor
cb44a6bff2
Issue #2 implemented whole word matching
2014-02-01 20:35:38 +01:00
robert-bor
47383f1dbc
Issue #1 added documentation for removing overlapping matches
2014-01-31 16:26:36 +01:00
robert-bor
4c8ea8ba57
Issue #1 fixed bug in compareTo method of Interval. Problem was the compareTo only worked on start position, whereas it should also work on end position.
2014-01-31 16:16:21 +01:00
robert-bor
1785a554f3
Issue #1 remove overlapping intervals. Resolution rule: longer matches over smaller ones, left-most over right-most
2014-01-31 14:56:11 +01:00
robert-bor
922da26965
Emit now also contains the start position of the found keyword
2014-01-30 09:36:53 +01:00
robert-bor
b689165eb9
FTP link does not work. Fixed
2014-01-30 09:12:33 +01:00