|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectca.uottawa.balie.CharacterNGram
public class CharacterNGram
Methods to collect and handle character n-gram. A character n-gram is a sequence of n chars. For instance, in the word WORD, there are 3 bigrams: WO, OR and RD.
Constructor Summary | |
---|---|
CharacterNGram(int pi_NGramSize)
Creates a new n-gram handler. |
Method Summary | |
---|---|
void |
Feed(java.lang.String pi_InString)
Feed a text to the N-gram handler. |
java.lang.Double[] |
Instance(java.lang.String[] pi_RefNGrams)
Creates an instance made of n-gram relative frequencies for a given set of reference n-grams. |
java.util.Hashtable<java.lang.String,java.lang.Integer> |
NGramFrequency()
Get the table that associates each n-gram to its frequency. |
java.util.Hashtable<java.lang.String,java.lang.Integer> |
UNIGramFrequency()
Get the Unigram table |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CharacterNGram(int pi_NGramSize)
pi_NGramSize
- The value of N (must be at least 2, to extract bigrams)Method Detail |
---|
public void Feed(java.lang.String pi_InString)
pi_InString
- The text to split in n-gramspublic java.lang.Double[] Instance(java.lang.String[] pi_RefNGrams)
pi_RefNGrams
- Reference n-grams for which the statistics are required.
public java.util.Hashtable<java.lang.String,java.lang.Integer> NGramFrequency()
public java.util.Hashtable<java.lang.String,java.lang.Integer> UNIGramFrequency()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |