|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectca.uottawa.balie.CharacterNGram
public class CharacterNGram
Methods to collect and handle character n-gram. A character n-gram is a sequence of n chars. For instance, in the word WORD, there are 3 bigrams: WO, OR and RD.
| Constructor Summary | |
|---|---|
CharacterNGram(int pi_NGramSize)
Creates a new n-gram handler. |
|
| Method Summary | |
|---|---|
void |
Feed(java.lang.String pi_InString)
Feed a text to the N-gram handler. |
java.lang.Double[] |
Instance(java.lang.String[] pi_RefNGrams)
Creates an instance made of n-gram relative frequencies for a given set of reference n-grams. |
java.util.Hashtable<java.lang.String,java.lang.Integer> |
NGramFrequency()
Get the table that associates each n-gram to its frequency. |
java.util.Hashtable<java.lang.String,java.lang.Integer> |
UNIGramFrequency()
Get the Unigram table |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public CharacterNGram(int pi_NGramSize)
pi_NGramSize - The value of N (must be at least 2, to extract bigrams)| Method Detail |
|---|
public void Feed(java.lang.String pi_InString)
pi_InString - The text to split in n-gramspublic java.lang.Double[] Instance(java.lang.String[] pi_RefNGrams)
pi_RefNGrams - Reference n-grams for which the statistics are required.
public java.util.Hashtable<java.lang.String,java.lang.Integer> NGramFrequency()
public java.util.Hashtable<java.lang.String,java.lang.Integer> UNIGramFrequency()
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||