|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objecttp.builder.BuilderNGram
public class BuilderNGram
Class, which makes representation of text data by N-grams.
Constructor Summary | |
---|---|
BuilderNGram(DocsRepresentDB.RepresentationModel model,
DocsRepresentDB.Preprocessing preprocessing)
Constructor of the class |
Method Summary | |
---|---|
DocsRepresentDB |
buildRepresentation(DocumentsDatabase database)
Creation of word representation (if the user didn't set a depth parameter....) |
DocsRepresentDB |
buildRepresentation(DocumentsDatabase database,
int depth)
Gets text documents from a database and creates their N-gram representation. |
protected java.lang.Void |
doInBackground()
|
void |
setMinNGramOccur(int min_ngram_occur)
Sets a parameter of minimum N-gram occurence |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface tp.builder.BuilderInterface |
---|
addPropertyChangeListener |
Constructor Detail |
---|
public BuilderNGram(DocsRepresentDB.RepresentationModel model, DocsRepresentDB.Preprocessing preprocessing)
model
- model used for representation (binary, TF or TF/IDF)preprocessing
- preprocessing options selected by the userMethod Detail |
---|
public DocsRepresentDB buildRepresentation(DocumentsDatabase database, int depth)
buildRepresentation
in interface BuilderInterface
database
- input database of text documentsdepth
- the lentgh of N-grams
public void setMinNGramOccur(int min_ngram_occur)
setMinNGramOccur
in interface BuilderInterface
min_ngram_occur
- minimum N-gram occurencepublic DocsRepresentDB buildRepresentation(DocumentsDatabase database)
buildRepresentation
in interface BuilderInterface
database
- database of all documents
protected java.lang.Void doInBackground() throws java.lang.Exception
java.lang.Exception
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |