java.lang.Object
  tp.loader.DataCleaner
public class DataCleaner
Class for cleaning loaded text data.
Constructor Summary |
---|
DataCleaner() |
Method Summary | |
---|---|
java.lang.String | cleanContentFile(java.lang.String content): Removes certain special symbols that occur in the contents of Reuters documents. |
java.lang.String | cleanPlainText(java.lang.String str): Removes certain special symbols that occur in plain text documents. |
java.lang.String | cleanSGMLReuters(java.lang.String str): Removes certain special symbols that occur in the Reuters dataset. |
java.lang.String | cleanTopic(java.lang.String topic): Removes certain special symbols that occur in the topics of Reuters documents. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public DataCleaner()
Method Detail |
---|
public java.lang.String cleanSGMLReuters(java.lang.String str)
Parameters: str - original string
public java.lang.String cleanTopic(java.lang.String topic)
Parameters: topic - original string
public java.lang.String cleanContentFile(java.lang.String content)
Parameters: content - original string
public java.lang.String cleanPlainText(java.lang.String str)
Parameters: str - original string
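This page does not say which symbols each method removes, so the following is only a minimal sketch of what string-in, string-out cleaners of this shape might look like. The class name `DataCleanerSketch` is hypothetical and stands in for `tp.loader.DataCleaner`; the choice of ASCII control characters and SGML-style character references as the "special symbols" is an assumption, not documented behaviour.

```java
import java.util.regex.Pattern;

/**
 * Hypothetical stand-in for tp.loader.DataCleaner.
 * This is NOT the library's implementation; the removed symbols are assumptions.
 */
public class DataCleanerSketch {
    // Assumption: "special symbols" includes ASCII control characters
    // other than tab, line feed and carriage return.
    private static final Pattern CONTROL_CHARS =
            Pattern.compile("[\\x00-\\x08\\x0B\\x0C\\x0E-\\x1F\\x7F]");

    // Assumption: SGML-style references such as &#3; or &lt; count as noise.
    private static final Pattern SGML_ENTITY = Pattern.compile("&#?\\w+;");

    // Runs of whitespace left behind by the removals are collapsed to one space.
    private static final Pattern WHITESPACE_RUN = Pattern.compile("\\s+");

    /** Strips control characters, then normalises whitespace. */
    public String cleanPlainText(String str) {
        String noControl = CONTROL_CHARS.matcher(str).replaceAll("");
        return WHITESPACE_RUN.matcher(noControl).replaceAll(" ").trim();
    }

    /** Replaces SGML-style references with a space, then applies the plain-text cleaning. */
    public String cleanSGMLReuters(String str) {
        String noEntities = SGML_ENTITY.matcher(str).replaceAll(" ");
        return cleanPlainText(noEntities);
    }

    public static void main(String[] args) {
        DataCleanerSketch cleaner = new DataCleanerSketch();
        String raw = "Oil prices&#3; rose  &lt;sharply&gt;\n today";
        // prints "Oil prices rose sharply today"
        System.out.println(cleaner.cleanSGMLReuters(raw));
    }
}
```

With the real class, the call pattern would presumably be the same: construct a `DataCleaner` with the no-argument constructor and pass each loaded string through the method matching its source (SGML record, topic, content file, or plain text).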