| Class | Description |
|---|---|
| ArticleSelector |
This class extracts all the articles that belong to a given category in
Wikipedia
|
| ArticleTextExtractor |
This class provides methods to load a list of Wikipedia articles IDs and
preprocess them.
|
| BackArticleSelector |
This class extracts all the articles belonging to a given category in
Wikipedia
|
| CategoryDepth |
Class that automatises the process of selecting how deep within the category tree
one must go to extract articles from a given domain.
|
| CategoryExplorer |
The
CategoryExplorer class is used to explore the categories of
Wikipedia. |
| CategoryExtractor |
This class extracts all the subcategories from an indicated category in
Wikipedia
TODO build junit
|
| CategoryNameStats |
This class computes the percentage of categories that are claimed to belong
to a concrete domain from a category tree.
|
| CategoryTreeNode |
This class stores all the relevant information about a classified
category.
|
| DomainKeywords |
This class gets the most common terms in the articles belonging to, at
least, one category of a given domain.
|
| DomainVocabulary |
A
DomainVocabulary instance is used to store a set of terms with its
frequency. |
| GroupOfCategories |
A
GroupOfCategories instance contains the scored categories from
Wikipedia which are related to other called root category. |
| GroupOfCategories.ScoredCategory |
The
ScoredCategory class enriches the
de.tudarmstadt.ukp.wikipedia.api.Category objects providing the
following information:
Parent: The first category which allows access to this one. |
| Xecutor |
This class intends to join together all the necessary process to extract the
articles related to a given category.
|
| XecutorTheFirst |
This class intends to join together all the necessary process to extract the
articles related to a given category.
|
| XecutorTheSecond |
This class intends to join together all the necessary process to extract the
articles related to a given category.
|