public class SimilarityESAdocs extends SimilarityESA
cat.lump.ir.sim.ml.esa.esa.SimilarityESA
esaGen, objectA, objectB, overrideObjects, textsA, textsB
esaVectorsA, esaVectorsB, log
Constructor and Description |
---|
SimilarityESAdocs(java.io.File documentsApath,
java.io.File documentsBpath,
java.lang.String indexPath,
java.lang.String language,
boolean overrideObjects)
Includes the path to the documents in A and B.
|
Modifier and Type | Method and Description |
---|---|
protected EsaVectors |
computeVectors(java.io.File documentsPath,
java.lang.String object,
java.lang.String set)
Computes the vectors for the texts in the given set.
|
protected void |
setDocumentsPath(java.io.File documentsApath,
java.io.File documentsBpath)
A method that loads the texts in collections A and B.
|
computeVectorsA, computeVectorsB, objectExists, saveObject, setObjects
computePairwiseSimilarities, computeSimilarities, computeSimilarity, displaySimilarities, documentsExist, exitError, getPairwiseSimilarities, getSimilarities, getSimilaritiesMatrix, getSimilarity, getSimilarity
public SimilarityESAdocs(java.io.File documentsApath, java.io.File documentsBpath, java.lang.String indexPath, java.lang.String language, boolean overrideObjects)
documentsApath
- path to documents AdocumentsBpath
- path to documents BindexPath
- path to Lucene's indexlanguage
- languageoverrideObjects
- whether objects should be overriddenprotected void setDocumentsPath(java.io.File documentsApath, java.io.File documentsBpath)
SimilarityESA
setDocumentsPath
in class SimilarityESA
protected EsaVectors computeVectors(java.io.File documentsPath, java.lang.String object, java.lang.String set) throws java.lang.ClassNotFoundException, java.io.IOException
computeVectors
in class SimilarityESA
documentsPath
- path to the documentsobject
- name of the (previously generated object)set
- whether we are processing A or Bjava.io.IOException
java.lang.ClassNotFoundException