public class Document2Query
extends java.lang.Object
Constructor and Description |
---|
Document2Query() |
Document2Query(java.util.Locale lan) |
Modifier and Type | Method and Description |
---|---|
java.lang.String |
doc2WeightQuery(java.lang.String file)
Generates a query in which tokens' relevance depend on their frequency
|
java.lang.String |
file2FlatQuery(java.lang.String file)
Generates a query in which every token has the same relevance
TODO why am I using the same tokenizer for every language???
|
static java.lang.String |
flatQuery(java.lang.String[] tokens)
Creates a query considering all the tokens (i.e. some words could be
repeated)
|
Analyzer |
getAnalyzer() |
java.lang.String |
str2FlatQuery(Analyzer analyzer,
java.lang.String text)
Generates a query in which every token has the same relevance
|
java.lang.String |
str2FlatQuery(java.lang.String text)
Generates a query in which every token has the same relevance
|
static java.lang.String |
vocQuery(java.lang.String[] tokens)
Creates a query considering only the vocabulary (i.e. types)
|
static java.lang.String |
weightQuery(java.lang.String[] tokens)
Creates a query where the relevance of a type depends on its
frequency (i.e. if a token w appears 4 times, it will appear
as w^4)
|
public Document2Query()
public Document2Query(java.util.Locale lan)
public Analyzer getAnalyzer()
public java.lang.String file2FlatQuery(java.lang.String file)
file
- java.io.IOException
public java.lang.String str2FlatQuery(Analyzer analyzer, java.lang.String text)
analyzer
- text
- string representation of the querypublic java.lang.String str2FlatQuery(java.lang.String text)
analyzer
- text
- string representation of the querypublic java.lang.String doc2WeightQuery(java.lang.String file)
file
- public static java.lang.String vocQuery(java.lang.String[] tokens)
tokens
- public static java.lang.String flatQuery(java.lang.String[] tokens)
tokens
- public static java.lang.String weightQuery(java.lang.String[] tokens)
tokens
-