public class Document2Query
extends java.lang.Object
| Constructor and Description |
|---|
Document2Query() |
Document2Query(java.util.Locale lan) |
| Modifier and Type | Method and Description |
|---|---|
java.lang.String |
doc2WeightQuery(java.lang.String file)
Generates a query in which tokens' relevance depend on their frequency
|
java.lang.String |
file2FlatQuery(java.lang.String file)
Generates a query in which every token has the same relevance
TODO why am I using the same tokenizer for every language???
|
static java.lang.String |
flatQuery(java.lang.String[] tokens)
Creates a query considering all the tokens (i.e. some words could be
repeated)
|
Analyzer |
getAnalyzer() |
java.lang.String |
str2FlatQuery(Analyzer analyzer,
java.lang.String text)
Generates a query in which every token has the same relevance
|
java.lang.String |
str2FlatQuery(java.lang.String text)
Generates a query in which every token has the same relevance
|
static java.lang.String |
vocQuery(java.lang.String[] tokens)
Creates a query considering only the vocabulary (i.e. types)
|
static java.lang.String |
weightQuery(java.lang.String[] tokens)
Creates a query where the relevance of a type depends on its
frequency (i.e. if a token w appears 4 times, it will appear
as w^4)
|
public Document2Query()
public Document2Query(java.util.Locale lan)
public Analyzer getAnalyzer()
public java.lang.String file2FlatQuery(java.lang.String file)
file - java.io.IOExceptionpublic java.lang.String str2FlatQuery(Analyzer analyzer,
java.lang.String text)
analyzer - text - string representation of the querypublic java.lang.String str2FlatQuery(java.lang.String text)
analyzer - text - string representation of the querypublic java.lang.String doc2WeightQuery(java.lang.String file)
file - public static java.lang.String vocQuery(java.lang.String[] tokens)
tokens - public static java.lang.String flatQuery(java.lang.String[] tokens)
tokens - public static java.lang.String weightQuery(java.lang.String[] tokens)
tokens -