LengthModel (WikiTailor)

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

java.lang.Object
- cat.lump.ir.sim.cl.len.LengthModel

```
public class LengthModel
extends java.lang.Object
```
A class to estimate length models for a language pair. The program is divided in two:
- Parameters learning. A parallel collection is used to estimate the parameters of the Gaussian distribution that expresses the expected length of a texts's translation from one language to another.
- Quality estimation. The parameters for the language pair is known beforehand and it is used to estimate the length factor of a pair of texts (potential translations).
The model was originally proposed in:
Pouliquen, Steinberger, and Ignat. Automatic Identification of Document Translations in Large Multilingual Document Collections. In: Proceedings of RANLP-2003, pp. 401-408. Borovets, Bulgaria, 2003.

It can be used as a feature for machine translation quality estimation.

It has been used for plagiarism detection as well. The definition implemented here, as well as some background is available at:
Potthast, Barrón-Cedeño, Stein, and Rosso. Cross-Language Plagiarism Detection. Language Resources and Evaluation (LRE), Special Issue on Plagiarism and Authorship Analysis 45(1), pp. 1-18. Springer Netherlands (2011)
The class includes a CLI that can be called as follows:
LEARNING
java -jar LengthModel.jar -l -s en.txt -t es.txt
ESTIMATION
java -jar LengthModel.jar -s en.txt -t es.test -m 1.17491349130 -d 0.34648875 -v
(The default operation is estimation.)
Author:

albarron

Constructor Summary

Constructors
Constructor and Description

LengthModel()

Method Summary

Methods
Modifier and Type	Method and Description
`static void`	`main(java.lang.String[] args)` Parses the input parameters and either learns a length model from a collection or estimates the corresponding values for a set of texts

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - LengthModel
```
public LengthModel()
```
- Method Detail
  - main
```
public static void main(java.lang.String[] args)
                 throws ParseException
```
    Parses the input parameters and either learns a length model from a collection or estimates the corresponding values for a set of texts
    
    Parameters:
    args -
    
    Throws:
    
    ParseException

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method