pitt.search.lucene
Class IndexBilingualFiles

java.lang.Object
  extended by pitt.search.lucene.IndexBilingualFiles

public class IndexBilingualFiles
extends java.lang.Object

Index pairs of bilingual texts in a parallel corpus. See http://code.google.com/p/semanticvectors/wiki/BilingualModels for more thorough documentation of preparation of corpora and creation of models.


Constructor Summary
IndexBilingualFiles(java.lang.String lang1, java.lang.String lang2)
           
 
Method Summary
protected  org.apache.lucene.document.Document fileBilingualDocument(java.io.File file1, java.io.File file2)
           
static void main(java.lang.String[] args)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

IndexBilingualFiles

public IndexBilingualFiles(java.lang.String lang1,
                           java.lang.String lang2)
Method Detail

fileBilingualDocument

protected org.apache.lucene.document.Document fileBilingualDocument(java.io.File file1,
                                                                    java.io.File file2)
                                                             throws java.io.IOException
Throws:
java.io.IOException

main

public static void main(java.lang.String[] args)