NEGRAIMPORT

Section: User Commands (1)
Updated: September 23, 2006
Index Return to Main Contents
 

NAME

negraimport - A tool to convert NEGRA format to Emdros MQL  

SYNOPSIS

negraimport [ options ] [input_filename ...]
 

DESCRIPTION

negraimport is a command-line tool to convert NEGRA format to Emdros MQL for later importing into Emdros.

 

OPTIONS

negraimport supports the following command-line switches:
--help
show help, then quit
-V , --version
show version, then quit
-b , --backend backend
set database backend to `backend'. Valid values are: For PostgreSQL: "p", "pg", "postgres", and "postgresql". For MySQL: "m", "my", and "mysql". For SQLite 2.X.X: "s", "l", "lt", "sqlite", and "sqlite2". For SQLite 3.X.X: "3", "s3", "lt3", and "sqlite3".
-s , --schema
show MQL schema on stdout, then quit (can be used with -d)
-d , --dbname dbname
set database name. If used with -s, the string "CREATE DATABASE
-o , --output filename
dump to file filename. The default is "-", which means "standard output".
--start-monad monad
The start monad to use. Must be >= 1. The default is 1.
--start-id_d id_d
The start id_d to use. Must be >= 1. The default is 1.
 

OPERATION

negraimport reads corpus data in NEGRA format and converts the data to MQL statements for later importing into Emdros.

The filenames given after the options on the command line are interpreted as if each of them contains one NEGRA document, each containing one or more trees. If no filenames are given, the input is read from stdin.

If no -o switch is given, the output is printed on stdout.

If an error occurs, the string "FAILURE" or the string "ERROR" is printed on stderr, along with an error message.

If no error occurs, a string of the form "SUCCESS: next_monad is X next_id_d is Y" is printed on stderr, where X and Y are positive integers denoting the next monad and the next id_d to be used by the next invocation of the program, respectively. This is useful if you've got several directories' worth of documents to import.

 

SCHEMA

The schema can be seen by giving the program -s switch, with an optional -d switch.

A "Sentence" corresponds to one top-level sentence.

A "Phrase" is a nonterminal in the tree which is not a top-level sentence. Note that this encompasses clauses ("S") as well as phrases. Its "parent" feature points to its parent. Its parents feature feature gives a list of id_ds of all its parents, right up to the "Sentence" node. Its "tag", "morphtag", and "edge" features are what you think they are. There are five pairs of features "secedgeX" and "secparentX", for X in the range 1 to 5 (inclusive). These features show secondary edges.

A "Word" is a terminal node in the tree (i.e., a token). Its features are largely the same as for "Phrase", except that it has a "surface" feature which gives the surface of the token, as well as a "postag" instead of a "tag".

 

RETURN VALUES

0 Success
1 Wrong usage
2 Connection to backend server could not be established
3 An exception occurred (the type is printed on stderr)
4 Could not open file
5 Database error
6 Compiler error (internal error)
 

AUTHORS

Copyright 2001-2006 by Ulrik Petersen (ulrikp@users.sourceforge.net). Note that this software is distributed under the GNU GPL. See the sources for details.


 

Index

NAME
SYNOPSIS
DESCRIPTION
OPTIONS
OPERATION
SCHEMA
RETURN VALUES
AUTHORS

This document was created by man2html, using the manual pages.
Time: 22:29:32 GMT, December 01, 2006