ePrints Repository University of Melbourne

Querying and Updating Treebanks: A Critical Survey and Requirements Analysis

Lai, C. and Bird, S. G. (2004) Querying and Updating Treebanks: A Critical Survey and Requirements Analysis. In Proceedings of the Australasian Language Technology Workshop, pp. 139-146, Macquarie University, Sydney.


Abstract

Language technology makes extensive use of hierarchically annotated text and speech data. These databases are stored in flat files and manipulated using corpus-specific query tools or special-purpose scripts. While the size of these databases and the range of applications have grown rapidly in recent years, neither method for managing the data has led to reusable, scalable software. The formal properties of the query languages are not well understood. Hence established methods for indexing tree data and optimizing tree queries cannot be employed. We analyze a range of existing linguistic query languages, and adduce a set of requirements for a reusable, scalable linguistic query language.

Keywords: querying, treebanks, corpora
Subjects: Engineering > Department of Computer Science and Software Engineering; Arts > Department of Linguistics and Applied Linguistics
ID Code: 774
Deposited By: Lai, Catherine (288)
Deposited On: 16 December 2004
Alternative Locations: http://www.alta.asn.au/events/altw2004/publication/04-22.pdf
Item Type: Conference Paper