home › event - tbl-improved non-deterministic segmentation and tagging for a chinese parser

EVENT:

TBL-improved non-deterministic segmentation and tagging for a Chinese parser
Conferences & Talks

12th Conference of the European Chapter of the Association for Computational Linguistics (EACL-09)

28 March 2009
Athens, Greece

 

description

Although much progress has been made recently in word segmentation and POS tagging for Chinese, the output of current state-of-the-art systems is too inaccurate to allow for syntactic analysis based on it. We present an experiment in improving the output of an off-the-shelf module that performs segmentation and tagging, the tokenizer-tagger from Beijing University (PKU). Our approach is based on transformation-based learning (TBL). Unlike in other TBL-based approaches to the problem, however, both obligatory and optional transformation rules are learned, so that the final system can output multiple segmentation and POS tagging analyses for a given input.

 

upcoming events   view all 

Automated Data Integration
Eric Huang, Author, Saigopal Nelaturi
27 October 2014
Conferences & Talks  

Global Competitiveness: The Role of Innovation and Productivity
Stephen Hoover, CEO, PARC
27 October 2014 | Toronto, Canada
Conferences & Talks  

The Internet of Everything
Stephen Hoover, CEO, PARC
28 October 2014 | Toronto, Canada
Conferences & Talks  

Open Forum: Cities and the Digital Frontier
Mike Steep
30 October 2014
George E. Pake Auditorium, PARC | Palo Alto, CA

Churchill Club