Sardar Jaf
Towards the Development of a Hybrid Parser for Natural Languages
Jaf, Sardar; Allan, Ramsay; Jones, Andrew V.; Ng, Nicholas
Authors
Ramsay Allan
Andrew V. Jones
Nicholas Ng
Abstract
In order to understand natural languages, we have to be able to determine the relations between words, in other words we have to be able to 'parse' the input text. This is a difficult task, especially for Arabic, which has a number of properties that make it particularly difficult to handle. There are two approaches to parsing natural languages: grammar-driven and data-driven. Each of these approaches poses its own set of problems, which we discuss in this paper. The goal of our work is to produce a hybrid parser, which retains the advantages of the data-driven approach but is guided by grammar rules in order to produce more accurate output. This work consists of two stages: the first stage is to develop a baseline data-driven parser, which is guided by a machine learning algorithm for establishing dependency relations between words. The second stage is to integrate grammar rules into the baseline parser. In this paper, we describe the first stage of our work, which is now implemented, and a number of experiments that have been conducted on this parser. We also discuss the result of these experiments and highlight the different factors that are affecting parsing speed and the correctness of the parser results.
Citation
Jaf, S., Allan, R., Jones, A. V., & Ng, N. (2013, September). Towards the Development of a Hybrid Parser for Natural Languages. Presented at 2013 Imperial College Computing Student Workshop., London, United Kingdom
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2013 Imperial College Computing Student Workshop. |
Start Date | Sep 26, 2013 |
End Date | Sep 27, 2013 |
Publication Date | Sep 1, 2013 |
Deposit Date | Feb 12, 2016 |
Publicly Available Date | Feb 22, 2016 |
Pages | 49-56 |
Series Title | OASIcs - OpenAccess Series in Informatics |
Series Number | 35 |
Book Title | 2013 Imperial College Computing Student Workshop (ICCSW’13). |
DOI | https://doi.org/10.4230/oasics.iccsw.2013.49 |
Keywords | Hybrid Parsing, Arabic Parsing, Grammar-Driven Parser, Data-Driven Parser, Natural Language Processing. |
Public URL | https://durham-repository.worktribe.com/output/1151405 |
Files
Published Conference Proceeding
(548 Kb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Combining Machine Learning Classifiers for the Task of Arabic Characters Recognition
(2018)
Journal Article
Security Threats to Critical Infrastructure: The Human Factor
(2018)
Journal Article
BotDet: A System for Real Time Botnet Command and Control Traffic Detection
(2018)
Journal Article
CAM: A Combined Attention Model for Natural Language Inference
(2018)
Presentation / Conference Contribution
An Exploration of Dropout with RNNs for Natural Language Inference
(2018)
Presentation / Conference Contribution