Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Daniel Cruz Cavalieri
Format:	Artículo científico
Language:	en
Published:	Sociedad Española para el Procesamiento del Lenguaje Natural 2011
Subjects:	Computación of Part optimization speech clustering vector space model
Online Access:	https://www.redalyc.org/articulo.oa?id=515751747021
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866579187227164672
author	Daniel Cruz Cavalieri
author_facet	Daniel Cruz Cavalieri
contents	A Part-of-Speech Tag Clustering for a Word Prediction System in Portuguese Language Daniel Cruz Cavalieri Teodiano Freire Bastos Filho Mário Sarcinelli Filho Sira Elena Palazuelos Cagigas Javier Macías Guarasa José L. Martín Sánchez Computación of Part optimization speech clustering vector space model This paper presents an automatic method for reducing the part-of-speech tagset to be considered by a word prediction system in Portuguese. The method is based on a similarity measure applied to a association matrix, generated by employing a odds ratio association measure in the bigrams of parts-of-speech (bipos) probability distribution in a corpus. The results reported in this paper show that using the proposed clustering method with an appropriate threshold value over the similarity has the potential to improve the word prediction system. Moreover, it makes possible to use new clustering techniques such as fuzzy clustering. The results also show that when using a word prediction system based on a syntactic model, the clustering cannot be performed between the major syntactic categories, even if the clusters generated seem correct from a linguistic point of view. 2011 artículo científico 1135-5948 https://www.redalyc.org/articulo.oa?id=515751747021 en http://www.redalyc.org/revista.oa?id=5157 Procesamiento del Lenguaje Natural application/pdf Sociedad Española para el Procesamiento del Lenguaje Natural Procesamiento del Lenguaje Natural (España) Num.47
format	Artículo científico
id	redalyc_515751747021
language	en
publishDate	2011
publisher	Sociedad Española para el Procesamiento del Lenguaje Natural
spellingShingle	A Part-of-Speech Tag Clustering for a Word Prediction System in Portuguese Language Daniel Cruz Cavalieri Computación of Part optimization speech clustering vector space model A Part-of-Speech Tag Clustering for a Word Prediction System in Portuguese Language Daniel Cruz Cavalieri Teodiano Freire Bastos Filho Mário Sarcinelli Filho Sira Elena Palazuelos Cagigas Javier Macías Guarasa José L. Martín Sánchez Computación of Part optimization speech clustering vector space model This paper presents an automatic method for reducing the part-of-speech tagset to be considered by a word prediction system in Portuguese. The method is based on a similarity measure applied to a association matrix, generated by employing a odds ratio association measure in the bigrams of parts-of-speech (bipos) probability distribution in a corpus. The results reported in this paper show that using the proposed clustering method with an appropriate threshold value over the similarity has the potential to improve the word prediction system. Moreover, it makes possible to use new clustering techniques such as fuzzy clustering. The results also show that when using a word prediction system based on a syntactic model, the clustering cannot be performed between the major syntactic categories, even if the clusters generated seem correct from a linguistic point of view. 2011 artículo científico 1135-5948 https://www.redalyc.org/articulo.oa?id=515751747021 en http://www.redalyc.org/revista.oa?id=5157 Procesamiento del Lenguaje Natural application/pdf Sociedad Española para el Procesamiento del Lenguaje Natural Procesamiento del Lenguaje Natural (España) Num.47
title	A Part-of-Speech Tag Clustering for a Word Prediction System in Portuguese Language
topic	Computación of Part optimization speech clustering vector space model
url	https://www.redalyc.org/articulo.oa?id=515751747021

Similar Items