LemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portuguese

File: OASIcs.SLATE.2014.267.pdf (446.2 KB, Adobe PDF)

Abstract

Although lemmatization is a common subtask in many natural language processing tasks, few truly cross-platform lemmatization tools are available for Portuguese, particularly for integration into projects developed in Java. To address this gap, we developed a lemmatizer, initially for our own use, which we have since decided to make publicly available. The lemmatizer, presented in this document, achieves an overall accuracy above 98% when evaluated against a manually revised corpus.

Keywords

lemmatization, normalization, rules, lexicon
