Publication
LemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portuguese
| Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Rodrigues, Ricardo | |
| dc.contributor.author | Oliveira, Hugo Gonçalo | |
| dc.contributor.author | Gomes, Paulo | |
| dc.date.accessioned | 2026-01-27T14:29:45Z | |
| dc.date.available | 2026-01-27T14:29:45Z | |
| dc.date.issued | 2014 | |
| dc.description.abstract | Although lemmatization is a very common subtask in many natural language processing tasks, there is a lack of available true cross-platform lemmatization tools specifically targeted for Portuguese, namely for integration in projects developed in Java. To address this issue, we have developed a lemmatizer, initially just for our own use, but which we have decided to make publicly available. The lemmatizer, presented in this document, yields an overall accuracy over 98% when compared against a manually revised corpus. | eng |
| dc.identifier.doi | 10.4230/oasics.slate.2014 | |
| dc.identifier.uri | http://hdl.handle.net/10400.26/61226 | |
| dc.language.iso | eng | |
| dc.peerreviewed | n/a | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | lemmatization | |
| dc.subject | normalization | |
| dc.subject | rules | |
| dc.subject | lexicon | |
| dc.title | LemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portuguese | eng |
| dc.type | conference object | |
| dspace.entity.type | Publication | |
| oaire.citation.conferenceDate | 2014 | |
| oaire.citation.endPage | 274 | |
| oaire.citation.startPage | 267 | |
| oaire.citation.title | 3rd Symposium on Languages, Applications and Technologies (SLATE’14) | |
| oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |
| person.familyName | Rodrigues | |
| person.givenName | Ricardo | |
| person.identifier.ciencia-id | D31C-FB4A-FEAA | |
| person.identifier.orcid | 0000-0002-6262-7920 | |
| relation.isAuthorOfPublication | c64ccf7c-eca2-43cf-a4a2-78e684499c00 | |
| relation.isAuthorOfPublication.latestForDiscovery | c64ccf7c-eca2-43cf-a4a2-78e684499c00 |
