Rodrigues, RicardoOliveira, Hugo GonçaloGomes, Paulo2026-01-272026-01-272014http://hdl.handle.net/10400.26/61226Although lemmatization is a very common subtask in many natural language processing tasks, there is a lack of available true cross-platform lemmatization tools specifically targeted for Portuguese, namely for integration in projects developed in Java. To address this issue, we have developed a lemmatizer, initially just for our own use, but which we have decided to make publicly available. The lemmatizer, presented in this document, yields an overall accuracy over 98% when compared against a manually revised corpus.englemmatizationnormalizationruleslexiconLemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portugueseconference object10.4230/oasics.slate.2014