| Nome: | Descrição: | Tamanho: | Formato: | |
|---|---|---|---|---|
| 867.49 KB | Adobe PDF |
Orientador(es)
Resumo(s)
This paper describes the application of the NLPyPort pipeline to Named Entity Recognition (NER) and Relation Extraction in Portuguese, more precisely in the scope of the IberLEF-2019 evaluation task on the topic. NER was tackled with CRF, based on several features, and trained in the HAREM collection, but results were low. This was partly caused by an issue on the submitted model, which had been trained in lowercase text, but, apparently, also due to the training data used, which highlights the different natures of HAREM, the source of the majority of the testing corpus, and SIGARRA. Relations were extracted with a set of rules bootstrapped from the examples provided by the organisation. Despite an F1-score of 0.72, we were the only participants in this task. We also express our doubts concerning the utility of the extracted relations.
Descrição
Palavras-chave
NLP NER CRF Relation Extraction PoS Tagging Pattern Based
