Logo do repositório
 
A carregar...
Miniatura
Publicação

NLPyPort: Named Entity Recognition with CRF and Rule-Based Relation Extraction

Utilize este identificador para referenciar este registo.
Nome:Descrição:Tamanho:Formato: 
NER_Portuguese_paper_7.pdf867.49 KBAdobe PDF Ver/Abrir

Orientador(es)

Resumo(s)

This paper describes the application of the NLPyPort pipeline to Named Entity Recognition (NER) and Relation Extraction in Portuguese, more precisely in the scope of the IberLEF-2019 evaluation task on the topic. NER was tackled with CRF, based on several features, and trained in the HAREM collection, but results were low. This was partly caused by an issue on the submitted model, which had been trained in lowercase text, but, apparently, also due to the training data used, which highlights the different natures of HAREM, the source of the majority of the testing corpus, and SIGARRA. Relations were extracted with a set of rules bootstrapped from the examples provided by the organisation. Despite an F1-score of 0.72, we were the only participants in this task. We also express our doubts concerning the utility of the extracted relations.

Descrição

Palavras-chave

NLP NER CRF Relation Extraction PoS Tagging Pattern Based

Contexto Educativo

Citação

Projetos de investigação

Unidades organizacionais

Fascículo