Representation learning methods for early detection of pathological lesions in chest X-rays images.

Evangelista, Ricardo Gil Salgado

2024-12, Master thesis

dc.contributor.advisor	Guevara López, Miguel
dc.contributor.advisor	Santos, Catarina Ferreira
dc.contributor.author	Evangelista, Ricardo Gil Salgado
dc.date.accessioned	2025-02-14T10:35:26Z
dc.date.available	2025-02-14T10:35:26Z
dc.date.issued	2024-12
dc.description.abstract	Interpreting chest X-ray images is difficult, especially for physicians who do not specialize in radiology: the task is complex and, although physicians are trained to perform this analysis, a large group of diseases and pathologies manifest radiologically in the thoracic area. Emerging Artificial Intelligence techniques such as Computer Vision and Machine (Deep) Learning can help here, because together they make it possible to build mechanisms that assess these X-ray images automatically with a high degree of certainty. Although several papers, algorithms, methods, and even some off-the-shelf solutions for evaluating these images automatically have been published, they still do not reach the required level of accuracy, so this is considered an unsolved problem. This thesis is a further attempt to automate and improve the evaluation of chest X-ray images. Its main contribution addresses the class-imbalance problem of public-domain datasets by implementing a “Data Augmentation” procedure that increases the number of images of underrepresented pathologies and thereby improves the performance/accuracy of previously developed deep learning classification models. We tested our method on the CheXpert dataset, which is described in detail later in the thesis. Another contribution is dividing the dataset into several binary subsets and training on each in isolation: one subset was created for each pathology instead of evaluating all the pathologies together. We selected and fine-tuned two previously developed high-performance deep learning classification models: VGG19 and DenseNet121. The final result is a table of all AUC values before and after Data Augmentation, together with a graph for each pathology and each model. These steps resulted in average AUC values of 0.68 and 0.74 before Data Augmentation and 0.96 and 0.97 after Data Augmentation, for the VGG19 and DenseNet121 models respectively.	pt_PT
dc.identifier.tid	203799445	pt_PT
dc.identifier.uri	http://hdl.handle.net/10400.26/54420
dc.language.iso	eng	pt_PT
dc.subject	Chest X-rays	pt_PT
dc.subject	Lung diseases	pt_PT
dc.subject	Machine (deep) learning	pt_PT
dc.subject	Data augmentation	pt_PT
dc.subject	Artificial intelligence	pt_PT
dc.title	Representation learning methods for early detection of pathological lesions in chest X-rays images.	pt_PT
dc.type	master thesis
dspace.entity.type	Publication
rcaap.rights	openAccess	pt_PT
rcaap.type	masterThesis	pt_PT
thesis.degree.grantor	Instituto Politécnico de Setúbal
thesis.degree.name	Mestrado em Engenharia Biomédica	pt_PT
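
Illustrative sketch (not taken from the thesis): the abstract describes building one binary subset per pathology, using augmentation to balance under-represented classes, and comparing models by AUC. The Python below shows one way those three steps could look on toy data. The record layout, the function names, the rule that only a label of 1 counts as positive, and the pairwise AUC computation are all assumptions made for illustration; the thesis's actual pipeline, label handling, and augmentation transforms may differ.

```python
from collections import Counter

def binary_subset(records, pathology):
    """Build a one-vs-rest subset: (image, 1) if the pathology is marked
    present, (image, 0) otherwise. Treating only label == 1 as positive
    is an assumption, not necessarily the thesis's labelling policy."""
    return [(r["image"], int(r["labels"].get(pathology, 0) == 1))
            for r in records]

def augmentation_deficit(subset):
    """Return (minority_label, n_extra): how many augmented images the
    minority class would need to match the majority class count."""
    counts = Counter(label for _, label in subset)
    pos, neg = counts[1], counts[0]
    minority = 1 if pos < neg else 0
    return minority, abs(neg - pos)

def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive example gets the higher score; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical toy records; file names and label values are made up.
records = [
    {"image": "a.jpg", "labels": {"Edema": 1, "Cardiomegaly": 0}},
    {"image": "b.jpg", "labels": {"Edema": 0, "Cardiomegaly": 0}},
    {"image": "c.jpg", "labels": {"Edema": 0, "Cardiomegaly": 1}},
    {"image": "d.jpg", "labels": {"Edema": 0, "Cardiomegaly": 0}},
]

edema = binary_subset(records, "Edema")
minority, n_extra = augmentation_deficit(edema)
print(f"Edema: class {minority} needs {n_extra} augmented images")
print("toy AUC:", roc_auc([1, 0, 0, 1], [0.9, 0.2, 0.6, 0.5]))
```

In this sketch the deficit only says how many extra minority-class images would be needed; producing them (for example with flips, rotations, or intensity changes) and fine-tuning VGG19 or DenseNet121 on each balanced subset are left out.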

Files

Original bundle

Name: Tese_VersaoFinal.pdf
Size: 2.36 MB
Format: Adobe Portable Document Format

Collections

IPS - ESTS - BIBLIOTECA - Dissertações de mestrado