Enhancing Disease Diagnosis in Healthcare Using LSTM Networks with Optimized Feature Selection: A Comparative Research on Heart Disease, Breast Cancer, and Liver Disease Datasets

  • Tina Sachdeva Assistant Professor, Shaheed Rajguru College of Applied Sciences for Women, University of Delhi, Vasundhara Enclave, Delhi-110096 http://orcid.org/0009-0003-8798-8719
  • Priyanka Sharma Assistant Professor, Acharya Narendra Dev College, University of Delhi, Govind Puri, Kalkaji-110019 http://orcid.org/0009-0009-3341-6823
  • Mehtab Alam Assistant Professor, Acharya Narendra Dev College, University of Delhi, Govind Puri, Kalkaji-110019 http://orcid.org/0000-0001-7554-2160

Abstract

Clinical diagnosis relies heavily on accurate and timely medical assessment. However, clinical data sets are typically large, incomplete, and noisy, therefore limiting the reliability of traditional models used to diagnose clinical conditions. To address these limitations, this study developed an integrated model combining an optimised feature selection strategy with a Long Short-Term Memory (LSTM) model to provide improved accuracy across three clinical condition benchmarks, namely heart disease, breast cancer, and liver disorder. The first step in this process was feature selection, which involved selecting only those features that were deemed clinically relevant. By doing this, the subsequent LSTM model was able to recognise temporal patterns within the patient's clinical data set more easily than it would have been able to do using the original clinical data set. Results from this work demonstrates that the proposed methodology resulted in higher accuracy compared to baseline classifiers and also to non-optimised LSTM models across all three data sets. Specifically, the proposed method had an accuracy of 91.5% for heart disease, 97.6% for breast cancer, with a Receiver Operating Characteristic Area Under the Curve (ROC-AUC) of 0.99, and 83.9% accuracy for liver disorder. Additionally, there was a clear improvement in terms of recall and F1-score. Overall, the results from this study demonstrate that the integration of a method for dimensionality reduction with a sequential learning approach produces a reliable and generalizable clinical diagnostic model that can support clinical decision-making in various health care settings.

Downloads

Download data is not yet available.

Author Biography

Mehtab Alam, Assistant Professor, Acharya Narendra Dev College, University of Delhi, Govind Puri, Kalkaji-110019

Mehtab Alam is currently working as an Assistant Professor in Acharya Narendra Dev College, University of Delhi, India. Dr. Alam got his PhD in Computer Science and Engineering from Jamia Hamdard New Delhi, India after completing Master of Technology in Information Security and Cyber Forensics and Bachelor of Technology in Information Technology. He is an active researcher in Artificial Intelligence, Internet of Things (IoT), Cyber Security and Robotics, with a particular focus on intelligent control strategies, machine learning-driven perception systems, and secure IoT architectures. His recent works explore learning-based control for multi-finger robotic grippers, AI-guided design of solid-state batteries for IoT and medical applications, and cybersecurity frameworks for healthcare networks. With over 50 high-quality research publications, including journal articles, book chapters, and e-books. Dr. Alam actively contributes to the advancement of secure and intelligent computing systems.

Published
2026-02-19
How to Cite
Sachdeva, T., Sharma, P., & Alam, M. (2026). Enhancing Disease Diagnosis in Healthcare Using LSTM Networks with Optimized Feature Selection: A Comparative Research on Heart Disease, Breast Cancer, and Liver Disease Datasets. ITEGAM-JETIA, 12(57), 993-1008. https://doi.org/10.5935/jetia.v12i57.3154
Section
Articles