Features Engineering for Deep Learning NLP

Maulaya_Radhibilla1 · August 1, 2023, 3:35pm

I want to build deeplearning NLP model with datasets below. The dataset I have contains diseases and their corresponding symptoms like this

Disease	Symptom_1	Symptom_2	Symptom_3	Symptom_4	Symptom_5
Fungal infection	itching	skin_rash	nodal_skin_eruptions	dischromic_patches	None
Fungal infection	skin_rash	nodal_skin_eruptions	dischromic_patches	None	None
Fungal infection	itching	nodal_skin_eruptions	dischromic_patches	None	None
Fungal infection	itching	skin_rash	dischromic_patches	None	None
Fungal infection	itching	skin_rash	nodal_skin_eruptions	None	None

Before using it for Natural Language Processing (NLP) tasks, I want to preprocess the data to represent symptoms in a suitable format for my deep learning NLP model. I am considering two feature engineering options:

List of Symptoms for Each Disease:
I could create a new dataset where each row corresponds to a disease, and the symptoms are listed as a string. For example:

Disease	Symptoms
Chronic cholestasis	itching, yellowish skin, nausea, loss of appetite, abdominal pain, yellowing of eyes
Chronic cholestasis	itching, yellowish skin, nausea, loss of appetite, abdominal pain, yellowing of eyes

or,

Transformed Symptom Descriptions:
Alternatively, I could transform the symptoms into a single string description for each disease. For example:

"Fungal infection. Itching. Reported signs of dischromic patches. 
Patient reports no patches in throat. 
Issues of frequent skin rash. 
Patient reports no spotting urination. 
Patient reports no stomach pain. 
Nodal skin eruptions over the last few days."

My question is, which kind of feature engineering should I use that would work better for my deep learning model? I would appreciate input and insights from the community to help me make an informed decision.

Muhammad_Yousaf2 · August 26, 2023, 7:46am

Hi,
In my opinion try using the linear Regression model, which will help you create a function. Also try minimizing the cost of the function.
Thanks

Topic		Replies	Views
Need advise on my personal project model AI Discussions	1	56	November 2, 2022
Medical concepts extraction from documents through LLM finetuining Generative AI with Large Language Models week-module-2	13	888	September 13, 2023
What should be the LSTM model architecture in order to forecast disease probability? AI Discussions	2	56	September 2, 2022
ML project model suggestion AI Discussions ai-discussions , project	2	182	March 12, 2024
How to tackle problem with nlp dataset AI Discussions	3	63	October 2, 2023

Features Engineering for Deep Learning NLP

Related topics