Which model to use?

I am working on predicting the survivors of the Titanic from their data (I know it's an odd choice), but after looking at some graphs and feeding them into logistic regression I don't get an efficient model no matter what I try. Should I learn neural networks and then apply them, or is there anything else you would suggest?

Have you seen the notebooks here?


Hi thethunderstrome,

It’s completely normal to hit a wall with Logistic Regression on the Titanic dataset! It’s a classic problem, and the solution typically lies in data preparation, rather than simply switching to a more complex model.

Before jumping into Neural Networks, I strongly recommend you focus on two key areas:

1. Feature Engineering (The Biggest Gain)

Logistic Regression is a linear model, and it struggles with raw data. You need to create features that expose non-linear relationships.

  • Extract ‘Title’: The title (e.g., Mr., Miss, Master) from the name is incredibly predictive.

  • Create ‘Family Size’: Combine SibSp and Parch. People traveling alone, in small groups, or in very large groups had different survival chances.

  • Improve Age Imputation: Instead of using the overall average, impute missing ‘Age’ based on the passenger’s ‘Title’.
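The three steps above can be sketched in pandas. This is a minimal illustration, assuming the standard Titanic column names (`Name`, `SibSp`, `Parch`, `Age`); the tiny inline DataFrame just stands in for the real CSV:

```python
import pandas as pd

# Toy stand-in for the Titanic data (real data comes from train.csv).
df = pd.DataFrame({
    "Name": ["Braund, Mr. Owen Harris",
             "Palsson, Master. Gosta Leonard",
             "Rice, Master. Eugene"],
    "SibSp": [1, 3, 4],
    "Parch": [0, 1, 1],
    "Age": [22.0, None, 2.0],
})

# 1. Extract 'Title': the word between the comma and the period.
df["Title"] = df["Name"].str.extract(r",\s*([^.]+)\.", expand=False)

# 2. 'Family Size': siblings/spouses + parents/children + the passenger.
df["FamilySize"] = df["SibSp"] + df["Parch"] + 1

# 3. Impute missing Age with the median age of passengers sharing the Title.
df["Age"] = df["Age"].fillna(df.groupby("Title")["Age"].transform("median"))
```

Here the missing Age for a 'Master' is filled with the median age of the other 'Master' passengers, which is far more realistic than the overall average (Master indicates a young boy).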

2. Try Ensemble Methods

These algorithms handle the complexity of the data much better than Logistic Regression without the heavy overhead of Neural Networks.

  • Random Forest: A great starting point. It’s robust, less prone to overfitting, and handles non-linearities automatically.

  • Gradient Boosting (e.g., XGBoost): Gradient-boosted trees are the gold standard for structured, tabular data and will likely give you the highest accuracy.
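For reference, swapping in a tree ensemble is only a few lines with scikit-learn. This sketch uses synthetic data as a stand-in for your engineered Titanic features (the shapes and scores are illustrative, not results from the actual competition):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Synthetic binary-classification data standing in for engineered
# features like Pclass, Sex, Title, FamilySize, imputed Age, ...
X, y = make_classification(n_samples=400, n_features=6, random_state=0)

# Random Forest: robust defaults, handles non-linearities automatically.
clf = RandomForestClassifier(n_estimators=200, random_state=0)

# Always judge the model with cross-validation, not training accuracy.
scores = cross_val_score(clf, X, y, cv=5)
print(f"mean CV accuracy: {scores.mean():.3f}")
```

The same pattern works for gradient boosting: replace the classifier with `sklearn.ensemble.GradientBoostingClassifier`, or with `xgboost.XGBClassifier` if you install the separate XGBoost library.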

The takeaway: A well-engineered dataset fed into an XGBoost model will almost certainly outperform raw data fed into a Neural Network on this challenge.