Skip to content

Pavan-create-dot/Task-2-Titanic-Survival-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

🛳️ Titanic Survival Data Analysis: Factors Influencing Survival

⚠️ Disclaimer: This dataset is NOT real data from the Titanic ship. It is a simulated dataset created for the purpose of data analysis and visualization practice. The project explores how factors such as age, family size, gender, and embarkation point influenced the likelihood of survival during the Titanic disaster. This data should not be considered authentic or representative of real Titanic passengers.

🚀 Project Overview This project analyzes a simulated Titanic dataset to uncover trends and insights related to passenger survival rates. By employing statistical methods and interactive visualizations, we investigate key factors affecting survival, including:

Age: Did younger passengers have higher survival rates? Family Size: Did passengers with larger families have better chances of survival? Titles (Mr., Mrs., Miss., etc.): How did social status or titles influence survival? Embarkation Point: Did boarding location (Cherbourg, Queenstown, Southampton) affect survival chances? Fare: Was there a relationship between fare amount and survival? Gender: Did women and children have higher survival rates? Correlations: What relationships existed between different variables? Missing Data: Where are the gaps in the dataset, and how do they impact analysis?

🔍 What’s Inside? The repository includes a Jupyter Notebook containing step-by-step Titanic data analysis and the following visualizations:

Survival by Age Groups

A bar graph displaying survival rates across age categories (children, young adults, adults, and elderly). Survival by Family Size

Examines how family size (siblings, spouses, parents, or children aboard) influenced survival chances. Survival by Title

Analyzes survival rates based on social titles (e.g., Mr., Mrs., Miss.), reflecting the influence of social status. Survival by Embarkation Point

Explores survival differences based on embarkation points (Cherbourg, Queenstown, Southampton). Survival by Fare Distribution

Shows how fare amounts correlate with survival, reflecting class-based evacuation priorities. Survival by Gender

Compares survival rates between men and women, highlighting the prioritization of women and children in lifeboats. Correlation Heatmap

Visualizes the relationships between variables like age, family size, fare, and survival. Missing Values Heatmap

Highlights the locations of missing data to guide preprocessing efforts.

🌟 Key Insights from the Visualizations Missing Data: Understanding missing values is crucial for effective preprocessing. Age: Children showed higher survival rates due to prioritization. Family Size: Survival chances may increase with larger family groups. Title: Titles like “Mrs.” and “Miss.” correlated with higher survival, reflecting societal norms. Embarkation Point: Cherbourg passengers had slightly higher survival rates. Fare: Higher fares were associated with better survival odds, indicating class advantages. Gender: Women had significantly higher survival rates than men, aligning with historical accounts. Correlations: Relationships between variables such as fare, age, and survival provide actionable insights.

🧩 Technologies Used Jupyter Notebook 📓 for interactive analysis. Python 🐍 for data processing and analysis. Pandas 🐼 for data manipulation. Matplotlib and Seaborn 🎨 for visualizations. Numpy 🔢 for numerical operations.

📁 Dataset The dataset (Dataset.csv) is simulated to represent Titanic passenger data. It includes variables like age, gender, fare, family size, embarkation point, and survival status.

link : https://1drv.ms/x/c/f6bc90f75ca4b431/ERObFs-fyLNDqImvsFcaJkoBzihPx_iflqyeljRB7WdIJA?e=AnES5T

🎥 Watch the Project in Action Explore the analysis through the interactive notebook: https://1drv.ms/v/c/f6bc90f75ca4b431/EZP_VUyBdx9JndEwPtN-n9IB-SilnUGpVqiQp83o50ru6A?e=ptWQfG

🚀 Conclusion This project provides insights into simulated Titanic survival factors, demonstrating the use of data storytelling to analyze historical scenarios. By visualizing key variables, the analysis highlights patterns and relationships that could have influenced survival during the Titanic disaster.

💬 Contribute & Feedback Feel free to ask questions, suggest improvements, or contribute to the project. Contributions are always welcome!

image image image image image image image image

About

Titanic Survival Data Analysis: Factors Influencing Survival

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published