This project involves analyzing a dataset of Nobel Prize winners to gain insights into their demographics, distribution, and trends over time. The dataset contains information about Nobel Prize laureates from 1901 to 2016, including details like birth country, gender, category, and more. The goal of this analysis is to uncover patterns, trends, and significant observations related to Nobel Prize recipients.
The Nobel Prize is a prestigious international award presented annually to individuals or organizations that have made significant contributions to various fields, including Physics, Chemistry, Medicine, Literature, Peace, and Economics. This project focuses on exploring the Nobel Prize dataset to uncover insights about laureates, including gender distribution, country representation, categories dominance, and historical trends.
The dataset used for this analysis contains information about Nobel Prize laureates and includes various attributes such as:
- Name and surname of the laureate
- Birth and death dates
- Birth and death places (countries)
- Nobel Prize category and year
- Summary of the laureate's contribution
- Gender and organization (if applicable)
The dataset spans from 1901 to 2016 and provides a comprehensive collection of Nobel Prize laureate information for in-depth analysis.
The project follows a structured organization:
data/
: Contains the dataset filesnotebooks/
: Jupyter notebooks for data cleaning, exploration, visialization and analysisREADME.md
: Documentation about the project (you are here)
The data analysis process involves several steps:
- Data cleaning: Handling missing values, correcting inconsistencies
- Exploratory data analysis: Visualizing demographics, categories, trends
- Statistical analysis: Extracting insights from the data
After analyzing the dataset, several key findings were discovered:
- Gender distribution among laureates
- Most awarded countries
- Dominant Nobel Prize categories
- Historical trends and changes
The analysis of the Nobel Prize dataset provides valuable insights into the demographics, distribution, and trends of laureates. This project highlights the importance of recognizing contributions across various fields and regions.