🎯 Project Objective:
Welcome to the "Exploring Texas Real Estate Sales Data" project! This repository is dedicated to providing a detailed analysis of real estate sales data in Texas using descriptive statistics and powerful data visualization techniques in R.
🔍 Dataset Acquisition and Preparation:
- Imported the "realestate_texas.csv" dataset which encapsulates extensive information on property sales across various cities in Texas. The dataset includes key variables such as city, year, month, sales volume, median price, listings, and months of inventory.
📊 Data Exploration:
- Conducted thorough exploratory data analysis to understand the dataset's structure and main characteristics. This includes identifying the types of variables, calculating measures of central tendency and dispersion, and examining the distribution of key variables.
💡 Variability and Asymmetry Analysis:
- Investigated which variable exhibits the highest variability and which shows the most asymmetry, offering insights into the dynamics and spread of the data.
🎨 Advanced Data Visualization:
- Created a series of compelling visualizations using R's ggplot2 package. Visuals such as box plots to compare median prices among cities, and bar charts to track sales trends over months and years, help illustrate the data's story vividly.
- Enhanced visualization aesthetics by customizing themes, colors, labels, axes, and legends to make the graphs not only informative but also visually appealing.
📈 Statistical Analysis and Gini Index Calculation:
- Utilized the dplyr package to perform statistical summaries, computing key metrics like mean, standard deviation, and variance, organized by city, year, and month.
- Calculated the Gini index for selected variables to assess inequality and concentration within the data.
This project navigates through the intricacies of descriptive statistics and visual storytelling to shed light on the Texas real estate market trends. It serves as an invaluable resource for anyone interested in data science, statistics, or real estate market dynamics.
🔗 GitHub Repository: Dive into the codebase to discover how detailed statistical analysis paired with strategic data visualization can uncover fascinating insights into real estate data. This repository offers a comprehensive toolkit for anyone looking to understand or present real estate trends through data.
Feel free to explore, adapt, and enhance the analyses and visualizations presented here to fit your personal or professional needs. Happy exploring!