Recommender Systems for Steam Video Games - ALS and NCF

Overview

This project aims to build a recommender system for Steam video games using two distinct approaches: Alternating Least Squares (ALS) and Neural Collaborative Filtering (NCF). The dataset consists of user interactions with Steam games, providing information such as:

user_id: Unique identifier for users.
name: Title of the game.
hours: Time spent playing.
action: Whether the game was purchased or played.
zero: Unknown variable.

You can access the dataset here: Steam Video Games Data

Although the dataset contains 200,000 records, which could be handled using pandas, I opted to use PySpark for data preprocessing to gain experience with distributed data processing tools. This choice helps develop proficiency in handling larger datasets efficiently and prepares for future projects where scalability might be critical.

Since the dataset lacks explicit feedback (like ratings), we utilize implicit feedback by leveraging the hours feature. The assumption is that the more time a user spends on a game, the stronger their preference for that game, making hours a proxy for preference.

Models:

Alternating Least Squares (ALS): A collaborative filtering model implemented with PySpark, which is well-suited for implicit feedback data and scales efficiently with larger datasets. ALS leverages matrix factorization to learn latent factors for users and games.
Neural Collaborative Filtering (NCF): A deep learning-based recommendation approach. It models user-game interactions using neural networks, specifically designed to capture non-linear user-item relationships, which may not be captured by traditional matrix factorization techniques like ALS.

Tech Stack:

PySpark: For data preprocessing and implementing the ALS model.
TensorFlow: For building and training the NCF model.
Jupyter Notebook: For development and experimentation.
NumPy & pandas: For data manipulation.
Scikit-Learn: Used for evaluation metric and train-test splitting for the NCF model.

Result

The NCF model outperformed the ALS model, showing a lower RMSE.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
recommendation_steam.ipynb		recommendation_steam.ipynb
steam-200k.csv		steam-200k.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Recommender Systems for Steam Video Games - ALS and NCF

Overview

Models:

Tech Stack:

Result

License

About

Releases

Packages

Languages

License

asenacak/recommenderSystems-SteamVideoGames

Folders and files

Latest commit

History

Repository files navigation

Recommender Systems for Steam Video Games - ALS and NCF

Overview

Models:

Tech Stack:

Result

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages