Netflix Watch History Sync

Netflix Watch History Sync is an automated script designed to download and archive Netflix viewing history for multiple profiles and import the data into a MariaDB database daily. It runs in a headless Linux environment using Selenium for browser automation and is optimized for use in cron jobs. Error handling, duplicate checking, and file archiving make it reliable for long-term use.

Features

Automated Netflix Login: Logs in and downloads watch history without manual interaction.
Multiple Profiles: Supports handling multiple Netflix profiles in one run.
MariaDB Integration: Imports viewing history into a MariaDB database with duplicate checking.
File Archiving: Moves processed CSV files to an archive folder.
Headless Execution: Runs on a Linux server without a GUI.
Cron Job Friendly: Suitable for scheduled daily runs to keep the database up-to-date.

Requirements

Python 3.8+
Chromium and ChromiumDriver for headless browsing
MariaDB server
Python Packages: Selenium, pandas, mysql-connector-python
Linux Server (tested on Ubuntu/Debian)

Installation

Clone the Repository:

bash

git clone https://github.com/IVRYSimon/netflix-watch-history-sync.git
cd netflix-watch-history-sync

Install Required Packages: Install Python dependencies:

bash
```
pip install -r requirements.txt
```

Install Chromium and ChromiumDriver:

bash

sudo apt update
sudo apt install -y chromium-browser chromium-chromedriver`

Set Up MariaDB Database:

sql

CREATE DATABASE netflix_viewed;
CREATE TABLE netflix_watchlist (
    id INT AUTO_INCREMENT PRIMARY KEY,
    title VARCHAR(255),
    date_watched DATE,
    profile_name VARCHAR(100),
    UNIQUE(title, date_watched, profile_name)
);

With the UNIQUE function, we assure that entries are unique identified by title, date_watched and profile_name

Configuration

Edit the script to update the Netflix credentials, profile names, and database configuration:

Netflix Account Details:

python

NETFLIX_EMAIL = "[email protected]"
NETFLIX_PASSWORD = "your-password"
PROFILES = ["Profile1", "Profile2", "Profile3"]`

MariaDB Configuration:

python

conn = mariadb.connect(
    user="your-db-user",
    password="your-db-password",
    host="localhost",
    database="netflix_viewed"
)

Download Path: Set the directory for temporary CSV files and archived files.

python
```
DOWNLOAD_PATH = "/path/to/download"
```

Usage

Run the script manually to test the setup:

bash

python3 import_netflix_viewed.py

Setting up a Cron Job

To automate daily data syncing, set up a cron job. Open the crontab editor:

bash

crontab -e

Add a line for daily execution at 2:00 AM:

cron

0 2 * * * /usr/bin/python3 /path/to/netflix-watch-history-sync/import_netflix_viewed.py >> /path/to/log/netflix_sync.log 2>&1`

Troubleshooting

Common Issues and Fixes

Issue: "Profile 'ProfileName' could not be found."

Cause: The profile name in the script doesn’t match the actual profile on Netflix.
Solution: Double-check the profile names in the script to ensure they match the exact names on Netflix.

Issue: "session not created: Chrome not reachable"

Cause: Chromium or Chromedriver versions may be incompatible.
Solution:
- Ensure both Chromium and Chromedriver are installed with compatible versions.
- Check versions:
  
  bash
```
chromium --version
chromedriver --version` 
```

Issue: "DevToolsActivePort file doesn't exist"

Cause: Chrome sometimes fails to start in headless mode.
Solution:
- Add --no-sandbox and --disable-dev-shm-usage to Chrome options.
- Ensure /tmp directory has sufficient space:
  
  bash
```
df -h /tmp
```

Issue: "No such file or directory: NetflixViewingHistory.csv"

Cause: The CSV file may not have downloaded.
Solution:
- Ensure the "Viewing Activity" page is loading correctly in Selenium.
- Increase time.sleep() delays if needed to allow time for page loads.

Issue: "IntegrityError: Duplicate entry"

Cause: Attempted to insert an existing entry.
Solution: The script already includes duplicate checking, so this message can safely be ignored. To suppress the error, ensure you’re handling IntegrityError in the import function.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Netflix Watch History Sync

Features

Table of Contents

Requirements

Installation

Configuration

Usage

Setting up a Cron Job

Troubleshooting

Common Issues and Fixes

Issue: "Profile 'ProfileName' could not be found."

Issue: "session not created: Chrome not reachable"

Issue: "DevToolsActivePort file doesn't exist"

Issue: "No such file or directory: NetflixViewingHistory.csv"

Issue: "IntegrityError: Duplicate entry"

License

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
import_netflix_viewed.py		import_netflix_viewed.py
requirements.txt		requirements.txt

IVRYSimon/netflix-watch-history-sync

Folders and files

Latest commit

History

Repository files navigation

Netflix Watch History Sync

Features

Table of Contents

Requirements

Installation

Configuration

Usage

Setting up a Cron Job

Troubleshooting

Common Issues and Fixes

Issue: "Profile 'ProfileName' could not be found."

Issue: "session not created: Chrome not reachable"

Issue: "DevToolsActivePort file doesn't exist"

Issue: "No such file or directory: NetflixViewingHistory.csv"

Issue: "IntegrityError: Duplicate entry"

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages