This project uses collaborative filtering to build a basic recommender system using movie rating data.
Two files: Download them here - https://www.kaggle.com/datasets/ashukr/movie-rating-data?select=ratings.csv
ratings.csv: User ratings for moviesmovies.csv: Metadata (title, genres)
- Merge ratings and metadata
- Compute average rating and number of ratings per movie
- Visualize distributions
- Create a utility matrix (userId vs movie titles)
- Use Pearson correlation to find movies similar to a target (e.g., xXx (2002))
- Histogram of number of ratings
- Histogram of average ratings
- Joint distribution of count vs average
- Heatmap of feature correlations
movie-recommender/
├── data/
│ ├── ratings.csv
│ └── movies.csv
├── notebooks/
│ └── movie_recommender.ipynb
├── README.md
├── requirements.txt
└── .gitignore
- Clone the repo
- Install dependenciesand run:
pip install -r requirements.txt #Run jupyter notebook notebooks/movie_recommender.ipynb