Movie Recommender

  • Tech Stack: Spark, SQL, Pandas, AWS
  • Website URL: Link

Movie Recommendation Engine Development in Apache Spark

  • Built a data pipeline on Spark RDDs to analyze movie rating datasets with Spark SQL
  • Implemented the Alternating Least Squares (ALS) model to provide personalized movie recommendations with a Root Mean Square Error of 0.69, and adapted user-based approaches to handle the cold-start problem
  • Tuned model hyperparameters with Spark ML cross-validation tools and monitored data processing performance via the Spark UI on AWS
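
The evaluation metric and the cold-start handling mentioned above can be illustrated with a minimal, Spark-free sketch in plain Python. It is not the project's code: the function names, the `model_scores` dictionary standing in for ALS predictions, and the sample ratings are all hypothetical. The fallback shown is a simple mean-rating rule (movie mean, then global mean), swapped in for illustration rather than the user-based approach the project describes.

```python
import math
from collections import defaultdict


def rmse(predicted, actual):
    """Root mean square error between two equal-length rating sequences."""
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared / len(actual))


def movie_means(ratings):
    """Average rating per movie from (user_id, movie_id, rating) tuples."""
    totals = defaultdict(lambda: [0.0, 0])  # movie_id -> [sum, count]
    for _user, movie, rating in ratings:
        totals[movie][0] += rating
        totals[movie][1] += 1
    return {movie: s / n for movie, (s, n) in totals.items()}


def predict_with_fallback(model_scores, means, global_mean, user, movie):
    """Return the model's score when one exists for (user, movie).

    Assumption: model_scores is a dict standing in for a trained
    recommender's output. For unseen users or pairs (cold start), fall
    back to the movie's mean rating, then to the global mean.
    """
    if (user, movie) in model_scores:
        return model_scores[(user, movie)]
    return means.get(movie, global_mean)


# Hypothetical toy data, for illustration only.
train = [("u1", "m1", 4.0), ("u2", "m1", 2.0), ("u1", "m2", 5.0)]
means = movie_means(train)
global_mean = sum(r for _, _, r in train) / len(train)
model_scores = {("u1", "m1"): 3.5}

known = predict_with_fallback(model_scores, means, global_mean, "u1", "m1")
new_user = predict_with_fallback(model_scores, means, global_mean, "u9", "m1")
new_movie = predict_with_fallback(model_scores, means, global_mean, "u9", "m3")
```

Here `known` comes from the stand-in model, `new_user` falls back to the mean rating of `m1`, and `new_movie` falls back to the global mean because neither the user nor the movie was seen in training.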