Design and develop a Distributed Recommendation System on Hadoop
Problem statement:
Given 2 CSV data sets:
(a) A course dataset containing details of courses offered
(b) A job description dataset containing a list of job descriptions
(Note: Each field of a job description record is demarcated by ” “)
You have to design and implement a distributed recommendation system using the data sets, which will recommend the best courses for up-skilling based on a given job description. You can use the data set to train the system and pick some job descriptions not in the training set to test.
Sample Solution