Big Data Essentials - Week 3 (Course 1) #2

saint1729 · 2019-02-03T20:22:42Z

Hi,

In "mapper_rating.py", why did you print as

'print "%d\t%s" % (count, word)'?

Is it necessary for the field being sorted on to appear first in the output?

I tried it the following way,

'print "%s\t%d" % (word, count)'

and changed line 6 of "reducer_rating.py" to

"word, count = line.strip().split('\t', 1)"

and changed the yarn command option to,

'-D mapreduce.partition.keycomparator.options=-k2,2nr'

It is not showing the right answer. Can you please tell me what is wrong with the options I set?
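
For reference, the changes described above amount to the round trip sketched below. This is only a minimal local illustration, not the course's actual scripts: the helper names `emit` and `parse` and the sample words and counts are made up, and it uses a plain Python sort to show the order that `-k2,2nr` is meant to express (second field, numeric, descending).

```python
def emit(word, count):
    # Modified mapper format: word first, count second, tab-separated.
    return "%s\t%d" % (word, count)


def parse(line):
    # Mirrors the modified reducer line; the count arrives as a string,
    # so it is converted back to an int here.
    word, count = line.strip().split('\t', 1)
    return word, int(count)


# Hypothetical sample records, for illustration only.
records = [emit("spark", 3), emit("hadoop", 12), emit("yarn", 7)]

# Intended meaning of -k2,2nr: sort on field 2, numerically, in reverse.
ordered = sorted(records, key=lambda r: int(r.split('\t')[1]), reverse=True)

for line in ordered:
    print(line)
```

One thing that may be worth checking, stated as an assumption about this job rather than a confirmed diagnosis: in Hadoop Streaming only the first tab-separated field is treated as the key by default, so a comparator option that refers to field 2 of the key has nothing to compare unless `stream.num.map.output.key.fields` is set to 2 and the comparator class is `org.apache.hadoop.mapreduce.lib.partition.KeyFieldBasedComparator`.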
