Which two resources should you expect to be bottlenecks?
Asked by: jellybean
You need to create a job that does frequency analysis on input data. You will do this by writing a
Mapper that uses TextInputFormat and splits each value (a line of text from an input file) into
individual characters. For each one of these characters, you will emit the character as a key and
an IntWritable as the value. As this will produce proportionally more intermediate data than input
data, which two resources should you expect to be bottlenecks?
Processor and network I/O
Disk I/O and network I/O
Processor and RAM
Processor and disk I/O
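
To get a feel for the "proportionally more intermediate data" part of the question, here is a rough sketch in plain Java (no Hadoop dependency) that mimics the map step and estimates how many bytes of intermediate output one input line produces. The class name `CharFrequencySketch` and its methods are made up for illustration. The sizes are assumptions: the character key is taken to be serialized like a Hadoop `Text` (a 1-byte length prefix plus its UTF-8 bytes for short strings), the value like an `IntWritable` (4 bytes), and any per-record framing in spill or shuffle files is ignored.

```java
import java.nio.charset.StandardCharsets;
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class CharFrequencySketch {
    // Assumption: an IntWritable value serializes to a 4-byte int.
    public static final int INT_VALUE_BYTES = 4;
    // Assumption: a short Text key carries a 1-byte length prefix.
    public static final int KEY_LENGTH_PREFIX_BYTES = 1;

    // Mimics the mapper: one (character, 1) pair per character in the line.
    public static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        line.codePoints().forEach(cp ->
            out.add(new AbstractMap.SimpleEntry<>(new String(Character.toChars(cp)), 1)));
        return out;
    }

    // Estimated serialized size of all pairs emitted for one line.
    public static long intermediateBytes(String line) {
        long total = 0;
        for (Map.Entry<String, Integer> pair : map(line)) {
            int keyBytes = pair.getKey().getBytes(StandardCharsets.UTF_8).length;
            total += KEY_LENGTH_PREFIX_BYTES + keyBytes + INT_VALUE_BYTES;
        }
        return total;
    }

    // Size of the line as stored in the input file, including its newline.
    public static long inputBytes(String line) {
        return line.getBytes(StandardCharsets.UTF_8).length + 1;
    }

    public static void main(String[] args) {
        String line = "hello world";
        System.out.println("input bytes:        " + inputBytes(line));
        System.out.println("intermediate bytes: " + intermediateBytes(line));
    }
}
```

Under these assumptions every ASCII input byte turns into roughly six bytes of map output, while the per-character work itself is trivial. Map output is written to local disk and then copied to the reducers during the shuffle, so that is where the extra volume would be expected to show up, unless a combiner or compression reduces it first.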