This is like Unix tools but distributed across thousands of machines. Instead of the stdin and stdout like Unix, map reduce jobs read and write files on a distributed filesystem like the Hadoop File System.