Intermediate files produced are not removed and can cause the result of jobs between different runs to be incorrect Suggested solution: Use regex to remove the files in question after processing reduce tasks