massive dataset parallel processing 103704