Friday, January 26, 2018

Limitations of Hadoop Version 1

  • Max cluster size ~5000 nodes
  • Max concurrent tasks ~40,000
  • Coarse Synchronization in JobTracker
  • JobTracker failure kills all queued and running jobs
Hard partition of resources into map and reduce slots
  • Non-optimal Resource Utilization
Lacks support for Alternate Paradigms and Services
  • Iterative applications in MapReduce are 10x slower
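The effect of the hard map/reduce slot partition on utilization can be illustrated with a small sketch. This is a toy model, not Hadoop code: the function names, the slot counts, and the task counts are all hypothetical, chosen only to show why slots that cannot be repurposed leave capacity idle.

```python
# Toy model of slot-based scheduling (illustrative only, not a Hadoop API).

def fixed_slot_utilization(map_slots, reduce_slots, pending_maps, pending_reduces):
    """Fraction of slots busy when map and reduce slots are NOT interchangeable."""
    busy = min(map_slots, pending_maps) + min(reduce_slots, pending_reduces)
    return busy / (map_slots + reduce_slots)


def pooled_utilization(total_slots, pending_maps, pending_reduces):
    """Fraction of slots busy when any slot can run any kind of task."""
    busy = min(total_slots, pending_maps + pending_reduces)
    return busy / total_slots


# Hypothetical map-heavy phase: 20 map tasks waiting, no reduce tasks yet.
fixed = fixed_slot_utilization(map_slots=10, reduce_slots=10,
                               pending_maps=20, pending_reduces=0)
pooled = pooled_utilization(total_slots=20, pending_maps=20, pending_reduces=0)

print(f"fixed partition: {fixed:.0%} busy")
print(f"shared pool:     {pooled:.0%} busy")
```

Under these assumed numbers, the reduce slots sit idle during the map-heavy phase even though map tasks are waiting, which is the kind of non-optimal utilization the list above refers to.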

