Uber details Fiber, a framework for distributed AI model training

VentureBeat | Mar 26, 2020 at 11:35 PM
  • Fiber uses cluster management software for job scheduling and tracking, doesn’t require preallocating resources, and can scale up and down dynamically, so users can move from one machine to multiple machines seamlessly.
  • Fiber comprises an API layer, a backend layer, and a cluster layer.
  • The cluster layer relies on different cluster managers to manage resources and keep tabs on jobs, reducing the number of items Fiber itself needs to track.
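
The three-layer split described above can be pictured with a small toy sketch. This is not Fiber's actual API: the class names (LocalClusterManager, Backend, Pool) are hypothetical, and a local thread pool stands in for a real cluster manager. It only illustrates how job tracking can sit in the cluster layer while user code talks to the API layer.

```python
from concurrent.futures import ThreadPoolExecutor


class LocalClusterManager:
    """Cluster layer stand-in (hypothetical): owns resources and tracks jobs."""

    def __init__(self, max_workers=4):
        # Assumption: a thread pool plays the role of a real cluster manager.
        self._executor = ThreadPoolExecutor(max_workers=max_workers)
        self.jobs = []  # job tracking lives here, not in the upper layers

    def submit(self, fn, *args):
        future = self._executor.submit(fn, *args)
        self.jobs.append(future)
        return future

    def shutdown(self):
        self._executor.shutdown(wait=True)


class Backend:
    """Backend layer stand-in (hypothetical): forwards job requests to one cluster manager."""

    def __init__(self, cluster):
        self._cluster = cluster

    def create_job(self, fn, *args):
        return self._cluster.submit(fn, *args)


class Pool:
    """API layer stand-in (hypothetical): the only layer user code calls."""

    def __init__(self, backend):
        self._backend = backend

    def map(self, fn, items):
        # Submit one job per item, then collect results in input order.
        futures = [self._backend.create_job(fn, item) for item in items]
        return [f.result() for f in futures]


def square(x):
    return x * x


cluster = LocalClusterManager(max_workers=2)
pool = Pool(Backend(cluster))
results = pool.map(square, range(5))
cluster.shutdown()
print(results)
```

In this sketch, swapping LocalClusterManager for a class that talks to a real scheduler would leave Pool and the user code unchanged, which is the kind of separation the layered design suggests.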