Optimizing Resource Management in Cloud Analytics Services

Read PDF →

, 2018

Category: Distributed Systems

Overall Rating

2.4/5 (17/35 pts)

Score Breakdown

  • Latent Novelty Potential: 5/10
  • Cross Disciplinary Applicability: 7/10
  • Technical Timeliness: 2/10
  • Obscurity Advantage: 3/5

Synthesized Summary

  • This paper offers potentially actionable insights through its conceptual frameworks, particularly the use of market mechanisms... and "transformed costs" for optimization decomposition.

  • These frameworks provide alternative, interdisciplinary approaches to resource management problems... where existing solutions might not fully account for incentives or structural complexity.

  • The core problems remain relevant, and the abstract frameworks... offer interesting interdisciplinary perspectives.

  • However, the specific technical solutions and models are largely tied to outdated architectural assumptions and simplifying models.

Optimist's View

  • The "learning-to-switch" framework, though demonstrated on GS/RAS, could be generalized to learn switching between any set of competing resource allocation strategies in dynamic environments.

  • The concept of explicitly quantifying the effective job size (or resource requirement) by incorporating the cost and benefit of speculation... into a "virtual job size" metric is potentially novel.

  • The idea of defining "transformed costs"... to enable separating an upstream decision... from a downstream decision... is potentially highly novel and applicable to other multi-stage optimization problems where decisions interact in complex ways.

  • The central idea is the application of supply function bidding (SFB), a mechanism from electricity markets, to an internal resource coordination problem (tenant power reduction) within a data center...

Skeptic's View

  • The thesis is fundamentally rooted in the architectural paradigms dominant around 2018: Hadoop/Spark running on EC2-like VM clusters...

  • This thesis likely faded because its specific technical contributions... addressed problems within system contexts that were rapidly evolving.

  • The models rely on specific distributions (Pareto, Zipf) which are approximations; real-world behavior is often more complex and dynamic.

  • Mainstream cloud infrastructure and orchestration layers (Kubernetes) have absorbed many of these challenges into their core design.

Final Takeaway / Relevance

Watch