4.5 Article

DMFSGD: A Decentralized Matrix Factorization Algorithm for Network Distance Prediction

Journal

IEEE-ACM TRANSACTIONS ON NETWORKING
Volume 21, Issue 5, Pages 1511-1524

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNET.2012.2228881

Keywords

Matrix completion; matrix factorization; network distance prediction; stochastic gradient descent

Funding

  1. EU under Project FP7-Fire ECODE
  2. European Network of Excellence PASCAL2
  3. Belgian network DYSCO (Dynamical Systems, Control, and Optimization)
  4. Interuniversity Attraction Poles Programme
  5. Belgian State, Science Policy Office

Ask authors/readers for more resources

The knowledge of end-to-end network distances is essential to many Internet applications. As active probing of all pairwise distances is infeasible in large-scale networks, a natural idea is to measure a few pairs and to predict the other ones without actually measuring them. This paper formulates the prediction problem as matrix completion where the unknown entries in a pairwise distance matrix constructed from a network are to be predicted. By assuming that the distance matrix has low-rank characteristics, the problem is solvable by low-rank approximation based on matrix factorization. The new formulation circumvents the well-known drawbacks of existing approaches based on Euclidean embedding. A new algorithm, so-called Decentralized Matrix Factorization by Stochastic Gradient Descent (DMFSGD), is proposed. By letting network nodes exchange messages with each other, the algorithm is fully decentralized and only requires each node to collect and to process local measurements, with neither explicit matrix constructions nor special nodes such as landmarks and central servers. In addition, we compared comprehensively matrix factorization and Euclidean embedding to demonstrate the suitability of the former on network distance prediction. We further studied the incorporation of a robust loss function and of nonnegativity constraints. Extensive experiments on various publicly available datasets of network delays show not only the scalability and the accuracy of our approach, but also its usability in real Internet applications.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available