Key points are not available for this paper at this time.
The Web abounds with dyadic data that keeps increasing by every single second. Previous work has repeatedly shown the usefulness of extracting the interaction structure inside dyadic data 21, 9, 8. A commonly used tool in extracting the underlying structure is the matrix factorization, whose fame was further boosted in the Netflix challenge 26. When we were trying to replicate the same success on real-world Web dyadic data, we were seriously challenged by the scalability of available tools. We therefore in this paper report our efforts on scaling up the nonnegative matrix factorization (NMF) technique. We show that by carefully partitioning the data and arranging the computations to maximize data locality and parallelism, factorizing a tens of millions by hundreds of millions matrix with billions of nonzero cells can be accomplished within tens of hours. This result effectively assures practitioners of the scalability of NMF on Web-scale dyadic data.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chao Liu
State Grid Corporation of China (China)
Hung‐Chih Yang
National Taiwan University
Jinliang Fan
Tianjin Normal University
Microsoft (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Liu et al. (Mon,) studied this question.
synapsesocial.com/papers/6a0dbd1c389a567298baa7a3 — DOI: https://doi.org/10.1145/1772690.1772760
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: