- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Vertex-and-edge ordering for faster parallel graph...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Vertex-and-edge ordering for faster parallel graph processing Trostanovsky, Alex
Abstract
Graph structured data, which models complex relationships, describes data in myriad domains, such as social network analysis, protein structure analysis, and supply chain management. As the size of graph data grows, researchers develop modern systems to handle massive datasets with trillions of entries. To accelerate processing, systems optimize the data layout of vertices and edges in memory or on disk, but, to date, researchers have treated the performance improvements due to vertex and edge orderings separately. This work investigates the interaction between vertex and edge orderings. We explore whether combining these orderings improves performance. An extensive performance study of different orderings finds that a specific combination, the SlashBurn vertex order and the Hilbert edge order, consistently provides the fastest runtimes for scale-free graphs. These results motivate our development of a parallelized SlashBurn and the Rhubarb edge ordering and blocking technique. Using 14 cores, Parallel SlashBurn is up to 11.96× faster than the sequential implementation. Rhubarb enables the scalable implementation of parallel edge-centric algorithms using the Hilbert curve and offers end-to-end performance speedup for the Collaborative Filtering application of up to more than 2.5× over modern parallel Graph Processing Systems.
Item Metadata
Title |
Vertex-and-edge ordering for faster parallel graph processing
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2023
|
Description |
Graph structured data, which models complex relationships, describes data in myriad domains, such as social network analysis, protein structure analysis, and supply chain management. As the size of graph data grows, researchers develop modern systems to handle massive datasets with trillions of entries. To accelerate processing, systems optimize the data layout of vertices and edges in memory or on disk, but, to date, researchers have treated the performance improvements due to vertex and edge orderings separately. This work investigates the interaction between vertex and edge
orderings. We explore whether combining these orderings improves performance. An extensive
performance study of different orderings finds that a specific combination, the SlashBurn vertex
order and the Hilbert edge order, consistently provides the fastest runtimes for scale-free graphs.
These results motivate our development of a parallelized SlashBurn and the Rhubarb edge ordering
and blocking technique. Using 14 cores, Parallel SlashBurn is up to 11.96× faster than the sequential
implementation. Rhubarb enables the scalable implementation of parallel edge-centric algorithms
using the Hilbert curve and offers end-to-end performance speedup for the Collaborative Filtering
application of up to more than 2.5× over modern parallel Graph Processing Systems.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2023-10-12
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0437140
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2023-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International