Header menu link for other important links
X
Gluon-async: A bulk-asynchronous system for distributed and heterogeneous graph analytics
Published in Institute of Electrical and Electronics Engineers Inc.
2019
Volume: 2019-September
   
Pages: 15 - 28
Abstract
Distributed graph analytics systems for CPUs, like D-Galois and Gemini, and for GPUs, like D-IrGL and Lux, use a bulk-synchronous parallel (BSP) programming and execution model. BSP permits bulk-communication and uses large messages which are supported efficiently by current message transport layers, but bulk-synchronization can exacerbate the performance impact of load imbalance because a round cannot be completed until every host has completed that round. Asynchronous distributed graph analytics systems circumvent this problem by permitting hosts to make progress at their own pace, but existing systems either use global locks and send small messages or send large messages but do not support general partitioning policies such as vertex-cuts. Consequently, they perform substantially worse than bulk-synchronous systems. Moreover, none of their programming or execution models can be easily adapted for heterogeneous devices like GPUs. In this paper, we design and implement a lock-free, non-blocking, bulk-Asynchronous runtime called Gluon-Async for distributed and heterogeneous graph analytics. The runtime supports any partitioning policy and uses bulk-communication. We present the bulk-Asynchronous parallel (BASP) model which allows the programmer to utilize the runtime by specifying only the abstract communication required. Applications written in this model are compared with the BSP programs written using (1) D-Galois and D-IrGL, the state-of-The-Art distributed graph analytics systems (which are bulk-synchronous) for CPUs and GPUs, respectively, and (2) Lux, another (bulk-synchronous) distributed GPU graph analytical system. Our evaluation shows that programs written using BASP-style execution are on average ~1.5x faster than those in D-Galois and D-IrGL on real-world large-diameter graphs at scale. They are also on average ~12x faster than Lux. To the best of our knowledge, Gluon-Async is the first asynchronous distributed GPU graph analytics system. © 2019 IEEE.
About the journal
JournalData powered by TypesetParallel Architectures and Compilation Techniques - Conference Proceedings, PACT
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN1089795X
Open AccessNo
Concepts (14)
  •  related image
    Application programs
  •  related image
    Iridium compounds
  •  related image
    Locks (fasteners)
  •  related image
    Lutetium compounds
  •  related image
    Program processors
  •  related image
    ASYNCHRONOUS PARALLEL
  •  related image
    BSP MODEL
  •  related image
    BULK SYNCHRONOUS PARALLEL
  •  related image
    DESIGN AND IMPLEMENTS
  •  related image
    DISTRIBUTED AND HETEROGENEOUS
  •  related image
    GRAPH ANALYTICS
  •  related image
    Heterogeneous devices
  •  related image
    HETEROGENEOUS GRAPH
  •  related image
    Parallel architectures