Gbase

Kang, U.; Tong, Hanghang; Sun, Jimeng; Lin, Ching-Yung; Faloutsos, Christos

Published in

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '11

DOI: 10.1145/2020408.2020580

Tools

Export citation

Search in Google Scholar

Gbase

Proceedings article published in 2011 by U. Kang, Hanghang Tong

, Jimeng Sun, Ching-Yung Lin, Christos Faloutsos

This paper was not found in any repository; the policy of its publisher is unknown or unclear.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Graphs appear in numerous applications including cyber-security, the Internet, social networks, protein networks, recommendation systems, and many more. Graphs with millions or even billions of nodes and edges are common-place. How to store such large graphs efficiently? What are the core operations/queries on those graph? How to answer the graph queries quickly? We propose GBASE, a scalable and general graph management and mining system. The key novelties lie in 1) our storage and compression scheme for a parallel setting and 2) the carefully chosen graph operations and their efficient implementation. We designed and implemented an instance of GBASE using MapReduce/Hadoop. GBASE provides a parallel indexing mechanism for graph mining operations that both saves storage space, as well as accelerates queries. We ran numerous experiments on real graphs, spanning billions of nodes and edges, and we show that our proposed GBASE is indeed fast, scalable and nimble, with significant savings in space and time.

Published in

Links

Tools

Gbase

Abstract