Distributed Storage System for Electric Power Data Based on HBase

Bibliographic Details
Main Authors: Jiahui Jin, Aibo Song, Huan Gong, Yingying Xue, Mingyang Du, Fang Dong, Junzhou Luo
Format: Article
Language: English
Published: Tsinghua University Press 2018-12-01
Series: Big Data Mining and Analytics
Online Access: https://www.sciopen.com/article/10.26599/BDMA.2018.9020026
Description
Summary: Managing massive electric power data is a typical big data application because electric power systems generate millions or billions of status, debugging, and error records every day. To guarantee the safety and sustainability of electric power systems, these data need to be processed and analyzed quickly to support real-time decisions. Traditional solutions typically use relational databases to manage electric power data. However, relational databases cannot efficiently process and analyze electric power data once the data size increases significantly. In this paper, we show how electric power data can be managed by using HBase, a distributed database maintained by Apache. Our system consists of clients, an HBase database, status monitors, data migration modules, and data fragmentation modules. We evaluate the performance of our system through a series of experiments. We also show how HBase’s parameters can be tuned to improve the efficiency of our system.
ISSN: 2096-0654