Big Data

15:56, 05/05/2020

Big Data

VNPT Big Data Platform is a centralized platform for computing data at a large scale, providing services to a wide range of clients from governmental authorities to SMEs. Based on distributed computing and storing data methods, VNPT Big Data Platform can easily cope with the processing of grand-scaled data sources, which catalyzes the introduction of new services as well as optimization of existing ones.

VNPT Big Data Platform can also improve the administrative and operational capability of the Government and save costs; while helping other companies to optimize their technical network systems and enabling individual customers to experience smart services.

 

Since brought into operation in 2017, the VNPT Big Data Platform has constantly been expanded and improved. There has been continually ongoing research on new data storing technology, databases, machine learning algorithms as well as neural networks in search of the most optimal tools for different result analysis. Since 2018, VNPT Platform has been assigned to be the main platform for implementing the Data Lake project, which is a centralized storage for the entire data of VNPT Corporation.

The VNPT Big Data platform consisted of the following components:

  • Data source: includes the entire internal data of VNPT (telecommunication, information technology, logs) as well as external data such as social medias.
  • Distributed data storage facility: has the storage capacity of Petabytes (PB). There are several parallel processing units with computing capability of TBs per second to cope with real-time throughput which can amount to GBs per second as well as multi-dimension operators for the real-time computation of multi-dimension KPIs of Intelligence systems.
  • Visualization: uses the best platforms and libraries to demonstrate from basic simple diagrams to graphical representation of multi-dimensional relations, which helps analysts and decision-makers have a better understanding of the subject in question.
  • AI & ML: it is necessary to mention the involvement of AI in order to produce desired results from TBs of data. The Big Data platform allows for flexible usage of different algorithms to build different models based on the problems which cannot be solved by human.