The Client: one of Europe's largest providers of mobile services, online TV, and mobile payment solutions.
Project implementation period: 6 months.
The client used the Apache Hive storage system based on the Hadoop platform (this is proprietary software) to work with corporate data.
The client contacted us with a widespread problem. The volume of the company's data was constantly growing. The need to store it carried with it an increase in the cost of using storage for the following reasons:
The goal of the project was to provide the opportunity to accumulate new data and store historical data without having to degrade storage performance and increase the cost of ownership.
The tasks that we have set to achieve the goal of the project:
At the project's start, the data volume was about 260 TB. In comparison, the estimated volume of information by the end of the first year was expected to be 338 TB, which would have required increased server capacities and the purchase of additional Hive licenses.
Our experts suggested that the client replace the current software solution with an alternative one — the Greenplum distributed database. With the help of open-source software development, the proposed solution eliminated several problems while providing additional benefits.
When approaching the final stage of the project, BlitzBrain data engineering specialists took on the responsibility of training the technical specialists of the client company to work with a new solution, before handing over the necessary documentation. This enabled the client to reduce the cost of the employee onboarding process and immerse current employees in the latest technology.
As a result of the project, the client was able to: