7 November 2022 | Noor Khan
Headquartered in California, our client is a well-established Fortune 500 company worth several billion as of October 2022. They handle large volumes of broadcasting data, including audience and commercial data. We have worked with them on a number of projects to help unlock the potential of their data by continuously improving and optimising data performance.
Our client deals with huge volumes of data and was experiencing delays in reporting. They were running around 80 reports, and each report took around 4 to 5 minutes to produce. Across the full set of 80 reports, those per-report delays added up to a considerable amount of time. Our client was therefore looking for an alternative solution that would significantly improve their data reporting turnaround.
Our client's existing data infrastructure was built on Amazon Redshift. Redshift is a powerful technology; however, it presented data delay challenges for our client's data. We therefore recommended Databricks as an alternative for processing data quickly and efficiently. Databricks offered scalable, efficient and quicker data processing through independent clusters that can run in parallel.
Databricks clusters
Our highly experienced data engineers created three clusters on Databricks: Cluster A for storing all data, Cluster B for running ETL, and Cluster C for handling any issues and delays, which could then be moved to a new cluster to enable parallel processing. One of the biggest benefits of Databricks is the ability to create as many clusters as required to process data in parallel. This made data processing much quicker and more efficient, improving data reporting speed by 80%.
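To see why parallel processing matters for a batch of reports, a rough back-of-the-envelope estimate helps. The sketch below is illustrative only: it assumes every report takes the same time and that parallel workers do not slow each other down. The function name and the worker count of 5 are hypothetical and are not figures from the project; only the 80 reports and the 4 to 5 minutes per report come from the case study.

```python
import math


def ideal_batch_minutes(num_reports: int, minutes_per_report: float,
                        parallel_workers: int) -> float:
    """Idealised wall-clock time to produce a batch of equally sized reports.

    Reports are assumed to run in "waves" of `parallel_workers` at a time,
    with no scheduling overhead or resource contention.
    """
    if parallel_workers < 1:
        raise ValueError("parallel_workers must be at least 1")
    waves = math.ceil(num_reports / parallel_workers)
    return waves * minutes_per_report


# Figures from the case study: around 80 reports at 4 to 5 minutes each.
sequential_low = ideal_batch_minutes(80, 4, parallel_workers=1)
sequential_high = ideal_batch_minutes(80, 5, parallel_workers=1)

# Hypothetical worker count, chosen purely for illustration.
parallel_high = ideal_batch_minutes(80, 5, parallel_workers=5)

print(f"One at a time: {sequential_low:.0f} to {sequential_high:.0f} minutes")
print(f"Five at a time (assumed): up to {parallel_high:.0f} minutes")
```

Real workloads rarely scale this cleanly, so the numbers are best read as an upper bound on what parallelism alone can deliver.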
Find out more about Databricks partnership.
215 million rows of data processed hourly
Cluster B, where the ETL process runs, has 16 nodes and processes a huge amount of data. Approximately 9 million rows of viewing data and 215 million rows of commercial data are processed every hour, around the clock, every day.
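To put those hourly volumes in perspective, the short calculation below converts them into per-second and per-day figures. It is a rough sketch: it assumes rows arrive evenly through the hour and are spread evenly across the 16 nodes, which the case study does not state. The variable names are illustrative.

```python
# Hourly volumes and node count as quoted in the case study (approximate).
VIEWING_ROWS_PER_HOUR = 9_000_000
COMMERCIAL_ROWS_PER_HOUR = 215_000_000
ETL_NODES = 16

total_rows_per_hour = VIEWING_ROWS_PER_HOUR + COMMERCIAL_ROWS_PER_HOUR

# Assumption: load is uniform over the hour and across nodes.
rows_per_second = total_rows_per_hour / 3600
rows_per_second_per_node = rows_per_second / ETL_NODES
rows_per_day = total_rows_per_hour * 24

print(f"Total rows per hour:      {total_rows_per_hour:,}")
print(f"Rows per second:          {rows_per_second:,.0f}")
print(f"Rows per second per node: {rows_per_second_per_node:,.0f}")
print(f"Rows per day:             {rows_per_day:,}")
```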
Errors and optimisation
As the streams of data are constantly flowing, our engineers provide operational monitoring and support to spot errors, resolve issues and continuously make recommendations to improve and optimise data performance. PagerDuty is employed for error alerts, which are then resolved by the Ardent data engineers.
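The case study does not describe how PagerDuty is wired into the pipeline. As one possible approach, the sketch below builds a trigger event in the shape expected by the PagerDuty Events API v2. The routing key, summary text and source name are placeholders, the helper name is hypothetical, and the code only constructs the payload without sending it.

```python
import json

# Severity levels accepted by the PagerDuty Events API v2.
ALLOWED_SEVERITIES = {"critical", "error", "warning", "info"}


def build_pagerduty_event(routing_key: str, summary: str, source: str,
                          severity: str = "error") -> dict:
    """Build a 'trigger' event body for the PagerDuty Events API v2.

    This is a hypothetical helper for illustration, not the project's code.
    """
    if severity not in ALLOWED_SEVERITIES:
        raise ValueError(f"severity must be one of {sorted(ALLOWED_SEVERITIES)}")
    return {
        "routing_key": routing_key,
        "event_action": "trigger",
        "payload": {
            "summary": summary,
            "source": source,
            "severity": severity,
        },
    }


# Placeholder values; a real integration key comes from a PagerDuty service.
event = build_pagerduty_event(
    routing_key="YOUR_INTEGRATION_KEY",
    summary="ETL job failed on Cluster B",
    source="cluster-b-etl",
)
print(json.dumps(event, indent=2))
```

In a live setup the JSON body would be sent as an HTTP POST to the Events API v2 enqueue endpoint, typically from the job's failure handler.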
Overall, our client can significantly reduce data processing and reporting time with the adoption of Databricks. This offers them many benefits, from improved productivity to a better data turnaround time for end clients. They have peace of mind with the operational and monitoring support, as any errors and issues that arise will be resolved quickly and efficiently. Additionally, the Ardent project team and the client's data science team meet regularly to discuss progress and optimisation suggestions.
Explore Ardent data engineering services.