Case Studies

Automating ETL and Improving Customer Experience

Automobile Buying & Selling

Client is building a modern-day marketplace to make car buying and selling more easy, convenient and transparent, and bringing more of the transaction online. They are designing a shopping experience customized to individual preferences. There is no “one-size-fits-all” solution when it comes to purchasing a car.
  • Category: Spark Streaming Analytics
  • Date: 2023
  • Location: USA

A Global Automotive Company Automating ETL and Improving Customer Personalization Using Amazon Kinesis Data Streams

 

Challenge

Managing High Data Latency and Lack of Control Over Web Traffic Analytics: The client faced issues with its third-party analytics provider, which included high data latency, limited control over web traffic, and difficulty in adding or removing data fields. These limitations hindered the client’s ability to gain quick insights, detect anomalies, and improve performance in their CI/CD environments. Additionally, its largest dataset originated outside of its AWS infrastructure, leading to inefficiencies in processing and delivering near-real-time analytics.

Solution

Solution | Improving Time to Insight for Clickstream Analytics by 48 Times Using Amazon Kinesis Data Streams

The company implemented a solution that uses Amazon Kinesis Data Streams and Spark Streaming to automate the ETL process, enabling near-real-time data ingestion and analysis. Data is processed every 10 seconds, significantly reducing the latency from hours to minutes. This resulted in a 48-time improvement in time to insight for clickstream data, reducing query time from 4 hours to 5 minutes.

This solution supports continuous integration and delivery (CI/CD) and A/B testing, allowing near-instantaneous detection of issues and faster iterations of product features. With real-time web insights and JSON support, the company can quickly identify and resolve anomalies, optimize customer experiences, and personalize user interactions offline by analyzing and leveraging behavioural data almost instantly.

Benefits

  • Near-Real-Time Data Insights: The new solution enables quick access to data insights, allowing for fast reactions and decision-making.
  • Reduced Latency: Data is processed with minimal latency, providing real-time analytics that significantly improves operational efficiency.
  • Improved Personalization: The company can now personalize customer interactions within seconds based on near-real-time analytics of user sessions.
  • Enhanced CI/CD Support: Continuous integration and delivery processes are supported with rapid detection of issues, leading to quicker resolutions and iterations.
  • Frictionless Data Flow: Data is delivered seamlessly to the relevant teams, improving workflow and reducing inefficiencies.
  • Scalable Architecture: The streaming data architecture allows for ongoing innovation and scalability, supporting future personalization and analytics goals.

“Using Amazon Kinesis Data Streams provides data to the appropriate teams in a consumable manner and reduces all friction points.”

Distinguished Engineer

Tools Used for Building the Solution on AWS

  • Amazon Kinesis Data Streams
  • Amazon Serverless Computing
  • Amazon Simple Storage Service (S3)