Setting up Data Infrastructure to Optimize Marketing Efforts: Simple App Case


The Simple team invested in marketing initiatives in the USA geo heavily but couldn’t efficiently assess the ROI and effectiveness of each marketing channel. To overcome the challenge, they decided to become data mature and analyze marketing data. Since the company lacked an in-house data analytics team, they turned to a reliable partner, Valiotti Analytics, who set up data infrastructure and relevant reports to monitor marketing metrics.


  1. Optimized marketing strategy for lead generation through extensive reports on marketing attribution and channel efficiency in Redash

Thanks to the set-up data infrastructure, the Client managed to evaluate marketing attribution and the efficiency of different channels, which allowed them to better allocate their marketing budget and, therefore, optimize costs.

The reports are created in Redash and regularly updated, allowing the marketing team to access recent insights into marketing performance. To achieve this, the Valiotti Analytics team built the analytical infrastructure from scratch:

Analytical infrastructure from scratch

To provide a comprehensive overview of marketing efforts, there were several data sources:

  1. Facebook API data was automatically sourced from Facebook with a Python script every 30 minutes. It was transferred to the Clickhouse database.
  2. Product Event and Billing data was generated and sent by the app through SDK and a third-party service, RevenueCat, accordingly. Some RevenueCat events were stored in an S3 bucket and then sent to ClickHouse with a bash script.

Apache Kafka is an intermediary between a data source and a Clickhouse database. The solution allows numerous streams of data to be collected and stored for further extraction, in case they are lost or unavailable in Clickhouse. The collected data is stored according to various partitions to ensure easy access and navigation. The Kafka cluster is located on AWS Cloud, where a set of VPS (virtual private servers) is formed, which forms a cluster from 3 remote nodes. If one node or two nodes fail, the collection still remains, which ensures the stability of streamlined data collection. In addition, ClickHouse is connected to the Kafka cluster.

Clickhouse, as a data management system, regularly takes the collected data from Kafka and stores it. It also forms the data into tables, which are constantly monitored by a specific engine that removes duplicates. Clickhouse facilitates the completion of requests, allowing for seamless report building in Redash, as it sorts data properly, assigns it to the relevant partitions, and stores it on the hard drive.

Having the infrastructure set up, we thought through the visualization to find correlations between the allocated marketing budget across channels and received revenue. The up-to-date reports are delivered in Redash.


  • If you have a large marketing budget, invest in data maturity from the beginning. This will help you optimize your initiatives and spending.
  • Collect all the possible data on conversions to have a broader picture of your marketing efforts. This will help you to analyze the investments more accurately and, hence, optimize them.
  • Automate reporting to avoid manual mistakes and improve the accuracy of your reports. Otherwise, you risk coming up with misleading insights and, as a result, losing money.

Learn How Data Insights Can Benefit Your Business

Wondering what value data insights can bring your business? Get in touch, and we'll answer your questions!

Contact Us

Other Case Studies

  • Buff

    Regained focus on business KPIs with dashboards transforming data into revenue.

    Read more
  • Scalista

    Refined an ETL Project for Flawless Performance and Seamless Data Workflows

    Read more
  • betPawa

    A Flexible and Scalable DWH system Re-Built from Scratch with Improved Data Processing Time and Quality

    Read more