Skip to main content

Storing Request Statistics Using Redis

If we build a backend service, we probably need to keep any statistics related to our service such as number of requests, average time for processing, number of errors, etc. We may also need to have statistic records in several time precisions for example hourly and daily. The simplest solution, we can just store any information into a table in our database, then calculate the summary when needed or by a request. But, it will cost our system storage and computing resources.

Unlike relational databases or NoSQL that only have CRUD operations in general, Redis operations are related to the type of data that is stored. For example, a list will have push or pop operations, a sorted set will have a ranking, incrementing, or union operation, and so on. For instance for storing request statistics in Redis, we will store statistics of the response time of a backend application. The metrics are minimum time, maximum time, number of responses, and total response time. The last two metrics can be used for calculating the average time to respond. We will also store the statistics for hourly and daily information.

We need several records in sorted sets to store the following information.

  • List of starting times of all periods
  • Metrics of each period
  • Temporary data for aggregating minimum and maximum values

A list of starting times is used to help us to list all metrics records that we have. The record of metrics of each period will be named with starting time of the period.

Our backend application will call a function every time it handles a request with a response time value in milliseconds as the first parameter of the function.

All metrics in a period will be aggregated in a single record directly. Every time the function is called, it will calculate the period based on the calling time and the precision value.

The processes are as follows.

  1. Calculate starting time of current period
  2. Store the response time value in temporary records as both minimum and maximum values of response time
  3. Aggregate data in temporary records with matrics record of the current period
  4. Increment number of responses and total response time in the matrics record of the current period

For this example, we utilize Node.js with the redis module. We also need uuid for generating a random key for temporary records. The implementation is as follows.

function captureStats(responseTime) {
  const now = Date.now();
  const hourly = 3600 * 1000;
  const daily = 24 * 3600 * 1000;
  
  const tempKeyMin = `stats:temp:${uuidv1()}`;
  const tempKeyMax = `stats:temp:${uuidv1()}`;

  // get starting time of the period
  const stimeHourly = Math.floor(now / hourly) * hourly;
  const stimeDaily = Math.floor(now / daily) * daily;
  
  // to store starting time of each period
  const timesHourlyKey = `stats:times:${hourly}`;
  const timesDailyKey = `stats:times:${daily}`;
  
  // to store metrics of each period
  const statsHourlyKey = `stats:${hourly}:${stimeHourly}`; 
  const statsDailyKey = `stats:${daily}:${stimeDaily}`; 

  redisClient.multi()
    .zadd(timesHourlyKey, now, stimeHourly)
    .zadd(timesDailyKey, now, stimeDaily)
    .zadd(tempKeyMin, responseTime, 'min')
    .zadd(tempKeyMax, responseTime, 'max')
    .zunionstore(statsHourlyKey, 2, statsHourlyKey, tempKeyMin, 'aggregate', 'min')
    .zunionstore(statsHourlyKey, 2, statsHourlyKey, tempKeyMax, 'aggregate', 'max')
    .zunionstore(statsDailyKey, 2, statsDailyKey, tempKeyMin, 'aggregate', 'min')
    .zunionstore(statsDailyKey, 2, statsDailyKey, tempKeyMax, 'aggregate', 'max')
    .zincrby(statsHourlyKey, 1, 'count')
    .zincrby(statsDailyKey, 1, 'count')
    .zincrby(statsHourlyKey, responseTime, 'sum')
    .zincrby(statsDailyKey, responseTime, 'sum')
    .expire(tempKeyMin, 10)
    .expire(tempKeyMax, 10)
    .exec((err, results) => {
      // check
    });

}

Each metric record will have the following parameters.

  • min. The value will be updated by aggregating the latest value in the temporary record with the current value in the metric record using zunionstore and the aggregate min method.
  • max. The value will be updated by aggregating the latest value in the temporary record with the current value in the metric record using zunionstore and the aggregate max method.
  • count. The value will be incremented by one in each function call.
  • sum. The value will be incremented by the response time value in each function call.

Comments

Popular posts from this blog

Increase of Malicious Activities and Implementation of reCaptcha

In recent time, I've seen the increase of malicious activities such as login attempts or phishing emails to some accounts I manage. Let me list some of them and the actions taken. SSH Access Attempts This happened on a server that host a Gitlab server. Because of this case, I started to limit the incoming traffic to the server using internal and cloud firewall provided by the cloud provider. I limit the exposed ports, connected network interfaces, and allowed protocols. Phishing Attempts This typically happened through email and messaging platform such as Whatsapp and Facebook Page messaging. The malicious actors tried to share a suspicious link lured as invoice, support ticket, or something else. Malicious links shared Spammy Bot The actors leverage one of public endpoint on my website to send emails. Actually, the emails won't be forwarded anywhere except to my own email so this just full my inbox. This bot is quite active, but I'm still not sure what...

Configuring Swap Memory on Ubuntu Using Ansible

If we maintain a Linux machine with a low memory capacity while we are required to run an application with high memory consumption, enabling swap memory is an option. Ansible can be utilized as a helper tool to automate the creation of swap memory. A swap file can be allocated in the available storage of the machine. The swap file then can be assigned as a swap memory. Firstly, we should prepare the inventory file. The following snippet is an example, you must provide your own configuration. [server] 192.168.1.2 [server:vars] ansible_user=root ansible_ssh_private_key_file=~/.ssh/id_rsa Secondly, we need to prepare the task file that contains not only the tasks but also some variables and connection information. For instance, we set /swapfile  as the name of our swap file. We also set the swap memory size to 2GB and the swappiness level to 60. - hosts: server become: true vars: swap_vars: size: 2G swappiness: 60 For simplicity, we only check the...

Deliver SaaS According Twelve-Factor App

If you haven't heard of  the twelve-factor app , it gives us a recommendation or a methodology for developing SaaS or web apps structured into twelve items. The recommendation has some connections with microservice architecture and cloud-native environments which become more popular today. We can learn the details on its website . In this post, we will do a quick review of the twelve points. One Codebase Multiple Deployment We should maintain only one codebase for our application even though the application may be deployed into multiple environments like development, staging, and production. Having multiple codebases will lead to any kinds of complicated issues. Explicitly State Dependencies All the dependencies for running our application should be stated in the project itself. Many programming languages have a kind of file that maintains a list of the dependencies like package.json in Node.js. We should also be aware of the dependencies related to the pla...

Kenshin VS The Assassin

It is an assassin versus assassin.

Free Cloud Services from UpCloud

Although I typically deploy my development environment or experimental services on UpCloud , I do not always stay updated on its announcements. Recently, I discovered that UpCloud has introduced a new plan called the Essentials plan, which enables certain cloud services to be deployed at no cost. The complimentary services are generally associated with network components or serve as the foundation for other cloud services. This feature is particularly useful when retaining foundational services, such as a load balancer, is necessary, while tearing down all services and reconfiguring the DNS and other application settings each time we temporarily clean up infrastructure to reduce costs is undesirable.  When reviewing the service specifications of the cloud services in the Essentials plan, they appear to be very similar to those in the Development plan. The difference in service levels is unclear, but it could be related to hardware or resource allocation. For instance, the loa...

What's Good About Strapi, a Headless CMS

Recently, I've been revisiting Strapi as a solution for building backend systems. I still think this headless CMS can be quite useful in certain cases, especially for faster prototyping or creating common websites like company profiles or e-commerce platforms. It might even have the potential to handle more complex systems. With the release of version 5, I'm curious to know what updates it brings. Strapi has launched a new documentation page, and it already feels like an improvement in navigation and content structure compared to the previous version. That said, there's still room for improvement, particularly when it comes to use cases and best practices for working with Strapi. In my opinion, Strapi stands out with some compelling features that could catch developers' attention. I believe three key aspects of Strapi offer notable advantages. First, the content-type builder feature lets us design the data structure of an entity or database model, including field ...