DDoS with HTTP packets for dummies

7 February 2025 12 minutes Author: George Droyd

HTTP flood is a type of DDoS attack that aims to overload a web server with numerous HTTP requests, potentially rendering the site inaccessible to users. Most often, attackers use GET and POST requests, creating excessive load on the server.

  • Disclaimer: All information presented in this material is provided for informational purposes only. The authors do not call for any illegal actions, but only explain the principles of technology in order to increase the level of cybersecurity and awareness. If you have an irresistible desire to apply this knowledge in practice, direct your energy in the right direction – choose a worthy goal. And if someone really needs to experience all the “charms” of a cyberattack, then let it be Rusnya.


Kill network traffic

At first glance, it seems that the easiest way to attack a server via DDoS is to overload its network channel to such an extent that new packets begin to be processed slowly or are rejected altogether. This is logical, since the establishment of each new connection and the SSL Handshake process require the exchange of several packets between the client and the server.

But there is an important nuance here.

Modern servers located in data centers have a bandwidth of 10 Gbps or even more. To fully load such a channel, a huge volume of requests is required every second.

A more effective approach is to overload the server’s outgoing bandwidth, since each response can be much larger than the received request. For example, a standard HTTP request GET / HTTP/1.1\r\n\r\n takes only 18 bytes, while the server’s response can reach several kilobytes, creating a significantly larger load.
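The asymmetry is easy to quantify. A minimal sketch, assuming a hypothetical response size of 8 KiB (real pages vary widely):

```python
# Minimal HTTP/1.1 request: the request line plus the blank line ending the headers.
request = b"GET / HTTP/1.1\r\n\r\n"
print(len(request))  # 18 bytes on the wire

# Assumed response size for illustration: a typical ~8 KiB HTML page.
response_size = 8 * 1024

# Amplification factor: how many outgoing bytes each incoming byte triggers.
amplification = response_size / len(request)
print(f"amplification ≈ {amplification:.0f}x")
```

Every request byte the attacker sends thus forces the server to push back hundreds of bytes, which is why the outgoing channel saturates long before the incoming one.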

Worker exhaustion

Most web servers have a reverse proxy installed, such as Nginx, Apache HTTP Server (httpd), or other popular solutions. The main task of a reverse proxy is to accept incoming requests and redirect them to the backend for further processing.

However, the capabilities of a reverse proxy in terms of simultaneous request processing are not unlimited. Each connection is processed by a separate worker or thread, the number of which is determined by the server settings and available system resources.

If the number of connections exceeds the allowable limit, the server will not be able to allocate an additional worker, and new requests will simply remain unanswered. As a result, new users will not be able to access the resource, which will cause a denial of service.

Port exhaustion

When a request reaches the Reverse Proxy, it is forwarded to the backend server. It is worth understanding that the backend does not receive a connection directly from the client, but from a local address (for example, 10.0.0.2), which may belong to the internal Docker or Kubernetes network.

In this case, the "65 thousand connections" problem may arise. A single IP address has only 65,536 ports, which caps the number of simultaneous connections between the proxy and one backend address. If all ports are busy, the server will not be able to establish new connections and will stop responding to requests.

This situation usually occurs in the following cases:

  • Long backend processing of requests – if the server “thinks” about the response for too long, connections remain open.

  • Excessive number of simultaneous requests – if there are too many requests, all available ports can be quickly exhausted.
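Back-of-the-envelope arithmetic shows how quickly this happens. A sketch with assumed numbers (real Linux defaults are even tighter: net.ipv4.ip_local_port_range is typically 32768-60999):

```python
# Ephemeral ports available from one source IP to one (dest IP, dest port) pair.
# Assumption: the full 16-bit port space minus the first 1024 reserved ports.
total_ports = 65536
reserved = 1024
usable = total_ports - reserved  # 64512 ports

# Assumption: the backend holds each proxied connection open for 30 seconds.
hold_time_s = 30

# Exhaustion begins once the sustained request rate exceeds this threshold.
exhaustion_rate = usable / hold_time_s
print(f"~{exhaustion_rate:.0f} requests/s exhaust the port space")
```

At roughly two thousand slow requests per second, every port is tied up and new proxy-to-backend connections fail.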

The solution to this problem is to switch to UNIX sockets instead of TCP connections. A UNIX socket is addressed by a filesystem path rather than a port, so the 65,536-port limit simply does not apply.

However, even when using UNIX sockets, the server still needs to make system calls to transmit each request, which can affect performance.
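A minimal sketch of the idea: the backend listens on a socket file and the proxy connects to that path. The socket path and request are illustrative:

```python
import os
import socket
import threading

SOCK_PATH = "/tmp/demo_backend.sock"  # hypothetical path for this sketch
ready = threading.Event()

def backend():
    # The backend listens on a filesystem path instead of an IP:port,
    # so proxy-to-backend connections consume no ephemeral TCP ports.
    if os.path.exists(SOCK_PATH):
        os.unlink(SOCK_PATH)
    with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as srv:
        srv.bind(SOCK_PATH)
        srv.listen(1)
        ready.set()
        conn, _ = srv.accept()
        with conn:
            conn.sendall(conn.recv(1024))  # echo the request back

threading.Thread(target=backend, daemon=True).start()
ready.wait(timeout=2)

# "Reverse proxy" side: connect through the socket file, not an IP and port.
with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as cli:
    cli.connect(SOCK_PATH)
    cli.sendall(b"GET /hit1 HTTP/1.1\r\n\r\n")
    reply = cli.recv(1024)
print(reply)
```

In Nginx this is the difference between `proxy_pass http://127.0.0.1:8000;` and `proxy_pass http://unix:/tmp/demo_backend.sock;`.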

Running out of RAM

Imagine a situation: 100 connections sending 1 megabyte of data each. The server has to store this data somewhere, and it usually does so in RAM. At this point, it still looks acceptable.

But now imagine that the number of connections grows to 1000, and each of them transfers 5 megabytes of data. That's 5,000 megabytes, almost 5 gigabytes of memory! If RAM runs out, Linux experiences an Out of Memory (OOM) situation.
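The arithmetic from the paragraph above, spelled out:

```python
connections = 1000
payload_mb = 5  # megabytes buffered per connection

total_mb = connections * payload_mb
# 5000 MB is 5 GB in decimal units, or ~4.88 GiB in binary units.
print(f"{total_mb} MB buffered, i.e. {total_mb / 1024:.2f} GiB held in RAM")
```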

And how does Linux act in this case?

It simply kills the process that consumes the most memory to free up resources. In our case, that will be the web server or a backend process, which stops the service completely until a supervisor (for example, systemd) restarts it.

Thread starvation

Another serious problem is thread starvation. When an excessive number of requests are received by the backend at the same time, the server may face a lack of resources to process them.

In such a situation, each request takes a certain amount of time to execute, but due to the limited capabilities of the system, the operating system begins to intensively switch the context between threads. This switching is an extremely resource-intensive process.

As a result, most of the processor time is spent not on processing requests, but on switching the context between threads. This significantly reduces performance and can even cause the server to completely freeze.

If the server uses asynchronous processing instead of threads, the event loop must switch between tasks according to their priority. However, when there are too many such tasks, switching between asynchronous functions also begins to slow down, which negatively affects the performance of the entire system.
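The scheduling overhead can be observed directly. A sketch using asyncio: each task does no real work, only yields back to the event loop, so the measured time is almost pure task-switching cost (the task counts are arbitrary):

```python
import asyncio
import time

async def handle_request(i):
    # Simulate a request handler that does nothing but yield to the loop,
    # i.e. one "context switch" per task.
    await asyncio.sleep(0)
    return i

async def run_batch(n):
    start = time.perf_counter()
    results = await asyncio.gather(*(handle_request(i) for i in range(n)))
    return time.perf_counter() - start, results

# With few tasks the loop switches cheaply; with very many, pure scheduling
# overhead starts to dominate the wall-clock time.
small, _ = asyncio.run(run_batch(1_000))
large, _ = asyncio.run(run_batch(100_000))
print(f"1k tasks: {small:.4f}s, 100k tasks: {large:.4f}s")
```

The same effect is worse with OS threads, where each switch also involves the kernel scheduler and cache/TLB churn.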

Limits on the number of system calls

Let’s imagine the following situation: a client sends a request to the server to authenticate. Let’s calculate what the client needs to do:

  1. Establish a connection.

  2. Perform an SSL handshake.

  3. Write the request to the socket.

  4. Wait for a response from the server.

Now let’s consider what the server should do in response:

  1. Accept the connection.

  2. Perform an SSL handshake.

  3. Register the connection with epoll to monitor it.

  4. Open a connection to the backend and pass the request.

  5. The backend should read the request and connect to the database.

  6. The database performs a read or write, after which the backend generates a response.

As you can see, the number of system calls on the server far exceeds the number on the client side. Although modern processors can handle a huge number of such calls, this resource is not unlimited.

Under excessive load, even the most powerful processor can reach its limit, since the ability to process system calls per second is limited. In this case, the server can no longer cope with requests, which can lead to a serious decrease in performance or even a complete failure.

“A blow to the wallet”

Hosting providers such as Google Cloud, AWS, DigitalOcean, and others have quotas for outgoing traffic. For example, DigitalOcean charges an additional fee of $0.01 per gigabyte for exceeding the quota.

How can this be used? It is enough to find a large file on the server and download it many times. Thus, we quickly exhaust the quota for outgoing traffic.

Experience and experiments show that this quota can be exceeded very quickly, causing financial losses to the server owner.

Defining server configuration

It is important to properly configure key server parameters such as connection timeout, client max body, and the number of requests per connection (thanks to Connection: keep-alive). They determine how efficiently the server will handle incoming requests and how resilient it will be to load.

1. Connection Timeout

The connection timeout value controls how long the server waits before closing an idle connection. For example, if the timeout is 1 minute, you can send a request a second before it closes, while opening hundreds or even thousands of new connections in parallel. However, keep the limitation in mind: one IP address has only 65,536 available ports, which caps how far such actions can scale.

2. Client Max Body

Another important parameter is client max body, which determines the maximum size of the request body. Using its value, you can force the server to spend more resources by sending the largest possible files or voluminous data. If the size exceeds the limit, the server may reject the request or close the connection. To make the most of the available time, it is worth combining client max body with connection timeout.
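A sketch of building a request that sits just under an assumed limit. Here the limit of 10 MB, the `/upload` path, and the host name are all hypothetical (Nginx's `client_max_body_size` actually defaults to 1 MB):

```python
# Assumption: the target's client_max_body_size is 10 MB.
limit = 10 * 1024 * 1024

# One byte under the limit, so the server accepts and buffers the whole body
# instead of rejecting it early with 413 Request Entity Too Large.
body = b"A" * (limit - 1)

headers = (
    b"POST /upload HTTP/1.1\r\n"          # hypothetical endpoint
    b"Host: example.com\r\n"
    b"Content-Length: " + str(len(body)).encode() + b"\r\n\r\n"
)
request = headers + body
print(f"request size: {len(request) / 1e6:.1f} MB")
```

Sending such bodies slowly, within the connection timeout, keeps the server's buffers occupied for the maximum possible time.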

3. CPU load testing

Another way to evaluate server performance is to test the CPU load. One method is to send a large JSON object, which requires significant computing resources for parsing.

How does it work?

  • First, a JSON request is sent, which the server processes for, say, 2 seconds.

  • Then the same JSON is sent in parallel connections to see if its processing time increases.

  • If the processing time starts to increase, this means that the server does not have enough processor cores to process requests simultaneously and may have reached its limit.
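A sketch of the single-connection baseline measurement: generate a large JSON document and time how long a pure-CPU parse takes. The payload shape and size are arbitrary assumptions:

```python
import json
import time

# Build a large JSON document; assumption: a few MB is enough to make
# parsing measurably CPU-bound on one core.
payload = json.dumps(
    {"items": [{"id": i, "tags": ["a"] * 20} for i in range(50_000)]}
)
print(f"payload: {len(payload) / 1e6:.1f} MB")

start = time.perf_counter()
parsed = json.loads(payload)  # the pure-CPU work the server does per request
elapsed = time.perf_counter() - start
print(f"parse took {elapsed:.3f}s for {len(parsed['items'])} items")
```

Repeating the same request over N parallel connections and watching whether per-request time grows reveals when the server has run out of spare cores.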

The importance of these parameters

Knowing how your server is configured and what resources your hosting provider provides can help you develop an effective plan for both stress testing and configuration vulnerability detection. This allows you to assess your server’s vulnerability and predict its behavior under high load.

Experiment

To prove or disprove something, you have to try it. For this purpose, a small web API with several endpoints was developed:

  • /hit1 – Requests to this endpoint are written to Redis. Every 10 seconds, Celery bulk-transfers these records to the database.

  • /hit2 – Requests are immediately written directly to the database.
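The difference between the two endpoints is a write-batching pattern. A minimal stand-in sketch, where a list plays Redis, a function plays the database, and `flush_batch` plays the periodic Celery job (all names are illustrative, not the article's actual code):

```python
db_writes = []     # each call to the "database" appends one round-trip here
redis_buffer = []  # /hit1 buffers hits here, as Redis does in the article

def db_insert(rows):
    db_writes.append(list(rows))  # one database round-trip per call

def hit2():
    # /hit2: every single request costs a direct database write.
    db_insert([{"endpoint": "/hit2"}])

def hit1():
    # /hit1: requests only touch the fast in-memory buffer.
    redis_buffer.append({"endpoint": "/hit1"})

def flush_batch():
    # Celery-style periodic task: one bulk insert per interval,
    # no matter how many requests arrived in the meantime.
    if redis_buffer:
        db_insert(redis_buffer)
        redis_buffer.clear()

for _ in range(1000):
    hit2()
for _ in range(1000):
    hit1()
flush_batch()

print(f"/hit2 caused {sum(len(w) == 1 for w in db_writes)} DB round-trips")
print(f"/hit1 caused 1 bulk write of {len(db_writes[-1])} rows")
```

The same thousand requests cost a thousand database round-trips via /hit2 but only one via /hit1, which is exactly the gap the experiment below measures.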

Environment settings

For performance testing, the server is running on a device with an 8-core CPU, and the attack is performed from another physical device on the local network. The main goal is to create a load on the server, compare performance, and draw appropriate conclusions.

Technical details

  • The backend is implemented in Python using the Sanic Web Framework.

  • Sanic Workers are used because they have shown better performance than Uvicorn in tests.

  • Database – PostgreSQL 17.

  • The server uses SSL certificates for a secure connection.

Running an experiment

To create the load, a Python script was written that sends 10,000 requests to the /hit2 endpoint. The aiohttp library is used, which allows requests to be sent asynchronously, minimizing resource consumption on the client side.
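The shape of such a load generator can be sketched with the standard library alone. This is not the article's script: it fires concurrent requests at a local stand-in server instead of a real target, and aiohttp layers connection pooling and real HTTP parsing on top of the same asyncio machinery:

```python
import asyncio

async def dummy_target(reader, writer):
    # Local stand-in for the attacked server: read the request line, reply, close.
    await reader.readline()
    writer.write(b"HTTP/1.1 200 OK\r\nContent-Length: 0\r\n\r\n")
    await writer.drain()
    writer.close()

async def fire(port):
    # One request: connect, send, read the status line.
    reader, writer = await asyncio.open_connection("127.0.0.1", port)
    writer.write(b"GET /hit2 HTTP/1.1\r\n\r\n")
    await writer.drain()
    status = await reader.readline()
    writer.close()
    return status

async def main(n):
    server = await asyncio.start_server(dummy_target, "127.0.0.1", 0, backlog=256)
    port = server.sockets[0].getsockname()[1]
    # All n requests run concurrently on one event loop: cheap for the
    # client, expensive for the server.
    results = await asyncio.gather(*(fire(port) for _ in range(n)))
    server.close()
    await server.wait_closed()
    return results

results = asyncio.run(main(200))
print(sum(r.startswith(b"HTTP/1.1 200") for r in results), "OK responses")
```

This asymmetry is why the test script barely loaded the client device while pushing the 8-core server to nearly 100%.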

Test results

Despite the large number of requests, the script had almost no impact on the performance of the client device. However, the server with an 8-core CPU was loaded at almost 100%.

  • The graph shows an increase in the number of database entries.

  • The server is unable to consistently handle more than 600 requests per second, indicating possible limitations in processor power or database bandwidth.

These results demonstrate that under high load, the server quickly reaches its limits, and to improve performance, it is necessary to optimize CPU usage and the approach to processing requests.

Results for /hit1

The next stage was testing requests to the /hit1 endpoint, where a database write is performed every 10 seconds using Celery, an asynchronous task manager for Python.

  • Less server load: Our server’s CPU was not as heavily loaded as in the case of /hit2.

  • Higher performance: The graph clearly shows that the server processed 3250 requests in one second.

Conclusions to this experiment

  • Blindly sending a large number of requests at arbitrary endpoints does not guarantee a significant load on the server. Even with a high frequency of requests, such flooding can be inefficient.

  • A more effective approach is to identify weak points in the architecture. In this case, /hit2 writes data directly to the database, which makes it a bottleneck. Instead, /hit1 uses Celery, which allows it to withstand the load better and ensures more efficient server operation.

We open a large number of connections

A script was created using Python that opens a large number of connections and keeps them active for as long as possible. This technique is similar to the Slowloris attack, which aims to exhaust server resources by keeping a large number of connections open.

How to open many connections?

To effectively load the server, you should consider several important points:

  • ✔ Set the server timeout – find out how long the server keeps a connection open before forcibly closing it.

  • ✔ Create connection groups – open a large number of connections and manage them in groups for more control.

  • ✔ Send data in parts – to keep the connection alive, send fragments of the request a few seconds before the timeout expires.

  • ✔ Find a “cheap” request – choose the request that has the least load on the client side, but takes a long time to process on the server.

This approach allows you to effectively overload the server, forcing it to spend significant resources on maintaining open connections.

 

As a result, Nginx exhausted all available workers, which made it impossible to establish new connections.

Of course, Nginx could have been better configured, optimizing the configuration for greater resilience. However, a similar situation can occur on any server, especially if its settings were not adapted to protect against this type of attack or did not provide limits on the number of connections to be held.

Conclusion

HTTP flood DoS and DDoS attacks can be extremely effective, but it is important to not just randomly send requests, but first find a bottleneck in the server architecture. This could be an endpoint with frequent database writes or a resource-intensive operation that performs complex calculations.

This article was written to demonstrate methods for creating load on a server, including how to identify and exploit weaknesses in its architecture. By identifying which operations are consuming the most resources, you can more effectively conduct load attacks, using the fewest requests to achieve maximum impact.
