Throttle question

Hi,

I need some help understanding how to throttle in my scenario setup, and I think it’s just me not understanding the terminology. Let’s say I have 3000 scenarios to run. I want to be able to say “never run more than 4 in parallel/concurrently in 1 second”, so the “running” count in the console output should never grow beyond “4”.

When I use “constantUsersPerSec” with a “scenario” and set it to “4”, it will “inject” 4 new requests to the server every second. It doesn’t matter what state the server is in, and it doesn’t matter how many requests are already running; every second it will inject/send another 4 requests. Correct?

The behavior I am trying to get to is that never more than 4 requests are running/pending. I can’t figure out how to apply “throttle” so that when I look at the output of the test the “running” never shows more than “4”.

For example this output should not happen:

---- Get -----------------------------------------------------------------------
[##-                                                                       ]  3%
waiting: 2320 / running: 7 / done:73
---- Requests ------------------------------------------------------------------

The “running: 7” should never exceed “4”.

I hope my question makes sense. Thx for any pointers!

—Joerg—

Let’s say I have 3000 scenarios to run

3000 scenarios, are you sure? Don’t you mean 3000 virtual users?

When I use “constantUsersPerSec” with a “scenario” and set it to “4” it will “inject” 4 new requests every second to the server.

It will start 4 new virtual users that will start going down the scenario workflow.

The behavior I am trying to get to is that never more than 4 requests are running/pending.

If your response time exceeds 1 sec, you can end up with more than 4 alive virtual users at a given time.

What do your throttle attempts look like?
http://gatling.io/docs/2.0.0-RC2/general/simulation_setup.html#throttling

Thanks for the reply Stéphane.

Yes 3000 virtual users. I tried the different throttling options you pointed me to on a scenario level as well as a setup level.

The product I am testing has response times from 1 second to 80 seconds (Druid Data store queries can take this long).
I was trying to get to a setup that never has more than 4 virtual users running at the same time, and never fewer either.
The throttling works fine from the aspect of “never inject more than 4 new virtual users”, but that isn’t really what my intention was.

Any suggestion on how I can make the “never have more than N parallel virtual users running” work?

Thx
—Joerg—

Hi Joerg,

To recap: you have a population of 3000 virtual users who could need to access this system at any time, but there are only ever exactly 4 active virtual users, because some hard restriction outside of the SUT prevents more than 4 virtual users from accessing the system at a time?
E.g. (to take an extreme case): if the system was down for 2 days and there was a large backlog of work from the 3000-virtual-user population, then when it came back online, still only 4 could do their work simultaneously, by some mechanism?

In which case you just use a finite user injection with a scenario that loops forever and a maxDuration on the simulation.
http://gatling.io/docs/2.0.0-RC2/general/simulation_setup.html#injection

    atOnceUsers(10),
    rampUsers(10) over (5 seconds),
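A minimal sketch of that closed-model setup, assuming the Gatling 2 DSL from the linked docs (the request URL, the scenario name, and the 30-minute cap are placeholders, not figures from this thread):

```scala
import io.gatling.core.Predef._
import io.gatling.http.Predef._
import scala.concurrent.duration._

// Closed model: inject exactly 4 users once; each loops the scenario
// forever, and maxDuration stops the whole simulation.
val request = http("request").get("http://localhost/SUT/request")

val scn = scenario("scn").forever(exec(request))

setUp(
  scn.inject(atOnceUsers(4)).protocols(httpConf)
).maxDuration(30 minutes)
```

With no new arrivals and no user finishing before maxDuration, the console’s “running” should sit at exactly 4.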

Note that the “running” number in the console you were looking at will include any users that are idle (from a SUT perspective), e.g. in pause time. (Not sure if you have pauses or not.)

Typically, though, most live systems have a rate at which requests/users arrive rather than a parallel-virtual-user restriction, so you may need to sense-check the parallel-4-users requirement.

Thanks,
Alex

The 3000 and 4 are arbitrary numbers. Most likely I confuse “virtual” users somehow.

Basically I have a CSV file that I generated from a Production log file. The CSV file contains the REST API requests I want to basically play back.

When I use the throttle functionality right now it injects 4 users no matter what state the last 4 users are in (finished or still running).
This is a good way of testing and I am doing this at this point. And this is actually more like the real world example, which is why I like Gatling. Another 4 users could come at any given time and they don’t care how many users are already active.

The reason I want to limit to a constant number of running requests is simply debugging. It makes it somewhat easier to analyse logs on the backend (database, API, etc.), as I know I only ever have n requests (for example, 4) running.

—Joerg—

Hi Joerg,

Any suggestion on how I can make the “never have more than N parallel virtual users running” work?

For the case where you want cUps() users arriving, but no more than N active/running users: there is a ticket for this, so it’s not implemented yet: https://github.com/gatling/gatling/issues/1647

The 3000 and 4 are arbitrary numbers. Most likely I confuse “virtual” users somehow.

Probably not, but it will likely be useful for you to determine whether the API requests are all independent of each other, or how they are related. You can discard the notion of (implied concurrent) virtual users if they do not repeat sessions and are independent of each other. You would instead need the user arrival rate from the production data.

Basically I have a CSV file that I generated from a Production log file. The CSV file contains the REST API requests I want to basically play back.

Sounds good

When I use the throttle functionality right now it injects 4 users no matter what state the last 4 users are in (finished or still running).

Throttle is a rate limiter.
You are looking for a concurrency limiter (as far as you have stated the requirement: active users = 4).
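To make the distinction concrete (this is plain Scala, not the Gatling API; `ConcurrencyLimiter` is a made-up name for illustration): a rate limiter bounds how often work may start, while a concurrency limiter bounds how much work is in flight at once, which is exactly what a counting semaphore gives you:

```scala
import java.util.concurrent.Semaphore
import java.util.concurrent.atomic.AtomicInteger

// Concurrency limiter: never more than `limit` tasks in flight at once,
// no matter how fast callers arrive. A rate limiter would instead only
// bound how often tasks are allowed to *start*.
class ConcurrencyLimiter(limit: Int) {
  private val permits = new Semaphore(limit)
  private val active  = new AtomicInteger(0)
  @volatile var peak  = 0 // highest concurrency ever observed

  def run[T](task: () => T): T = {
    permits.acquire() // blocks while `limit` tasks are already running
    val now = active.incrementAndGet()
    synchronized { if (now > peak) peak = now }
    try task()
    finally { active.decrementAndGet(); permits.release() }
  }
}
```

This “at most N in flight, blocking further arrivals” behaviour is what the ticket linked above asks Gatling to support natively.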

To investigate this I set up the following test:

a request in a SUT takes a configurable N seconds.

val request = http("request").get("http://localhost/SUT/request")

val scn = scenario("scn").exec(request)

I apply an open-model workload of cUPS(n), with each user making only 1 request.

  1. user arrival rate = request throttle rate = response time = 1

setUp(
  scn.inject(constantUsersPerSec(1) during (testDuration seconds))
    .protocols(httpConf)
    .throttle(jumpToRps(1), holdFor(2 hours))
)

the console prints every 5 seconds:

waiting: 100000 / running: 0 / done:0
waiting: 99995 / running: 1 / done:4
waiting: 99990 / running: 1 / done:9
waiting: 99985 / running: 1 / done:14
waiting: 99980 / running: 1 / done:19
waiting: 99975 / running: 1 / done:24
waiting: 99970 / running: 1 / done:29
waiting: 99965 / running: 1 / done:34

  2. user arrival rate = request throttle rate = 1, response time = 4

setUp(
  scn.inject(constantUsersPerSec(1) during (testDuration seconds))
    .protocols(httpConf)
    .throttle(jumpToRps(1), holdFor(2 hours))
)

the console prints every 5 seconds:

waiting: 100000 / running: 0 / done:0
waiting: 99995 / running: 4 / done:1
waiting: 99990 / running: 4 / done:6
waiting: 99985 / running: 4 / done:11
waiting: 99980 / running: 4 / done:16
waiting: 99975 / running: 4 / done:21
waiting: 99970 / running: 4 / done:26
waiting: 99965 / running: 4 / done:31

  3. user arrival rate = 2, request throttle rate = 1, response time = 1

setUp(
  scn.inject(constantUsersPerSec(2) during (testDuration seconds))
    .protocols(httpConf)
    .throttle(jumpToRps(1), holdFor(2 hours))
)

the console prints every 5 seconds:

waiting: 200000 / running: 0 / done:0
waiting: 199990 / running: 7 / done:3
waiting: 199980 / running: 12 / done:8
waiting: 199970 / running: 17 / done:13
waiting: 199960 / running: 22 / done:18
waiting: 199950 / running: 27 / done:23
waiting: 199940 / running: 32 / done:28
waiting: 199930 / running: 37 / done:33
waiting: 199920 / running: 42 / done:38
waiting: 199910 / running: 47 / done:43
waiting: 199900 / running: 52 / done:48
waiting: 199890 / running: 57 / done:53

Throttle does its job as the rps is constant for all 3 tests, but the active(running) users is different in each test for different reasons.
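A rough sanity check on those “running” numbers (assuming steady state; Gatling does not compute this for you) is Little’s law: the mean number of in-flight users equals the effective arrival rate times the mean time each user spends in the system.

```scala
// Little's law: L = lambda * W, where
//   L      = mean concurrent ("running") users,
//   lambda = effective arrival rate in users/sec,
//   W      = mean time in system (here, the response time) in sec.
def expectedRunning(arrivalPerSec: Double, responseTimeSec: Double): Double =
  arrivalPerSec * responseTimeSec

// Test 1: 1 user/s arriving, 1 s responses -> about 1 running.
// Test 2: 1 user/s arriving, 4 s responses -> about 4 running.
// Test 3 has no steady state: users arrive at 2/s but the throttle releases
// only 1/s, so the running/queued backlog grows by about 1 user/s
// (+5 per 5-second console line, which matches the output above).
```

This is also why Joerg saw “running: 7” with 4 arrivals per second: with response times above 1 second, L = 4 × W exceeds 4.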

It is also worth noting that throttle constrains requests from starting, not completing. So the “active (running) users” figure is a mix of users in the middle of a scenario and users delayed/queued waiting to make their first request (arguably those are not really running as such, since they have not done any work yet).

If you are happy that the active users will vary above or below 4 as the response time of your SUT varies, then this seems like a reasonable setup. If the active users must be constant, and you want them to be executing a request rather than being delayed/queued or in a pause, then I think the only choice is to revert to a closed model: a total of 4 users that loop the scenario.

This is a good way of testing and I am doing this at this point. And this is actually more like the real world example, which is why I like Gatling. Another 4 users could come at any given time and they don’t care how many users are already active.

I agree with that!

The reason I want to limit to a constant number of running requests is simply debugging. It helps me analyse logs on the backend (Database, API, etc.) somewhat easier as I know I only have n number of requests (Example is 4) running.

Sounds reasonable for early or simplified testing.

I don’t think throttle is widely applicable, though (or a good default way of modelling the load), especially if you are using cUps().

Thanks,

Alex