Fetch: support rate limiting #133

mhuang74 · 2021-12-30T14:22:06Z

Add rate limiting support to Fetch (Issue #77)

Uses LinkedIn's 30 qps upper limit as the reference rate limit.
I believe 30 qps can be reached with a single thread, so the current implementation uses just one thread.
Sleeps 10 ms when rate limited. I haven't fully tested this, but I believe it should still allow 30 qps.
To reach higher qps with more threads, we may first need to figure out how to keep output rows in the same order as the input.

The integration test uses Actix to simulate a web API with a 2 qps limit, so requesting 20 URLs takes about 10 seconds.

mhuang@twisted-linen:/usr/local/projects/mhuang/rust/qsv (fetch-rate-limiting)$ QSV_LOG_LEVEL=debug RUST_BACKTRACE=1 cargo test fetch::fetch_ratelimit
    Finished test [optimized + debuginfo] target(s) in 0.40s
     Running unittests (target/debug/deps/qsv-e24fec371759ce4e)

running 0 tests

test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s

     Running tests/tests.rs (target/debug/deps/tests-84caf56d8c8e9a11)

running 1 test
test test_fetch::fetch_ratelimit ... ok

test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 616 filtered out; finished in 9.03s
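
For illustration, the single-threaded approach described above (check the limit, sleep briefly, retry) could look roughly like the sketch below. This is not the code in this PR: `RateLimiter`, `try_acquire`, and `acquire` are hypothetical names, and the sketch uses only the standard library rather than whatever crate the implementation relies on.

```rust
use std::thread;
use std::time::{Duration, Instant};

/// Hypothetical fixed-interval limiter: one request slot every 1/qps seconds.
struct RateLimiter {
    interval: Duration,
    next_allowed: Instant,
}

impl RateLimiter {
    fn new(qps: u32) -> Self {
        assert!(qps > 0, "qps must be positive");
        RateLimiter {
            interval: Duration::from_secs(1) / qps,
            next_allowed: Instant::now(),
        }
    }

    /// Returns true and reserves the next slot if a request may go out now.
    fn try_acquire(&mut self) -> bool {
        let now = Instant::now();
        if now < self.next_allowed {
            return false;
        }
        // Keep the original schedule when we are only slightly late (for
        // example because of the polling sleep); restart from `now` after
        // a long idle period so that no large burst builds up.
        let late = now.duration_since(self.next_allowed);
        let base = if late > self.interval { now } else { self.next_allowed };
        self.next_allowed = base + self.interval;
        true
    }

    /// Blocks the calling thread, sleeping `poll` between attempts.
    fn acquire(&mut self, poll: Duration) {
        while !self.try_acquire() {
            thread::sleep(poll);
        }
    }
}
```

A caller would invoke `limiter.acquire(Duration::from_millis(10))` before each request. At 30 qps the slot interval is about 33 ms, so a 10 ms poll is shorter than one interval, and because each slot is scheduled from the previous slot rather than from the wake-up time, a late wake-up does not push later slots back.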

jqnatividad · 2021-12-30T16:48:52Z

Thanks @mhuang74!

> use linkedin 30 qps upper limit as reference rate limit

Rate limiting is definitely better than throttling for dealing with real-world web services.

> believe 30 qps can be reached via single thread, so current implementation uses just one thread

At this stage, multi-threading is somewhat mismatched with how those same services behave, so we can leave it for a future implementation.

> sleeps 10ms when rate limited. haven't fully tested this out, but believe this should still allow 30 qps

10 ms seems like a reasonable default. We can always tweak it, or even make it a configurable parameter if required.

> to reach higher qps using more threads, may need to first figure out how to keep output rows in same order as input

Agreed. You may want to check whether indexmap can help. It certainly did with the test-data-generation crate, which enables the generate command (#90).
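
As a rough illustration of the ordering problem, here is one way to keep output rows in input order with several worker threads. It deliberately swaps indexmap for a different, standard-library-only technique: each result is tagged with its row index and written into a pre-sized slot. `process_in_order` and its parameters are hypothetical names, not part of qsv, and `thread::scope` assumes Rust 1.63 or later.

```rust
use std::sync::mpsc;
use std::thread;

/// Hypothetical helper: runs `work` on every input using `workers` threads
/// and returns the results in the same order as `inputs`.
fn process_in_order<F>(inputs: &[String], workers: usize, work: F) -> Vec<String>
where
    F: Fn(&str) -> String + Sync,
{
    let workers = workers.max(1); // step_by(0) would panic
    let (tx, rx) = mpsc::channel::<(usize, String)>();
    let mut slots: Vec<Option<String>> = vec![None; inputs.len()];

    thread::scope(|scope| {
        for w in 0..workers {
            let tx = tx.clone();
            let work = &work;
            scope.spawn(move || {
                // Worker `w` handles rows w, w + workers, w + 2 * workers, ...
                for (idx, input) in inputs.iter().enumerate().skip(w).step_by(workers) {
                    tx.send((idx, work(input.as_str())))
                        .expect("receiver should still be alive");
                }
            });
        }
        // Drop the original sender so the channel closes when workers finish.
        drop(tx);

        // Results arrive in completion order; the index restores row order.
        for (idx, output) in rx {
            slots[idx] = Some(output);
        }
    });

    slots
        .into_iter()
        .map(|slot| slot.expect("every row should have produced a result"))
        .collect()
}
```

The trade-off in this sketch is that all results are buffered before anything is written out, which may not suit very large inputs. A shared rate limiter across the workers would also still be needed to stay under the service's qps limit.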

mhuang74 added 4 commits December 29, 2021 21:44

unit test works; but rate limit impl not working well yet

aea1daa

set rate limit from command line arg; integration test passes

4530e47

merge latest upstream

32128f7

fix typo

5ec1bf3

jqnatividad merged commit 6143e27 into dathere:master Dec 30, 2021

mhuang74 deleted the fetch-rate-limiting branch January 5, 2022 07:21
