Analysing and Modelling the Accuracy and Latency Trade-offs in Rate Limiting on API-Gateway

Publisher

Department of Industrial Management, Faculty of Science, University of Kelaniya, Sri Lanka.

Abstract

The rate limiting service in an API gateway controls request entry by throttling requests at a configured boundary, the throttle count. Rate limiting accuracy reflects how well the service works: whether it admits requests only up to the throttle count or lets more through. Latency, on the other hand, is the round-trip time of a particular API call. Accuracy is quantified by the spillover error percentage, an error measure for the requests that the rate limiting service allows beyond the throttle count. To rate limit requests, the API gateway must send them to the rate limiting service, which decides whether each incoming request must be throttled. The time the service takes to reach this decision adds latency to the round-trip time of the request. This added latency can be bounded by introducing a timeout, but a timeout could degrade the accuracy of rate limiting. This paper investigates this problem and models the relationship between accuracy and round-trip latency. The findings cover an analysis of the accuracy and latency trade-off with respect to the parameters that influence it, together with prediction results obtained using a random forest regressor.
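The trade-off described in the abstract can be illustrated with a small sketch. It is not the authors' model. It assumes that the spillover error percentage is the excess over the throttle count, expressed as a percentage of the throttle count, and that a request whose decision exceeds the timeout is allowed through uncounted (a fail-open policy). The names `spillover_error_pct` and `simulate_window` are hypothetical.

```python
from typing import Sequence, Tuple


def spillover_error_pct(allowed: int, throttle_count: int) -> float:
    """Excess allowed requests as a percentage of the throttle count.

    Assumed formula; the abstract names the metric but does not state it.
    """
    if throttle_count <= 0:
        raise ValueError("throttle_count must be positive")
    excess = max(0, allowed - throttle_count)
    return 100.0 * excess / throttle_count


def simulate_window(
    decision_latencies_ms: Sequence[float],
    throttle_count: int,
    timeout_ms: float,
) -> Tuple[int, float]:
    """Simulate one throttling window; return (allowed, mean added latency in ms).

    Assumption: if the rate limiting decision takes longer than the timeout,
    the gateway stops waiting, allows the request, and does not count it.
    """
    counted = 0  # requests admitted by the rate limiting service itself
    allowed = 0  # requests that reached the backend, by either path
    added_latency = 0.0
    for latency in decision_latencies_ms:
        # The gateway never waits longer than the timeout for a decision.
        added_latency += min(latency, timeout_ms)
        if latency > timeout_ms:
            allowed += 1  # fail open: decision arrived too late
        elif counted < throttle_count:
            counted += 1
            allowed += 1
        # Otherwise the request is throttled.
    return allowed, added_latency / len(decision_latencies_ms)


if __name__ == "__main__":
    latencies = [5, 5, 50, 5, 50, 5]  # illustrative values, not measured data
    for timeout in (20, 100):
        allowed, mean_added = simulate_window(latencies, 3, timeout)
        print(timeout, allowed, mean_added, spillover_error_pct(allowed, 3))
```

With these illustrative inputs, the shorter timeout lowers the mean added latency but lets extra requests through, while the longer timeout keeps the allowed count at the throttle count at the cost of more waiting.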

Citation

Caucidheesan, K., & Poravi, G. (2025). Analysing and modelling the accuracy and latency trade-offs in rate limiting on API-gateway. Smart Computing and Systems Engineering (SCSE 2025). Department of Industrial Management, Faculty of Science, University of Kelaniya, Sri Lanka. (P. 67).
