Comparative Study on Neural Network and Transformer-based Models for Predicting Exchange Rates in Sri Lanka

dc.contributor.authorBasnayake, B. R. P. M.
dc.contributor.authorChandrasekara, N. V.
dc.date.accessioned2025-10-09T04:11:47Z
dc.date.issued2025
dc.description.abstractTransformer (TF) models have demonstrated exceptional performance across various domains, with a significant focus on their application to time series forecasting. Key strengths of TFs are their ability to capture long-run dependencies and complex interactions within the data. This study aims to apply time series TF models for forecasting exchange rates and focuses on exploring the strengths and limitations of TFs in the hyperparameter tuning process and compare their performance with Seasonal Autoregressive Integrated Moving Average (SARIMA), Double SARIMA (DSARIMA) and neural network (NN) models. The study utilized daily exchange rate data for eight currency pairs relative to LKR. Hyperparameter tuning was performed using a trial-and-error approach to optimize model performance. The main findings from fitted TF models can be summarized as follows: The inherent fluctuations in exchange rate movements highlighted that a larger embedding size enhances the models' performance. Better performance was observed with four encoder and decoder layers and the attention head parameter was within the range of four to six, as deviating from this configuration led to higher error values. Further, higher error values were observed across all exchange rates when the learning rate was greater than 0.5. Maintaining a batch size of 16 and reduced dropout rates also yielded the lowest error values. This parameter selection enables the TFs to effectively capture key features from the input data while maintaining a balance that minimizes the risks of overfitting or underfitting. Overall, TF models performed well compared to SARIMA/DSARIMA and NN models, achieving the lowest error metrics owing to their advanced capability to capture non-linear patterns. This research will be valuable to scholars exploring the application of TF models and professionals in sectors like finance and economics, where precise exchange rate forecasting plays a vital role in informed decision-making and effective resource allocation.
dc.identifier.citationBasnayake, B. R. P. M., & Chandrasekara, N. V. (2025). Comparative study on neural network and transformer-based models for predicting exchange rates in Sri Lanka. In Proceedings of the International Research Conference on Smart Computing and Systems Engineering (SCSE 2025). Department of Industrial Management, Faculty of Science, University of Kelaniya.
dc.identifier.urihttp://repository.kln.ac.lk/handle/123456789/30074
dc.publisherDepartment of Industrial Management, Faculty of Science, University of Kelaniya.
dc.subjectexchange rates
dc.subjecthyperparameter tuning
dc.subjectnon-linear
dc.subjecttime series
dc.subjecttransformer
dc.titleComparative Study on Neural Network and Transformer-based Models for Predicting Exchange Rates in Sri Lanka
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SCSE Abstract Proceedings 2025-58.pdf
Size:
10.23 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: