Machine learning algorithms for process modelling and decision-making in project portfolio management

Yurii  Yushchenko

Bertsimas, D., & Kallus, N. (2018). From predictive to prescriptive analytics. Management Science, 66(3), 1025-1044. doi: 10.1287/mnsc.2018.3253.
Bondar, O., Ryzhenko, N., & Chernykhivska, A. (2023). Green recovery of Ukraine in the post-war period. Collection of Scientific Papers “ΛΌГOΣ”, 13-14. doi: 10.36074/logos-27.10.2023.01.
Bubeck, S., & Cesa-Bianchi, N. (2012). Regret analysis of stochastic and non-stochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 5(1), 1-122. doi: 10.1561/9781601986276.
Bushuyev, S., Bushuyeva, N., Lobok, Ye., & Murovansky, H. (2025). Management of innovative projects based on artificial intelligence applications in a turbulent environment. Innovative Technologies and Scientific Solutions for Industries, 2(32), 168-176. doi: 10.30837/2522-9818.2025.2.168.
Bushuyev, S., Bushuyeva, N., Onyshchenko, S., & Bondar, A. (2021). Modelling projects portfolio structure dynamics. In 2021 IEEE 16^th international conference on computer sciences and information technologies (pp. 293-298). Lviv: IEEE Ukraine Section. doi: 10.1109/CSIT52700.2021.9648713.
Chakraborty, S. (2022). Incentivized exploration of non-stationary stochastic bandits. (Master's thesis, University of Colorado, Boulder, USA).
Chen, Q., Golrezaei, N., & Bouneffouf, D. (2022). Non-stationary bandits with auto-regressive temporal dependency. In Proceedings of the 37^thinternational conference on neural information processing systems (pp. 7895-7929). New York: Curran Associates Inc. doi: 10.5555/3666122.3666468.
Cooper, R.G. (2022). The 5th generation stage-gate idea to launch process. IEEE Engineering Management Review, 50(4), 43-55. doi: 10.1109/EMR.2022.3222937.
Devlin, J., Chang, M.W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: Human language technologies, Volume 1 (pp. 4171-4186). Minneapolis: Association for Computational Linguistics.
Esmaeili, S.A., Shin, S., & Slivkins, A. (2023). Robust and performance incentivizing algorithms for multi-armed bandits with strategic agents. Proceedings of the AAAI Conference on Artificial Intelligence, 39(13), 13814-13822. doi: 10.1609/aaai.v39i13.33510.
Garivier, A., & Moulines, E. (2011). On Upper-Confidence Bound policies for Switching Bandit problems. In J. Kivinen, C. Szepesvári, E. Ukkonen & T. Zeugmann (Eds.), Algorithmic learning theory (pp. 174-188). Berlin: Springer. doi: 10.1007/978-3-642-24412-4_16.
Gornet, J., Hosseinzadeh, M., & Sinopoli, B. (2022). Stochastic Multi-armed Bandits with non-stationary rewards generated by a linear dynamical system. In IEEE 61^st conference on decision and control (pp. 1460-1465). Mexico: Institute of Electrical and Electronics Engineers.
Harris, K., & Slivkins, A. (2025). Should you use your large language model to explore or exploit? arXiv. doi: 10.48550/arXiv.2502.00225.
Keplinger, N.S., Luo, B., Bektas, I., Zhang, Y., Wray, K.H., Laszka, A., Dubey, A., & Mukhopadhyay, A. (2025). NS-Gym: A сomprehensive and open-source simulation environments and benchmarks for non-stationary Markov decision processes. arXiv. doi: 10.48550/arXiv.2501.09646.
Khurshid, S., Abdulla, M.S., & Ghatak, G. (2024). Optimizing sharpe ratio: Risk-adjusted decision-making in multi-armed bandits. Machine Learning, 114, article number 32. doi: 10.1007/s10994-024-06680-2.
Kovari, A. (2024). AI for decision support: Balancing accuracy, transparency, and trust across sectors. Information, 15(11), article number 725. doi: 10.3390/info15110725.
Levine, S., Kumar, A., Tucker, G., & Fu, J. (2020). Offline reinforcement learning: Tutorial, review, and perspectives on open problems. arXiv. doi: 10.48550/arXiv.2005.01643.
Slivkins, A., Xu, Y., & Zuo, S. (2025). Greedy algorithm for structured bandits: A sharp characterization of asymptotic success / failure. arXiv. doi: 10.48550/arXiv.2503.04010.
Slivkins, A. (2019). Introduction to multi-armed bandits. arXiv. doi: 10.48550/arXiv.1904.07272.
Thananjeyan, B., Kandasamy, K., Stoica, I., Jordan, M.I., Goldberg, K., & Gonzalez, J.E. (2021). Resource allocation in Multi-Armed Bandit exploration: Overcoming sublinear scaling with adaptive parallelism. Proceedings of the 38^th International Conference on Machine Learning, 139, 10236-10246.
Vernade, С., Gyorgy, A., & Mann, T.A. (2020). Non-stationary delayed bandits with intermediate observations. Proceedings of the 38^th International Conference on Machine Learning, 119, 9722-9732.
Xiang, D., West, R., Wang, J., Cui, X., & Huang, J. (2022). Multi Armed Bandit vs. A/B Tests in E-commerce – confidence interval and hypothesis test power perspectives. In KDD ‘22: Proceedings of the 28^th ACM SIGKDD conference on knowledge discovery and data mining (pp. 4204-4214). New York: Association for Computing Machinery. doi: 10.1145/3534678.3539144.
Yasinetskyi, O., & Galchenko, I. (2025). AI tools and risk management methods in the scrum project lifecycle. Telecommunications and Information Technologies, 3(88), 106-112. doi: 10.31673/2412-4338.2025.038711.
Zuo, J., & Joe-Wong, C. (2021). Combinatorial multi-armed bandits for resource allocation. In 55^th annual conference on information sciences and systems (pp. 1-4). Baltimore: Johns Hopkins University.

Economics, Entrepreneurship, Management

Article

Machine learning algorithms for process modelling and decision-making in project portfolio management

Abstract

Keywords

Suggested citation

References