Implementasi RESTful untuk Mengurangi Mean Time to Resolve pada Alert Handling

Authors

  • Yadhi Aditya Politeknik Negeri Bandung, Bandung
  • Setiadi Rachmat Politeknik Negeri Bandung, Bandung
  • Maisevli Harika Politeknik Negeri Bandung, Bandung
  • Mufqi Uwais Nastiar Salim Politeknik Negeri Bandung, Bandung
  • Dewanto Joyo Pramono Politeknik Negeri Bandung, Bandung

DOI:

https://doi.org/10.30865/jurikom.v9i5.4968

Keywords:

Integrated System, Alert Handling, On-call Engineer, Infrastructure IT, Mean Time to Resolve

Abstract

One of the responsibilities of the company's IT infrastructure team is ensuring that IT services or services stay available. One part of analyzing the performance of IT infrastructure is MTTR (Mean Time to Resolve) or the average time necessary to restore a service that is experiencing issues. Warnings or alerts are shown when a service is encountering difficulties. As a member of the IT infrastructure team, the on-call engineer is responsible for managing alarms. At PT Sense Health Indonesia, difficulties manifest in the form of recurrence alerts, or warnings resulting from previously fixed or rectified issues. This warning is repeated with sufficient frequency to enhance the MTTR value. The findings demonstrated that the integration of tools using the RESTful approach streamlined the process of processing alerts. The deployment led to a 78% decrease in the average time required to handle alarms. This study contributes by demonstrating one method for decreasing MTTR for alert handling

References

“Infrastructure Engineer.†https://www.fieldengineer.com/skills/what-is-an-infrastructure-engineer#:~:text=Infrastructure Engineers are responsible for,size function efficiently and seamlessly. (accessed Aug. 11, 2022).

Q. Lin, H. Zhang, J. G. Lou, Y. Zhang, and X. Chen, “Log clustering based problem identification for online service systems,†in Proceedings - International Conference on Software Engineering, 2016, pp. 102–111. doi: 10.1145/2889160.2889232.

X. Zhou et al., “Latent error prediction and fault localization for microservice applications by learning from system trace logs,†in ESEC/FSE 2019 - Proceedings of the 2019 27th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019, pp. 683–694. doi: 10.1145/3338906.3338961.

X. Zhou et al., “Fault Analysis and Debugging of Microservice Systems: Industrial Survey, Benchmark System, and Empirical Study,†IEEE Trans. Softw. Eng., vol. 47, no. 2, pp. 243–260, 2021, doi: 10.1109/TSE.2018.2887384.

“Red Hat CodeReady Workspaces.†https://developers.redhat.com/products/openshift-dev-spaces/overview (accessed Aug. 11, 2022).

“GitHub.†https://github.com/ (accessed Aug. 11, 2022).

“BitBucket.†https://bitbucket.org/ (accessed Aug. 11, 2022).

“INTEGRATION: Ansible and AWS.†https://www.ansible.com/integrations/cloud/amazon-web-services (accessed Aug. 11, 2022).

“On-call and alert management to keep services always on.†https://www.atlassian.com/software/opsgenie (accessed Aug. 11, 2022).

F. Calefato and F. Lanubile, “A hub-and-spoke model for tool integration in distributed development,†in Proceedings - 11th IEEE International Conference on Global Software Engineering, ICGSE 2016, 2016, pp. 129–133. doi: 10.1109/ICGSE.2016.12.

A. L. Wasserman, “Tool integration in software engineering environments,†Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 467 LNCS, pp. 137–149, 1990, doi: 10.1007/3-540-53452-0_38.

P. K. Venkatesh, S. Wang, F. Zhang, Y. Zou, and A. E. Hassan, “What do client developers concern when using web APIs? An empirical study on developer forums and stack overflow,†in Proceedings - 2016 IEEE International Conference on Web Services, ICWS 2016, 2016, pp. 131–138. doi: 10.1109/ICWS.2016.25.

M. H. Bohara, M. Mishra, and S. Chaudhary, “RESTful Web Service integration using Android platform,†2013. doi: 10.1109/ICCCNT.2013.6726550.

M. Hasan, “Design and Implementation of an Automated IT Incident Management System for Small and Medium-Sized Enterprises,†2021.

A. Kelkar, U. Naiknaware, S. Sukhlecha, A. Sanadhya, M. Natu, and V. Sadaphal, “Analytics-based solutions for improving alert management service for enterprise systems,†in Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013, 2013, pp. 219–227. doi: 10.1109/ICDMW.2013.166.

S. Sasirekha, I. J. L. Paul, and S. Swamynathan, “An API Centric Smart Kitchen Application,†2018. doi: 10.1109/ICCCSP.2018.8452850.

H. K. Brar and P. J. Kaur, “Differentiating integration testing and unit testing,†in 2015 International Conference on Computing for Sustainable Global Development, INDIACom 2015, 2015, pp. 796–798.

S. Brown, The C4 model for visualising software architecture. 2018. [Online]. Available: https://c4model.com/%0Ahttps://c4model.com/%0Ahttps://www.infoq.com/articles/C4-architecture-model/%0Ahttps://c4model.com/

Additional Files

Published

2022-10-31

How to Cite

Aditya, Y., Rachmat, S., Harika, M., Salim, M. U. N., & Pramono, D. J. (2022). Implementasi RESTful untuk Mengurangi Mean Time to Resolve pada Alert Handling. JURNAL RISET KOMPUTER (JURIKOM), 9(5), 1434−1443. https://doi.org/10.30865/jurikom.v9i5.4968