Site Reliability Engineer (Azure) in Bucuresti

You may not know our name, but you have surely used our innovations and solutions. Our mission is to unlock the world and make it safer through cutting-edge identity technologies. Every day, around the globe, we are enabling citizens and consumers alike to perform their daily critical activities (such as pay, connect and travel), in the physical as well as digital space. We are transforming their lives by making the world more secure and yet also more streamlined. We have brought together complementary know-how and technologies that have never been combined before for both the physical and digital era: secured connectivity, secured payments and secured identity management. Cybersecurity, biometrics, large scale distributed systems and Cloud computing, analytics and smart devices are at the core of both our physical products and our software and systems. We serve our clients in 180 countries thanks to our 15,000 employees worldwide. PurposeThis role is responsible for maintaining the service-level agreement critical production platforms or products and providing automated operations to ensure the service to our clients is always of the best quality.Key Missions The key mission of the Site Reliability Engineer is providing automated operations and preventive monitoring of SLA-critical production platforms, together with the Idemia’s SRE Team. Hardens platforms or products before and after they go live by reviewing their design, security and implementation, tuning configuration as well as developing auxiliary tools and necessary monitoring of critical health indicators Maintains platforms or products after go live by measuring and monitoring their availability, performance and overall system health Recovers platforms or products during production incidents to meet targeted service-level agreements Performs detailed root cause analysis and conduct post-mortem analysis Seeks proactively for improvements of non-functional requirements and cooperates with development and product teams to improve operational aspects of platforms or products Participates to the Change Advisory Board and validate readiness, security and maturity of new releases through development, execution and verification of automated smoke test Supports stakeholders by providing technical expertise when necessary Develops monitoring including new dashboard and alert implementation and continuous improvement Supports proactively product design review and collaborates with development teams during product development phase Communicates with internal stakeholders during planned interventions or changes and outages Technologies we are using: Azure Linux Docker Docker - compose Azure Kubernetes Service Prometheus Grafana Terraform Nagios Jenkins

Contact

Datele de contact vor fi vizibile dupa ce veti aplica!

Anunţ expirat
loading...
www.mynextjob.ro folosește cookies. Navigând în continuare, iți exprimi acordul pentru folosirea acestora. Află mai multe Am ințeles!