Grafana (regular) PostgreSQL (regular) Bash (regular) Are you intrigued by planetary scale, distributed, intelligent systems?Do you like to build services that can run themselves? Then this is the role for you!Join our Performance & Reliability Engineering Organization!Akamai is the world's largest, most trusted cloud delivery platform. We ensure that infrastructure services have reliability and uptime that meet service level objectives and agreements. SREs monitor our service capacity and performance. We focus on optimizing services, building infrastructure, and eliminating manual operations work with automation.Partner with the bestThe UMP Team is a part of Akamai's Systems Communications group. A cross-functional engineering team that develops the distributed systems and services that underpin Akamai's global network. Our system allows fast and reliable configuration of Akamai's global network. You'll develop automation that prevents service from recurring and handles non-exceptional service conditions to reduce manual operations.As a Senior Site Reliability Engineer, you will be responsible for:Being an SME and tuning systems to optimize performance and operate more reliablyProviding ongoing technical assistance in areas including model database management, configuration management, and simulation runsGuiding software releases, automating activations for new features, maintenance of services prioritizing safetyDeveloping monitoring tools and automate processes to help scale our systems betterImplementing and improving monitoring, alerting, and emergency response procedures and maneuversTroubleshooting complex application issues, service incidents, performance and availability issuesProviding expertise in developing code that provides predictive results from analytical trending and modelingManaging on-premises resources through infrastructure, code frameworks, and declarative configuration managementDo what you loveTo be successful in this role you will:Have experience in Computer Science, Engineering, and Linux/Unix-like operating systemsHave Decent knowledge of Web programming technologies and automation by scripting in Python/bashHave a significant background in performance analytics and performance optimizationHave experience with writing queries and big data technologies (PostgreSQL)Have commercial experience with deployment/configuration management tools (Ansible, Terraform, Puppet, Chef, SaltStack).Be experienced in monitoring large-scale systems (using Prometheus, Grafana, etc.)Have experience with CI/CD tools (i.e. Jenkins, Travis, Gitlab CI)Have experience with oncall model of work Build your career at AkamaiOur ability to shape digital life today relies on developing exceptional people like you. The kind that can turn impossible into possible. We’re doing everything we can to make Akamai a great place to work. A place where you can learn, grow and have a meaningful impact.With our company moving so fast, it’s important that you’re able to build new skills, explore new roles, and try out different opportunities. There are so many different ways to build your career at Akamai, and we want to support you as much as possible. We have all kinds of development opportunities available, from programs such as GROW and Mentoring to internal events like the APEX Expo and tools such as Linkedin Learning, all to help you expand your knowledge and experience here.Learn moreNot sure if this job is the right match for you or want to learn more about the job before you apply? Schedule a 15-minute exploratory call with the Recruiter and they would be happy to share more details. Here is the contact info for the recruiter assigned to this position: Kaja Grygakgryga@akamai.com
Site Reliability Engineer in Constanţa
Contact
Datele de contact vor fi vizibile dupa ce veti aplica!
Anunţ expirat