C++ (advanced) Java (advanced) Do you have the mindset necessary to develop, instrument, and operate critical services? In mid-2020 we ramped up our investment in SRE and started a dedicated DBRE team. The team is centred in Oslo but has some remote staff as well.About the Site Reliability Engineering TeamCognite’s Data Fusion SaaS product stores and processes operational data at scale, enabling the world's largest industrial companies to make data-driven decisions. Our platform is running on both public and private clouds. The Site Reliability Engineering team works closely with the software engineers implementing core product features and ensures that the products are built to be highly available, observable, and resilient. You are expected to be hands-on and make changes to the codebase yourself. About our Tech stackWe work with open source technologies that need to run in multiple cloud environments – both public clouds (like Google Cloud Platform and Azure) and in private clouds with customer-provided Kubernetes. Managed Kubernetes (GKE, AKS, Openshift) forms the base that we build our products on top of. Where possible, we have used PaaS to store states, such as Google Bigtable, Spanner, and Pubsub. We replicate data to different storage systems to be able to answer different types of queries, where PostgreSQL and Elasticsearch are important examples. Our backend developer teams work with Java, Kotlin, Scala, Python, and Rust. CI/CD is handled by a combination of Github, Jenkins, and Spinnaker to test and deploy code to production. The infrastructure is managed as code with Terraform and Atlantis and services are monitored using Prometheus, Grafana, and Lightstep.About the job to be doneEstablish robust reliability engineering to support our software engineering teams, you will be embedded in and work closely with themEnable us to run 100s of Cognite Data Fusion clusters in different regions with high availability and performanceNurture a reliability mindset in our engineering culture and contribute to growing the organization's overall knowledge in this areaAbout youMaster’s or Bachelor’s in Computer Science or a similar amount of experienceStrong background in software engineering, and experience with multiple statically typed programming languages such as Java, C++ or RustExperience with software design, algorithms, and data structures5+ years of experience with operating software in production, preferably deployed as SaaSExceptional troubleshooting and problem-solving skillsWhat we offerAn opportunity to make an impact on the industrial future and be part of disruptive and groundbreaking projectsCompetitive salary and benefits (including pension plans, insurance, benefits, and more)IT equipment and tools to allow you to be productiveCoverage of mobile telephone subscription and broadband connectionExtended private health services and free annual health checkFree staffed gymSocial activitiesCognite is a global industrial SaaS company that was established with one clear vision: to rapidly empower industrial companies with contextualized, trustworthy, and accessible data to help drive the full-scale digital transformation of asset-heavy industries around the world. Our core Industrial DataOps platform, Cognite Data Fusion™, enables industrial data and domain users to collaborate quickly and safely to develop, operationalize, and scale industrial AI solutions and applications to deliver both profitability and sustainability. Visit us at www.cognite.com and follow us on Twitter @CogniteData or LinkedIn: https://
Senior Site Reliability Engineer in Constanţa
Datele de contact vor fi vizibile dupa ce veti aplica!