Site Reliability Engineer
Jobtome
Job Title: Site Reliability Engineer (SRE) / Infrastructure Operations MID LEVEL
Role Overview
Responsible for managing day-to-day infrastructure operations, including monitoring, alerting, and driving stability improvements across the environment.
Key Responsibilities
- Monitor overall infrastructure health and system performance
- Track key performance metrics such as CPU, memory, and disk utilization
- Tune alerts to improve signal-to-noise ratio and reduce alert fatigue
- Support disaster recovery (DR) rehearsals and readiness activities
- Maintain and update runbooks, documentation, and operational reports
Required Experience
- 4–6 years of experience in Site Reliability Engineering (SRE) or infrastructure operations
- Hands-on experience with VMware environments
- Experience with monitoring tools such as PRTG, Datadog, or similar platforms
- Strong incident management experience, including response and resolution processes
Core Skills & Competencies
- Solid understanding of infrastructure performance metrics (CPU, memory, disk, etc.)
- Experience with alert tuning and optimization
- Ability to proactively detect and troubleshoot performance issues
- Strong incident management and operational response capabilities
Screening Signals
Look for candidates who:
- Understand CPU Ready thresholds and their impact on performance
- Have hands-on experience tuning alerts to reduce noise
- Can proactively identify and resolve performance bottlenecks
- Demonstrate strong incident management experience in production environments
Vaga publicada há 7 dias atrás
Empregos semelhantes que podem ser interessantes para vocêCom base na vaga Site Reliability Engineer em Colombo, PR
- ...Site Reliability Engineer We are looking for a full-time, remote Site Reliability / DevOps Engineer with 3+ years of experience in high-availability cloud environments. This role is heavily focused on AWS infrastructure, CloudFormation, and CI/CD pipeline ownership...
- ...SRE, with the remaining experience in a relevant infrastructure/engineering role; Hands-on experience with Microsoft Azure and... ...conversation classes in 12 languages; Working model: remote, hybrid or on-site in Portugal. What makes us different? Our organisational...
- ...! Tá afim de fazer parte de tudo isso? Dá uma olhada no que estamos buscando: Estamos em busca de um(a) DBRE (Database Reliability Engineer) para atuar em um ambiente orientado por Data Mesh, com foco em resiliência, automação e governança de dados distribuídos. Essa...
- Transmission Line Engineer - LATAM Our client is seeking a Transmission Line Engineer to join their growing engineering team. This role... ...successful project execution. Travel occasionally to customer sites to support project meetings, technical presentations, and...
- Sr. Workday Application Engineer – Workday Integrations (Corporate Services) Overview Join a leading retail organization as a Senior... ...with Product, Architecture, and business teams to build scalable, reliable integrations that improve operational efficiency and employee...
$ 3.000
Mobile Engineer (React + React Native) LATAM | Remote | EST-Aligned | International Independent Contractor Compensation: USD $3,000–$... ...mobile platforms with a focus on usability, performance, and reliability. - Optimize application performance, rendering, routing, and component...- ...Description: Be part of the Application Engineering Team responsible for adapting and... ...strategy, including both remote and on-site support. Shape our products by being... ...experience as a Product Manager / Methodology Engineer. Knowledge & experience with ADAS/AV...
- ...About the Role This is a senior full‑stack engineering role focused on building and scaling the... ...real operational workflows into robust, reliable software. As the team grows, you’ll have... ...What Success Looks Like A successful engineer in this role: Builds scalable, reliable...
- ML/AI Engineer (Databricks) Location: Remote (Brazil) | Type: Contract | Experience: 3... ...models, and AI-powered applications that run reliably in production rather than living in a... ...so the data feeding your models is reliable and well-modeled Bring LLM and retrieval...
- ...We are looking for a Senior Cloud Engineer to join an international team and work remotely from Brazil, supporting global cloud infrastructure... ...scaling, and operational processes to improve efficiency and reliability Deploy and manage containerized applications using Docker...
- Buscamos um(a) AI Engineer para atuar no coração do Sankhya RH, sendo a ponte de integração entre as soluções da nossa Unidade de Negócios (Pontotel, Mindsight, Vixting, Sankhya LMS, Benefícios e Folha). Aqui, você não vai apenas trabalhar com modelagem tradicional. Você...
- ...16x eleita uma das melhores empresas para se trabalhar em tecnologia no Brasil (GPTW). Estamos buscando um(a) Senior Full Stack Engineer para atuar alocado(a) em um dos nossos grandes clientes do segmento de tecnologia e informação, com produtos utilizados em larga...
- ...to expand ideas through the right tools, contributing to our success in a collaborative environment We are looking for a Support Engineer. In this role, you will: Collaborate closely with cross-functional teams (Regulatory Operations, IT Security, QA, Validation) to meet...
- ...Role Description: Headspace is looking for a Senior Software Engineer (Fullstack with Flutter focus) to support our Employer... ...and design patterns (e.G., MVC, MVVM). ~ Experience designing reliable, scalable microservices architectures. ~ Proficiency with Git...
- Buscamos um(a) AI Engineer para atuar no coração do Sankhya RH, sendo a ponte de integração entre as soluções da nossa Unidade de Negócios (Pontotel, Mindsight, Vixting, Sankhya LMS, Benefícios e Folha). Aqui, você não vai apenas trabalhar com modelagem tradicional. Você...
- ...strong reputation by delivering premium products to clients worldwide, emphasizing reliability and innovation. Role Description This is a contract, remote role for Rubber Engineers specializing in the tire industry. Key responsibilities include designing, developing...
- Lead System Support Engineer - CREQ195578 Description Candidate should have hands on experience in handling windows/MAC OS and laptops... ...in handling IIS operations including creating/updating new IIS Sites and IIS servers Should have experience in handling DR processes...
- ...options data is among the most intricate financial data to store, index, and serve at scale. We take engineering seriously. The Role We're hiring a Technical Support Engineer to be the front line of the customer experience at Theta Data. This is not a generic helpdesk...
- Project Description: We are looking for passionate and creative developers eager to craft innovative products and embrace new technologies. You will be part of a small, agile, and highly skilled team of technologists who are building next generation of solutions for Risk...
- Na vertical de Platform Engineering da Sankhya, você atua no desenvolvimento e na evolução contínua dos sistemas internos, automações e templates que sustentam nossa Internal Developer Platform (IDP), a plataforma que padroniza, governa e acelera o ciclo...
- We are looking for a remote, full-time AI / ML Engineer with 4+ years of machine learning experience, preferably within MarTech SaaS companies, to join the engineering team of our U.S. client. You will join our client's innovative, growing team on a mission to unleash...
- ...We are looking for a Senior Generative AI Engineer to join the Customer Delivery team at CI&T. This person will work embedded within strategic client projects, helping companies across industries such as banking, insurance, and retail successfully adopt and operationalize...
- ...components in React, enhancing existing UI, developing and testing REST APIs, and collaborating closely with product, design, and engineering teams. They will participate in Agile ceremonies, troubleshoot issues across the stack, and contribute to overall system...
- ...~ sFamiliarity with healthcare, retail, or location-based service ecosystem s Job Overvi ewWe are seeking an AI Engineer to design, build, and scale intelligent agents supporting a high-volume enterprise platform operating across approximately 9,000 locations...
- ...applications for both iOS and Android platforms Build, ship, and support production-grade applications with a focus on performance and reliability Write and maintain automated tests, including unit, integration, and UI testing Integrate third-party services and APIs such as...
- Responsabilidades e atribuições Mapear e redesenhar fluxos de desenvolvimento ponta a ponta, da ideação ao deploy e operação; Identificar gargalos sistêmicos que impactam múltiplas squads simultaneamente; Definir, acompanhar e evoluir métricas de fluxo, ...
- No Grupo SBF, reunimos duas grandes forças do esporte no Brasil: a Centauro, maior varejista esportiva multimarcas da América Latina, e a Fisia, distribuidora oficial da Nike no país. Queremos ser referência em esporte no Brasil e construir um futuro em que cada experiência...
- ...Data Engineer Brazil Data Analyst II Important Information Location: Brazil Job Mode: Full-time Work Mode: Work from home Job Summary As a SQL & Snowflake Engineer, you will play a key role in the full development lifecycle, translating...
- ...Java Software Engineer, AI Imaging (Backend) About the Company Our client is a retail AI platform powering real-world, repeatable success for dozens of enterprise retailers. More than 100 million shoppers each month engage with their AI-driven outfitting, product...
- Hiring Now | CAD & Simulation Engineer – Aerospace | Remote Contract Opportunity Location: India & Greenlit Countries (Mexico, Bangladesh, Brazil, Colombia, Egypt) Employment Type: Contractor Assignment ⏳ Contract Duration: 8 Weeks Start Date: Immediate About the Role:...
Deseja receber mais vagas?
Assine e receba vagas semelhantes a Site Reliability Engineer. Seja o primeiro a se candidatar!
