Site Reliability Engineer
Jobtome
Job Title: Site Reliability Engineer (SRE) / Infrastructure Operations (Mid-Level)
Role Overview
Responsible for managing day-to-day infrastructure operations, including monitoring, alerting, and driving stability improvements across the environment.
Key Responsibilities
- Monitor overall infrastructure health and system performance
- Track key performance metrics such as CPU, memory, and disk utilization
- Tune alerts to improve signal-to-noise ratio and reduce alert fatigue
- Support disaster recovery (DR) rehearsals and readiness activities
- Maintain and update runbooks, documentation, and operational reports
Required Experience
- 4–6 years of experience in Site Reliability Engineering (SRE) or infrastructure operations
- Hands-on experience with VMware environments
- Experience with monitoring tools such as PRTG, Datadog, or similar platforms
- Strong incident management experience, including response and resolution processes
Core Skills & Competencies
- Solid understanding of infrastructure performance metrics (CPU, memory, disk, etc.)
- Experience with alert tuning and optimization
- Ability to proactively detect and troubleshoot performance issues
- Strong incident management and operational response capabilities
Screening Signals
Look for candidates who:
- Understand CPU Ready thresholds and their impact on performance
- Have hands-on experience tuning alerts to reduce noise
- Can proactively identify and resolve performance bottlenecks
- Demonstrate strong incident management experience in production environments
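For screeners less familiar with the CPU Ready signal above: VMware performance charts report CPU Ready as a summation in milliseconds per sampling interval, and candidates are usually expected to know how to turn that into a percentage. The sketch below shows that conversion. The 20-second interval is the real-time chart default; the per-vCPU normalization and the 5% / 10% cut-offs are commonly quoted rules of thumb used here as assumptions, not requirements stated in this posting, and the function names are illustrative only.

```python
def cpu_ready_percent(ready_ms: float, interval_s: float = 20.0, vcpus: int = 1) -> float:
    """Convert a CPU Ready summation (ms) into an average percentage per vCPU.

    ready_ms   -- CPU Ready summation reported for one sampling interval
    interval_s -- sampling interval in seconds (20 s for real-time charts)
    vcpus      -- number of vCPUs on the VM (assumption: normalize per vCPU)
    """
    if interval_s <= 0 or vcpus <= 0:
        raise ValueError("interval_s and vcpus must be positive")
    return ready_ms * 100.0 / (interval_s * 1000.0 * vcpus)


def classify_cpu_ready(percent: float, warn: float = 5.0, crit: float = 10.0) -> str:
    """Label a CPU Ready percentage using assumed rule-of-thumb thresholds."""
    if percent >= crit:
        return "critical"
    if percent >= warn:
        return "warning"
    return "ok"


if __name__ == "__main__":
    # A 2-vCPU VM showing 4000 ms of ready time in a 20 s real-time sample.
    pct = cpu_ready_percent(4000, interval_s=20, vcpus=2)
    print(f"{pct:.1f}% -> {classify_cpu_ready(pct)}")
```

A candidate who can explain why the same millisecond value means something different on a 20-second real-time chart than on a longer historical rollup interval likely has the hands-on understanding this signal is looking for.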
Job posted 8 days ago
