Site Reliability Engineer
Insight Global
Job Title: Site Reliability Engineer (SRE) / Infrastructure Operations MID LEVEL Role Overview
Responsible for managing day-to-day infrastructure operations, including monitoring, alerting, and driving stability improvements across the environment. Key Responsibilities
Monitor overall infrastructure health and system performance
Track key performance metrics such as CPU, memory, and disk utilization
Tune alerts to improve signal-to-noise ratio and reduce alert fatigue
Support disaster recovery (DR) rehearsals and readiness activities
Maintain and update runbooks, documentation, and operational reports Required Experience
4–6 years of experience in Site Reliability Engineering (SRE) or infrastructure operations
Hands-on experience with VMware environments
Experience with monitoring tools such as PRTG, Datadog, or similar platforms
Strong incident management experience, including response and resolution processes Core Skills & Competencies
Solid understanding of infrastructure performance metrics (CPU, memory, disk, etc.)
Experience with alert tuning and optimization
Ability to proactively detect and troubleshoot performance issues
Strong incident management and operational response capabilities Screening Signals
Look for candidates who:
Understand CPU Ready thresholds and their impact on performance
Have hands-on experience tuning alerts to reduce noise
Can proactively identify and resolve performance bottlenecks
Demonstrate strong incident management experience in production environments
Vaga publicada há 10 dias atrás
Empregos semelhantes que podem ser interessantes para vocêCom base na vaga Site Reliability Engineer em Ananindeua, PA
- ...Site Reliability Engineer We are looking for a full-time, remote Site Reliability / DevOps Engineer with 3+ years of experience in high-availability cloud environments. This role is heavily focused on AWS infrastructure, CloudFormation, and CI/CD pipeline ownership...
- ...Experiência comprovada como Engenheiro(a) de Confiabilidade de Site (SRE) ou função similar. Profundo conhecimento e experiência... ...MAIS SE TIVER: Certificações AWS (Solutions Architect, DevOps Engineer). Experiência com containers (Docker, Kubernetes)....
- ...SRE, with the remaining experience in a relevant infrastructure/engineering role; ~ Hands-on experience with Microsoft Azure and... ...classes in 12 languages; Working model: remote, hybrid or on-site in Portugal. What makes us different? Our organisational culture...
- ...! Tá afim de fazer parte de tudo isso? Dá uma olhada no que estamos buscando: Estamos em busca de um(a) DBRE (Database Reliability Engineer) para atuar em um ambiente orientado por Data Mesh , com foco em resiliência, automação e governança de dados distribuídos...
- ...can't keep up with the demand and need engineers to bring the product faster to the market... ...end, from the first webhook to long-term reliability Driving performance, cost, and... ...Final round with the team (half day, on-site if possible) By the end, you'll be wanting...
- ...Cloud Windows Infrastructure Engineer Job Purpose As Senior Windows Infrastructure Engineer... ...Service. Manage IIS in depth: sites, application pools, bindings, ARR / URL Rewrite... ...English, written and spoken. Reliable, distraction-free home office with stable...
- ...Transmission Line Engineer - LATAM Our client is seeking a Transmission Line Engineer to join their growing engineering team. This... ...successful project execution. Travel occasionally to customer sites to support project meetings, technical presentations, and project...
$ 3.000
...Mobile Engineer (React + React Native) LATAM | Remote | EST-Aligned | International Independent Contractor Compensation: USD $... ...mobile platforms with a focus on usability, performance, and reliability. Optimize application performance, rendering, routing, and...- ...Sr. Workday Application Engineer – Workday Integrations (Corporate Services) Overview Join a leading retail organization as a Senior... ...Product, Architecture, and business teams to build scalable, reliable integrations that improve operational efficiency and employee experience...
- Job Opportunity: DevOps Engineer, Network Infrastructure 12 months Brazil (Remote) *********************************Good communication... ...Network monitoring Network security systems Our focus is on reliability, scalability, efficiency, and high availability for users worldwide...
- ...AI Web Engineer (Systems & Automation) Contract | Remote We're looking for an AI web engineer who can both ship high-quality websites... ...us build a repeatable, scalable workflow for how we develop sites. This role is hands-on. You'll be building real client websites...
- ...Description: Be part of the Application Engineering Team responsible for adapting and... ...strategy, including both remote and on-site support. Shape our products by being... ...experience as a Product Manager / Methodology Engineer. Knowledge & experience with ADAS/AV...
- Position: Support/Platform Engineer Level: Mid Level Location: Brazil (Remote - can be anywhere in LATAM) We are working with a technology... ..., build, and operate scalable cloud infrastructure, enable reliable deployments, and improve platform resilience. You will work closely...
- ...We are looking for a Senior Cloud Engineer to join an international team and work remotely from Brazil, supporting global cloud infrastructure... ...scaling, and operational processes to improve efficiency and reliability Deploy and manage containerized applications using Docker...
- ...ML/AI Engineer (Databricks) Location: Remote (Brazil) | Type: Contract | Experience: 3... ...models, and AI-powered applications that run reliably in production rather than living in a... ...so the data feeding your models is reliable and well-modeled Bring LLM and retrieval...
- ...tools. We are solving this problem. Role: We are looking for a Senior Python Engineer to join our team, focusing on Python services, API integrations, scalability, and production reliability. Our Team: Our team has around 30 people working remotely across Europe....
- ...About the Role This is a senior full‐stack engineering role focused on building and scaling the... ...real operational workflows into robust, reliable software. As the team grows, you'll have... .... What Success Looks Like A successful engineer in this role: Builds scalable, reliable...
- ...Symfony framework, Composer package management, and Twig templating engine. API Experience: 3+ years experience designing and writing... ...and APIs with an emphasis on performance, scalability, and reliability. Drive Full-Stack Solutions: Work closely with DevOps on...
- We are looking to hire a Senior Payments Engineer to help build a next-generation payment orchestration platform. Our company operates... ...including API architecture, distributed systems, scalability, reliability, and performance optimization Experience with PCI compliance,...
- ...Senior Backend Engineer (PHP/Laravel) Location: Brazil About Sphise Technologies Sphise Technologies is a global outsourcing... ...EHR platform using PHP/Laravel, ensuring high performance, reliability, and scalability. Database Development: Creating database...
- ...collaborative environment. We are looking for a Senior Data Engineer (API). We are looking for a highly skilled Senior Data... ...pipelines and data platforms. Ensure scalability, performance, and reliability using microservices and concurrency best practices. Drive...
- ...foundations, and the rest will follow. The Team At Anterior, engineers share a strong “sense of product” and solve meaningful... ...customers and our core platform engineers to ensure seamless, reliable integrations into complex payer systems of record. About You...
- ...false positives, and ensure all automated outputs are secure, reliable, and free from hallucinations. Integrate automated testing... ...GraphQL, and AWS APIs (specifically S3) Must Haves: Data engineering & data testing: dbt, data lakehouse concepts, Medallion...
- Role Summary The Lead AWS Data Engineer provides technical leadership and hands‑on execution for enterprise data platforms hosted... ....Own architectural decisions related to scalability, reliability, security, and cost optimization of AWS data platforms .Define...
- ...success in a collaborative environment. We are looking for a Data Engineer Senior (Data Backbone). In this role you will: Act as the... ...engineering within the team Design and implement scalable, reliable, and high-performance data pipelines Lead architecture...
- ...Federation schemas that power the federated API gateway Partner with Engineering, Product, and UX to design and build solutions that address... ...design decisions that improve scalability, performance, and reliability Write highly performant code that supports a distributed,...
- ...platform for hospitality. We're looking for a Senior Machine Learning Engineer who is excited about the craft of machine learning—especially... ...and evaluation. Build scalable ML solutions that can be used reliably in production environments. Develop capabilities to support...
- ...strong reputation by delivering premium products to clients worldwide, emphasizing reliability and innovation. Role Description This is a contract, remote role for Rubber Engineers specializing in the tire industry. Key responsibilities include designing,...
- Senior Full-Stack Engineer (.NET / Angular / Azure) | Remote (Brazil) | $R20,000 per month We're working exclusively with a fast-scaling... ...operations globally; think live, real-time environments where reliability isn't a nice-to-have, it's the product. £50m+ transacted in...
- ...to expand ideas through the right tools, contributing to our success in a collaborative environment We are looking for a Support Engineer. In this role, you will: Collaborate closely with cross-functional teams (Regulatory Operations, IT Security, QA, Validation) to meet...
Deseja receber mais vagas?
Assine e receba vagas semelhantes a Site Reliability Engineer. Seja o primeiro a se candidatar!
