DevOps Engineer II
Exxat
Software Engineering
Vadodara, Gujarat, India
DevOps Engineer II (Job Description)
Exxat is a US-headquartered company building a cloud-based platform that powers Health Sciences education at top universities through an all-in-one suite of tools.
We are looking for a Senior DevOps Engineer to take ownership of our cloud infrastructure, deployment processes, and system reliability. In this role, you’ll lead Kubernetes operations, manage CI/CD pipelines, and drive automation across environments. You'll work closely with engineering and product teams to enable fast, secure, and scalable delivery.
From infrastructure as code using Terraform to implementing observability with Prometheus and Grafana, you’ll shape and scale the systems that power our platforms. You’ll also lead incident response, enforce DevOps best practices, and contribute to continuous improvement in cloud security and cost optimization.
This is a high-impact role in a fast-moving, product-first environment where engineers have deep ownership and direct influence on outcomes.
Key Responsibilities
- Support the design, implementation, and maintenance of cloud infrastructure (Azure preferred) to enable scalable and reliable applications.
- Manage and enhance monitoring and observability systems using tools such as Prometheus, Grafana, Loki, and Azure Monitor.
- Develop and maintain automation scripts and tools using Bash, Python, or PowerShell to improve operational efficiency.
- Build, maintain, and enhance CI/CD pipelines (Azure DevOps, GitLab CI, Jenkins, etc.) for reliable and automated deployments.
- Work with containerized applications using Docker and support Kubernetes (AKS) operations such as deployments, scaling, and troubleshooting.
- Perform production support and troubleshooting, including analyzing logs, metrics, and system behavior to resolve issues.
- Implement and manage Infrastructure as Code (IaC) using Terraform for consistent and repeatable environments.
- Maintain and optimize alerting systems, improving signal-to-noise ratio and reducing alert fatigue.
- Collaborate with development and QA teams to improve application performance, reliability, and deployment workflows.
- Apply cloud security best practices, including access control, secrets management, and secure configurations.
- Contribute to documentation and continuous improvement of DevOps processes and platform reliability.
- Work under the guidance of senior engineers to implement scalable and reliable solutions.
Preferred Qualifications
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- 3–6 years of experience in DevOps, cloud engineering, or related roles.
- Strong understanding of cloud platforms (Azure preferred), including compute, networking, storage, and identity concepts.
- Hands-on experience with monitoring and logging tools (Prometheus, Grafana, Azure Monitor, Loki, or equivalent).
- Proficiency in scripting (Bash, Python, or PowerShell) for automation and operational tasks.
- Experience building and maintaining CI/CD pipelines and deployment workflows.
- Working knowledge of Docker and Kubernetes (AKS preferred), including deployments, scaling, and debugging.
- Experience with Infrastructure as Code (Terraform or similar).
- Good understanding of Linux systems, networking basics, and troubleshooting techniques.
- Strong problem-solving skills and ability to work independently on operational tasks.
- Effective communication and collaboration skills
Nice to Have Skills
- Exposure to GitOps workflows (e.g., ArgoCD).
- Experience with log aggregation systems (Loki, ELK, etc.).
- Basic understanding of SLOs/SLIs and reliability concepts.
- Experience with cloud cost optimization and resource tuning.
- Exposure to database services (SQL/NoSQL) in cloud environments.
Role Expectations
- Independently handle day-to-day DevOps operations and troubleshooting.
- Contribute to improving automation, monitoring, and reliability across the platform.
- Collaborate with senior engineers and gradually take on higher ownership and complex tasks.
- Continuously learn and apply best practices in cloud, observability, and DevOps workflows.