Sr. DevOps Engineer/ Sr. SRE

Mohali , Punjab , India

29 Jan 2025

Job Title - Sr. DevOps Engineer

Experience Level: 5 to 10 years

Work Location - Onsite, Mohali

5 days working (Mon to Fri). Must be flexible to work in a 24 x 7 work environment.

Education Qualifications: Graduate in Computer Science / Information Security or raelated field

 

Job Description:

As a Senior DevOps Engineer in a SaaS company, the Role will be leading and managing a team of Junior DevOps Engineers, overseeing the planning and execution of DevOps strategies, and ensuring the smooth operation of the company’s cloud infrastructure. The responsibilty also includes for designing, building, and maintaining the infrastructure that supports SaaS applications and services, ensuring security, reliability, scalability, and cost-effectiveness.

This role demands strong leadership, technical expertise in cloud platforms (AWS, GCP, Azure, OCI), CI/CD pipelines, container orchestration (Kubernetes, Docker), automation (Terraform, Ansible), and proactive monitoring. The Sr. DevOps Engineer will work closely with Software Developers, DevOps Engineers, and other stakeholders to continuously improve infrastructure performance, optimize deployments, and enforce security best practices.

Job Responsibilities:

Key Responsibilities

1. Team Leadership & Management

  • Lead, mentor, and manage a team of Junior DevOps Engineers, fostering collaboration and technical growth.
  • Define DevOps strategies, roadmaps, and best practices, ensuring seamless execution of tasks using JIRA.
  • Plan and delegate responsibilities for infrastructure setup, automation, monitoring, and security.
  • Conduct performance reviews, one-on-one coaching, and knowledge-sharing sessions to enhance team efficiency.

2. Infrastructure Design & Scalability

  • Architect, deploy, and manage cloud-based infrastructure on Oracle Cloud Infrastructure (OCI), GCP, or AWS with a focus on high availability (HA) and disaster recovery (DR).
  • Optimize SaaS application environments for scalability, auto-scaling, load balancing, caching, and etc..
  • Ensure minimal downtime and optimal performance for increasing workloads by achieving SLA.

3. CI/CD Pipeline & Release Automation

  • Design, maintain, and optimize CI/CD pipelines using Jenkins, Bitbucket Pipelines, and Docker repositories.
  • Automate build, test, and deployment processes, improving release velocity while ensuring rollback mechanisms.

4. Containerization & Kubernetes Orchestration

  • Deploy and manage Docker-based applications and Kubernetes clusters (OCI Kubernetes Engine - OKE).
  • Implement Helm charts, ArgoCD, Horizontal Pod Autoscaling (HPA), Vertical Pod Autoscaling (VPA), and Cluster Autoscaler.
  • Ensure zero-downtime deployments and self-healing mechanisms for SaaS applications.

5. Cost Optimization & Resource Management

  • Monitor OCI, AWS, or GCP cloud usage and optimize costs through rightsizing and resource utilization.
  • Implement FinOps (Cloud Financial Management) best practices to optimize SaaS operational expenses.
  • Utilize cost analytics tools (OCI Cost Explorer, GCP Billing, etc.) to track and reduce cloud spending.

6. Monitoring, Logging & Incident Management

  • Implement proactive monitoring solutions using Prometheus, Grafana, New Relic, Site24x7, etc.
  • Set up centralized logging with ELK stack, Loki, Fluentd for application and infrastructure logs.
  • Participate in incident response, conduct Root Cause Analysis (RCA), and ensure rapid resolution of system outages.

7. Security, Compliance & DevSecOps

  • Integrate security scanning tools (SAST - SonarQube, DAST - OWASP, Burp Suite, Docker Scout, Trivy) into CI/CD pipelines.
  • Enforce IAM, RBAC, secrets management, and access control policies to protect sensitive data.
  • Ensure compliance with ISO 27001, SOC 2, HIPAA, PCI DSS, NIST, GDPR.
  • Perform security audits, vulnerability assessments, and penetration testing to enhance security posture.

8. Database Administration & Optimization

  • Manage and optimize MySQL, MariaDB, Cloud SQL, Oracle DB with replication, failover, sharding, database tunning and caching.
  • Implement backup, disaster recovery, and query optimization strategies for improved performance.

9. Business Continuity & Disaster Recovery (BCDR)

  • Develop and maintain disaster recovery (DR) plans ensuring infrastructure resilience against failures.
  • Conduct failover testing and disaster recovery drills to validate DR plans.
  • Implement multi-region deployments to guarantee business continuity.

 

Qualifications and Skills:-

  1. 5+ years of experience in DevOps, or Cloud Engineering in a SaaS environment.
  2. Expertise in cloud platforms (OCI, AWS, GCP) and cloud cost optimization strategies.
  3. Strong experience with CI/CD automation, Kubernetes, Docker, Helm, and Terraform.
  4. Hands-on knowledge of monitoring, logging, and incident response best practices.
  5. Proficiency in DevSecOps practices, including security scanning, IAM, and compliance frameworks.
  6. Experience with database optimization, performance tuning, and disaster recovery planning.
  7. Excellent leadership, communication, and collaboration skills to work cross-functionally with teams.

Mid-Senior Level

Full Time

Mohali Punjab India


Share Job Opening