Key Responsibilities:

  • Troubleshoot and resolve issues in AWS, Linux, and Windows environments, ensuring high availability and performance.
  • Lead incident response efforts, identifying root causes and implementing long-term solutions to prevent recurrence.
  • Implement and manage patch management processes to ensure systems are up to date and secure.
  • Assign tasks and coordinate team activities to ensure timely project delivery.
  • Collaborate with offshore and onsite teams to align on project goals and objectives.
  • Participate in the architectural design and planning of new features and systems.
  • Maintain documentation related to processes, architectures, and workflows.
  • Drive Site Reliability Engineering (SRE) practices, focusing on system performance, reliability, and monitoring.
  • Hands-on experience with AWS services (VPC, EC2, EBS, RDS, ALB, ASG, IAM, S3, etc.) and Linux.
  • Strong conceptual knowledge of DevOps practices, CI/CD pipelines, and Infrastructure as Code (Terraform, CloudFormation, etc.).
  • Familiarity with monitoring and managing complex containerized production environments.
  • Familiarity with application and infrastructure monitoring and log management tools (CloudWatch, Prometheus, ELK, New Relic, Datadog, Splunk, Grafana, etc.).
  • Familiarity with automating routine operational tasks.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 5+ years of experience in a leadership role, managing medium to large-scale projects.
  • Strong troubleshooting experience with AWS, Linux, and Windows systems.
  • Proficient in web services management using Apache and Nginx.
  • Excellent communication and interpersonal skills, with the ability to work effectively with diverse teams.
  • Experience coordinating between offshore and onsite teams is highly desirable.
  • Strong organizational skills and the ability to manage multiple tasks simultaneously.

Preferred Skills:

  • Knowledge of CI/CD practices and tools.
  • Familiarity with AWS Compute services and architecture.
  • Experience with monitoring and logging tools for system reliability.
  • Understanding of SRE principles and practices, including SLAs, SLOs, and error budgets.

Job Overview:

  • Location

    Bangalore

  • Cloud Service Lead

    Developer

  • Work Mode

    On-Site

  • Salary

    Up to 15 LPA

  • Notice

    Immediate - 15 days
