DevOps Engineer

Hyderabad, Telangana, India
Mar 27, 2025
Mar 27, 2026
Onsite
Full-Time
6 Years
Job Description

We are looking for a highly skilled and motivated DevOps Engineer to join our team and take ownership of designing, implementing, and managing cloud infrastructure on AWS. This role requires expertise in deploying AI and machine learning models, ensuring security and compliance, monitoring system performance, and developing backup and recovery strategies. The ideal candidate should have a strong technical background in cloud infrastructure, automation, and DevOps best practices, along with a passion for continuous improvement and efficiency.

If you are a proactive problem solver who thrives in a collaborative environment and enjoys optimizing cloud operations, we would love to hear from you!

Experience. 6 Years

Key Responsibilities

Cloud Infrastructure Management

  • Design, build, and maintain scalable and secure cloud infrastructure on AWS.
  • Configure and manage VPCs, subnets, security groups, load balancers, and other networking components.
  • Ensure optimal resource utilization and cost efficiency by monitoring and fine-tuning cloud resources.
  • Manage network configurations, including routing, NAT gateways, and network ACLs, to ensure seamless connectivity.
  • Implement infrastructure as code (IaC) using tools like Terraform, AWS CloudFormation, or Ansible to automate deployments and configurations.

AI and Machine Learning Model Deployment

  • Work closely with data scientists and engineers to deploy and manage AI and ML models using Amazon SageMaker.
  • Ensure smooth training, testing, and deployment of models in production environments.
  • Monitor model performance and optimize deployment strategies for scalability and reliability.
  • Troubleshoot issues related to ML model inference, latency, and resource utilization.

Security and Compliance

  • Implement security best practices, including least privilege access, encryption, logging, and IAM policies.
  • Ensure compliance with industry standards and regulatory requirements.
  • Conduct regular security audits and vulnerability assessments to identify and mitigate risks.
  • Establish monitoring and alerting mechanisms for proactive security incident response.

Performance Monitoring and Analysis

  • Continuously monitor and analyze system performance using AWS CloudWatch, AWS X-Ray, and other observability tools.
  • Identify and resolve performance bottlenecks to improve system efficiency.
  • Provide detailed reports with insights and recommendations for optimizing cloud workloads.
  • Implement real-time monitoring dashboards to enhance visibility into infrastructure health.

Backup and Recovery Strategies

  • Design and implement backup and disaster recovery (DR) strategies using AWS Backup, Amazon S3, and automated snapshots.
  • Ensure high availability and data integrity by conducting regular backup and restoration tests.
  • Develop automated workflows to mitigate data loss risks and ensure business continuity.

Automation and Continuous Improvement

  • Automate repetitive tasks using scripting languages (Python, Bash) and configuration management tools (Terraform, Ansible).
  • Build and maintain CI/CD pipelines to streamline development and deployment workflows using Jenkins, GitLab CI, or CircleCI.
  • Optimize infrastructure for scalability, performance, and cost efficiency.
  • Identify opportunities for process automation and DevOps best practices adoption.

Collaboration and Communication

  • Work closely with cross-functional teams, including developers, data scientists, and security engineers, to align cloud infrastructure with business needs.
  • Provide technical guidance and training to team members on DevOps tools and best practices.
  • Translate complex technical information into clear, non-technical language for stakeholders.
  • Document processes, configurations, and best practices to facilitate knowledge sharing.

Required Skills and Experience

  1. AWS Certification. AWS Certified DevOps Engineer - Professional (or equivalent).
  2. Cloud Infrastructure. Expertise in AWS networking, VPCs, subnets, security groups, and load balancers.
  3. AI/ML Deployment. Hands-on experience with Amazon SageMaker for training, testing, and deploying machine learning models.
  4. Security & Compliance. Strong understanding of IAM policies, access control, encryption, and security auditing.
  5. Performance Monitoring. Experience with AWS CloudWatch, AWS X-Ray, and other performance analysis tools.
  6. Backup & Recovery. Knowledge of AWS Backup and Amazon S3 for disaster recovery strategies.
  7. Automation & Infrastructure as Code. Proficiency with Terraform, AWS CloudFormation, and Ansible.
  8. CI/CD Implementation. Experience with CI/CD tools like Jenkins, GitLab CI, or CircleCI.
  9. Programming & Scripting. Strong scripting skills in Python, Bash, or similar.
  10. Collaboration & Communication. Excellent ability to work in cross-functional teams and explain technical concepts to non-technical stakeholders.

Why Join Us?

  • Work on cutting-edge cloud technologies and contribute to the deployment of AI and ML models at scale.
  • Be part of a collaborative and innovation-driven team.
  • Enjoy a hybrid work environment that balances office collaboration with remote flexibility.
  • Opportunities for continuous learning, skill enhancement, and career growth.

If you are an enthusiastic DevOps Engineer who is passionate about cloud computing, automation, and AI/ML integration, we invite you to apply and be a part of our dynamic team!

Apply Now!

Related Jobs