Are you adept at running production environments and writing code? Do you understand the operational challenges of managing services at scale and prefer to address these through software engineering rather than manual intervention? Are you enthusiastic about exploring new technologies and charting new paths? If you have strong communication skills, a customer-focused mindset, and a passion for learning and problem-solving, we want to hear from you!
What You’ll Do
- Collaborate with product and engineering teams to enhance operational readiness throughout the lifecycle—from design and deployment to operation and refinement.
- Develop software layers, scripts, deployment frameworks, tracers, monitors, and self-healing/auto-remediation tools to automate processes.
- Build and maintain reusable software modules for cloud and on-premise systems automation.
- Drive business continuity by identifying and implementing opportunities for system resilience and automation.
- Work closely with the software engineering team to integrate accurate monitoring and metrics into applications before they reach production.
- Maintain comprehensive documentation of deployments, processes, and standard operating procedures, with the goal of minimizing manual runbooks through automation.
- Troubleshoot and conduct root-cause analysis for complex issues across hardware, software, database, and network stacks.
- Participate in a shared on-call schedule (follow-the-sun model) with SRE & Engineering teams.
- Advocate for and promote a lean operations culture through self-service, self-healing, and automation.
- Collaborate with product management to define SLAs and SLOs, and implement SLIs for core capabilities.
- Enhance software observability through effective monitoring, tracing, and logging.
What You Need to Succeed
- Bachelor’s or Master’s degree in Engineering (BE/B.Tech/MCA/M.Tech) or equivalent experience.
- Proficiency in one or more programming languages such as C, C++, Java, Python, or Go.
- Hands-on experience with relational databases like MySQL or PostgreSQL.
- Familiarity with calling and consuming web APIs.
- Experience operating and supporting large-scale internet-hosted applications.
- Expertise in cloud technologies (AWS, Azure, Google Cloud Platform) and related tools (Terraform, configuration management).
- Recent experience with on-premise platforms and tools (VMware, Ansible, Salt, Chef, Puppet).
- Troubleshooting and system engineering experience in Linux production environments.
- Knowledge of networking concepts, protocols, and technologies.
- Experience with CI/CD tools (Jenkins, Spinnaker, GitLab, Azure DevOps).
- Proficiency in designing, deploying, and maintaining monitoring solutions (Splunk, Nagios, Prometheus, New Relic).
- Familiarity with the AWS and Azure Well-Architected Frameworks and with resiliency/reliability patterns (Circuit Breaker, Bulkhead).
- Strong communication, interpersonal, and teamwork skills.
- Ability to work independently and own problems end-to-end.
- Familiarity with agile software development processes, including source code control.
Why Join Us
Adobe is an Equal Employment Opportunity and affirmative action employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. If you need any accommodations to navigate our website or complete the application process due to a disability, please reach out to [email protected] or call (408) 536-3015.
Adobe values open employment practices and adheres to policies that ensure a fair and legal recruitment process.