We are seeking a motivated Site Reliability Engineer (SRE) to join our TechOps team. This role is ideal for a coder, hacker, or engineer who is passionate about solving problems at their root cause in an elegant and sustainable way. Our TechOps team is dedicated to building and supporting the foundational tools that our product teams use to create products our customers love and trust. We value simplicity, reliability, consistency, and speed in our delivery pipeline.
As a Site Reliability Engineer at Daxko, you will thrive if you have a deep love for automation, building scalable systems, embracing new technologies, and sharing knowledge with teammates.
Key Responsibilities
- Apply cloud computing skills (AWS, VMWare) to deploy upgrades and fixes
- Design, develop, and implement software integrations based on user feedback
- Troubleshoot production issues and coordinate with the development team to streamline code deployment
- Implement automation tools and frameworks (CI/CD pipelines)
- Collaborate with team members to improve engineering tools, systems, procedures, and data security
- Work with core components such as load balancers, firewalls, etc.
- Conduct systems tests for security, business continuity, performance, and availability
- Develop and maintain design and troubleshooting documentation
- Monitor system activity 24/7 as part of an on-call rotation
- Execute our disaster recovery plan, ensuring it is up-to-date and thoroughly tested
- Travel required: Less than 5%
- No budget responsibilities
Required Skills/Abilities
- Excellent verbal and written communication skills
- Excellent interpersonal and customer service skills
- Strong analytical and problem-solving skills
- Excellent time management skills with a proven ability to meet deadlines
- Ability to prioritize tasks and delegate them when appropriate
- Ability to function well in a high-paced and sometimes stressful environment
- Strong command of software-automation production systems (Jenkins and Selenium)
Qualifications
- Bachelor’s degree in a technical discipline OR equivalent experience
- Two (2) to three (3) years’ experience in a DevOps/SRE role
- Expertise in software development methodologies
- Experience with DevOps tools like Git and GitLab
- Extensive experience with automation tools such as Terraform, Chef, or Ansible
Preferred Education/Experience
- Four or more (4+) years’ experience in Linux and Windows management background
- Proficient in AWS and/or VMWare technologies
- Understanding of internet technologies (DNS, SNMP, HTTP, TCP/IP, CDNs)
- Experience with monitoring technologies (Logicmonitor, Instana, NewRelic, Rapid7, CloudPassage, etc.)
- Experience working tickets and managing priorities within issue tracking systems (Jira, etc.)
- Experience in multiple scripting languages such as Perl, Python, Java, Bash
- Experience with containers and orchestration (Docker, K8s, Rancher, AKS, EKS)
- Working knowledge of Microsoft SQL, MySQL, and/or Postgres
Additional Information
Daxko is dedicated to pursuing and hiring a diverse workforce. We are committed to diversity in the broadest sense, including thought and perspective, age, ability, nationality, ethnicity, orientation, and gender. The skills, perspectives, ideas, and experiences of all our team members contribute to the vitality and success of our purpose and values.
If you are a driven individual with a passion for automation, scalability, and new technologies, and you thrive in a collaborative environment, we encourage you to apply for the Site Reliability Engineer position at Daxko.