Job Title: Lead Site Reliability Engineer (SRE) 5
Job Code: 18392
Job Location: Greenville, TX
Schedule: 9/80 (Every other Friday off)
Job Description:
The Lead Site Reliability Engineer (SRE) serves as the technical lead in a Linux environment in support of Agile Development Engineering Teams. The Lead Site Reliability Engineer will deploy, configure, and support DevSecOps tools related to automation, testing, monitoring, and cybersecurity. You will be part of a team responsible for making other teams more efficient and successful. The candidate must be a self-motivated team player with excellent technical and communication skills.
Essential Functions:
- Support all phases of the software development life cycle, including requirements analysis, design, implementation, integration, and test
- Mentor junior engineers
- Deploy and maintain GitLab instances
- Create and manage CI/CD templates and pipelines
- Deploy and maintain on-premises cloud environments
- Integrate with a multidisciplinary, multilevel team of Software Engineers supporting development environments
- Work closely with cross-functional members of the engineering organization to develop and evaluate hardware/software interfaces, operational performance requirements, and the design of the overall system
- Develop software test procedures, software programs, and related documentation
- Learn and stay up to date on current DevOps trends and tools
- Support senior staff in documenting operational processes, configurations, and diagrams required for management, security operations, and the software teams
- Perform project management tasks, including defining epics and breaking work down into tasks
- Perform other duties as assigned
Qualifications:
- Ability to obtain DoD Security Clearance, which requires U.S. Citizenship
- Bachelor of Science Degree in Computer Science, Computer Engineering, or Electrical Engineering
- At least 5 years of professional software development experience, including usage of version control systems
- Advanced working knowledge of the Linux Operating System (Red Hat)
- Experience with DevOps tools
- Experience in designing and implementing end-to-end continuous delivery pipelines
- Experience with and understanding of SRE principles for highly scalable and reliable systems
- Strong experience with Configuration Management and Infrastructure as Code
Preferred Additional Skills:
- Strong Python and/or other programming skills, including experience with modern debuggers
- Knowledge of troubleshooting complex distributed systems
- Strong experience utilizing object-oriented design methodologies
- Experience with any of the following:
  - Agile methodologies, including use of the Jira tool
  - SQL and database architecture
  - Static code analysis
  - Creating and improving documented procedures and/or playbooks
  - Scripting in modern languages (Python, Bash)
  - Certifications: RHCSA, RHCE, RHCSS, RHCDS, Security+, CISSP, CISM