Breeze is hiring- join us!
The Site Reliability Engineer will be responsible for the availability and reliability of critical platform services and applications. They will work closely with the Software Development Team in order to devise effective release schedules as well as designing and implementing deployment processes and tools that will best support the continuous improvement of Breeze production environments. The Site Reliability Engineer will also be responsible for service delivery, reliability, scalability, monitoring, and helping define all this as immutable infrastructure-as-code.
Here’s what you’ll do
- Manage core production systems, including frequent changes and updates
- Rapidly identify and resolve problems in production systems
- Handle complex service faults
- Develop or implement tools to improve our monitoring of infrastructure
- Improve technologies and processes while maintaining a rapid agile delivery cycle
- Deploy and maintain highly available, scalable, critical applications on cloud-native microservices architecture
- Implement automation, effective monitoring, and infrastructure-as-code
- Deploy and maintain CI/CD pipelines across multiple environments
- Support and work alongside cross-functional engineering teams on the latest technologies
- Iterate on best practices to increase the quality & velocity of deployments
- Sustain and improve the process of knowledge sharing throughout the engineering team
- Have on-call responsibilities in rotation with the engineering team
- Achieve performance measures and adhere to established standards in conjunction with Breeze Aviation Group Values of Safety, Kindness, Integrity, Ingenuity and Excellence
Here’s what you need to be successful
Minimum Qualifications
- 4-year degree in Computer Science or equivalent additional practical experience
- 4+ years experience maintaining and deploying highly available, fault-tolerant systems at scale
- A drive towards automating repetitive tasks (e.g. scripting via Bash, Python, Ruby, etc.)
- Practical experience with containerization and clustering
- Experience with AWS (e.g. IAM, EC2, VPC, ELB, ALB, Autoscaling, Lambda, EKS)
- Version control system experience (e.g. Git)
- Experience implementing CI/CD pipelines
- Experience with configuration management
- Familiarity with PaaS
- Experience with infrastructure-as-code
- Experience with troubleshooting and monitoring distributed systems in Dev and Prod environments
- Understanding of system and networking concepts and troubleshooting techniques
- Comfortable with Linux & Network administration including load balancing, routing, firewalls, VPN
- Experience in infrastructure security
- High performance orientation, ability to work well under pressure, prioritize projects, meet deadlines, and maintain flexibility
- Strong attention to detail, organization, and time management skills
- Self-starter must have a positive attitude and strong desire for success
- Complete projects on time with minimal supervision, ability to work varied hours when necessary to meet deadlines
- Ability to read, write, speak, and understand the English language
Preferred Qualifications
- Graduate degree in CS or other relevant field of study
- Experience using EKS, Terraform and Ansible
Skills/Talents
- Exemplifies Breeze’s safety culture, values, and mission
- Excellent oral and written communication skills
- Excellent problem-solving skills
- Ability to work with individuals and teams at all levels in the organization
Perks of the Job
- Health, Vision and Dental
- Health Savings Account with Breeze Employee Match
- 401K with Breeze Employee Match
- PTO
- Travel on Breeze and other Airlines too!
#li-remote