Job Information
Blue Origin LLC Site Reliability Engineering II in Seattle, Washington
Application close date: Applications will be accepted on an ongoing basis until the requisition is closed.
At Blue Origin, we envision millions of people living and working in space for the benefit of Earth. We're working to develop reusable, safe, and low-cost space vehicles and systems within a culture of safety, collaboration, and inclusion. Join our diverse team of problem solvers as we add new chapters to the history of spaceflight!
We are a diverse team of collaborators, doers, and problem-solvers who are relentlessly committed to a culture of safety. This position will directly impact the history of space exploration and will require your commitment and detailed attention towards safe and repeatable space flight. Join us in lowering the cost of access to space and enabling Blue Origin's vision of millions of people living and working in space to benefit Earth.
We invite you to be a part of our mission as a Site Reliability Engineer. Your focus will be to maintain and enhance our source code management (SCM) infrastructure, debug and develop robust CI/CD pipelines, and, most importantly, support our developers. You will also be expected to participate in on-call rotations to provide support for our critical developer services. The ideal candidate will have a strong background in SCM tools such as GitLab/GitHub and CI/CD tools such as GitLab Runner/GitHub Actions, experience in Golang or Python for automation and backend services, and React programming experience for GUI applications. We are looking for someone to apply their technical expertise, leadership skills, and commitment to quality to positively impact safe human spaceflight.
Passion for our mission and vision is required!
What makes our Site Reliability Engineering team successful?
A strong bias for automating everything.
Technical breadth and depth with a strong understanding of emerging trends.
A natural curiosity. You will be tasked with problems that nobody knows how to solve, and you will need to figure out the solution with minimal direction or guidance.
Humility and a willingness to operate in unfamiliar domains.
A strong "customer first" personality and desire to be a subject matter expert.
Our tech stack at a glance:
Amazon Web Services
GitLab/GitHub
Nexus/Artifactory
Kubernetes and Docker
Linux
Terraform, CloudFormation, and Ansible
Go, React, and Python
Responsibilities:
Configure, deploy, scale, and administer open source and commercial software.
Administer and scale our SCM and artifact repositories, ensuring best practices in branching, tagging, and versioning are followed.
Design, implement, and maintain CI/CD pipelines, optimizing build and deployment processes to increase developer productivity.
Develop and maintain scripts and automation tools using Golang or Python to streamline development operations.
Monitor system performance, proactively identifying and resolving bottlenecks in collaboration with the development teams.
Manage artifact repositories and ensure the secure storage and retrieval of build artifacts.
Engage in on-call rotation duties to troubleshoot, diagnose, and resolve urgent issues affecting the developer platforms, minimizing downtime.
Continuously evaluate and recommend improvements to our source code management and automation practices.
Document systems, processes, and procedures to enhance the knowledge base and foster a learning culture.
Collaborate closely with software engineering teams to align SRE