Kubernetes Engineer

About the position

Join us and make YOUR mark on the World! Lawrence Livermore National Laboratory (LLNL) has turned bold ideas into world-changing impact, advancing science and technology to strengthen U.S. security and promote global stability. Our mission spans four critical national security areas (nuclear deterrence, threat preparedness, energy security, and multi-domain defense), empowering teams to take on the toughest challenges of today and tomorrow. With a culture built on innovation and operational excellence, LLNL is a place where your expertise can make a real impact.

Do you have a strong passion for containers and Cloud Native technologies? Do you understand the value containers and Kubernetes bring to managing application deployments? As a Kubernetes Engineer, you will be responsible for one of our Computing Platform’s most critical components, our Kubernetes cluster. You will help streamline its deployment, maintenance, and monitoring, and expand its capabilities to meet LLNL’s intensive scientific workloads. If you are ready to drive meaningful change and make a real impact from the ground up, this opportunity is for you.

We have an opening for an experienced Kubernetes Engineer to develop and support a 24x7 large-scale bare-metal Kubernetes environment. As a member of the team, you’ll bridge the gap between hardware and high-performance software, ensuring our scientific teams have the reliable, secure platforms needed to build world-class products. This is a high-impact mission where your expertise in on-prem automation and cluster orchestration directly accelerates our product innovation.

This position is within the Global Security Computing Applications Division (GS-CAD) of the Computing Directorate, matrixed to the Global Security Directorate. This position requires part-time on-site presence due to the nature of the work. This position will be filled at either level based on knowledge and related experience as assessed by the hiring team. Additional job responsibilities (outlined below) will be assigned if hired at the higher level.

Responsibilities

• Contribute to the design and deployment of large-scale bare-metal clusters, integrating control planes with VAST Storage arrays to deliver high-performance persistent storage.
• Contribute to the implementation of advanced cluster networking to ensure seamless, low-latency communication across multi-rack and multi-site topologies.
• Participate in building and maintaining automated, self-healing workflows using CI/CD pipelines to manage cluster lifecycles, ensuring zero-touch deployments and consistent platform health.
• Support rigorous SLIs/SLOs by engineering robust observability stacks (Prometheus, Grafana) and enforcing airtight security through RBAC, OIDC, and network isolation.
• Partner with internal business units to onboard moderately complex workloads, while elevating the technical bar for the team through design reviews and mentorship.
• Collaborate with team members on integrating AI agents to assist with troubleshooting and automating cluster management operations.
• Perform other duties as assigned.
• Collaborate with researchers on developing repeatable software stacks used for research, and automate deployment and configuration across multiple environments.
• Partner with security teams to incorporate security checks and audits for Kubernetes deployments into our GitLab CI/CD system.
• Extend Kubernetes to help simplify usage, operations, and user applications.

Requirements

• Ability to secure and maintain a U.S. DOE Q-level security clearance, which requires U.S. citizenship.
• Bachelor’s degree in computer science, software engineering, or a related technical discipline, or an equivalent combination of education and relevant experience.
• Comprehensive knowledge of and experience distributing standard configurations for clusters using configuration management tools such as Ansible or Puppet.
• Broad experience performing Kubernetes administration in a moderately complex to complex environment, including tasks such as installation, networking, security, troubleshooting, and monitoring.
• Broad experience with software development or system administration using scripting languages (e.g. Python, Bash, Perl, Ruby, Groovy).
• Strong understanding of core Kubernetes concepts such as Pods, Deployments, Services, and/or PVCs.
• Proficient interpersonal skills necessary to interact with all levels of personnel, and the ability to work independently, under limited direction, in a multi-disciplinary team environment.
• Proficient verbal and written communication skills necessary to collaborate effectively in a team environment and to present and explain technical information.
• Ability to set priorities and independently resolve moderately complex problems in a fast-paced environment.
• Advanced knowledge of and experience with Kubernetes internals, such as networking, kubelet function/responsibility, etcd and/or control plane archite
