Description:When it comes to using cutting-edge machine learning to tackle complex problems, Lockheed Martin is driven by a singular mission focus and desire to continuously innovate! Today’s challenges to global security aren’t just changing – they’re accelerating faster than ever before. Through our dedication to our mission, our AI-enabled systems are changing the way militaries operate and protect their forces, the way first responders fight fires, and how researchers explore the far reaches of space and the ocean’s depths.
The Lockheed Martin Artificial Intelligence Center (LAIC) team is seeking an Infrastructure Operations Manager to support the AI Factory, an enterprise Artificial Intelligence and Machine Learning (AI/ML) platform.
Your Mission:
As the Infrastructure Operations Manager, you will be embedded in the AI Factory team working alongside MLOps platform engineers, software engineers, product managers, and data scientists. You will focus on the delivery of secure AI/ML computing resources, orchestration, automation, and services. Additionally, you will contribute to a broad range of projects across the enterprise to increase machine learning availability and value to Lockheed Martin. Your responsibilities will include the following:
• Leading a team of infrastructure and platform engineers
• Managing stability and reliability for AI platforms and environments deployed on centralized on-prem systems and cloud environments
• Operating and maintaining complex computing environments designed to train, deploy, and operate Artificial Intelligence systems
• Addressing user needs and resolving system reliability and stability concerns
• Closely collaborating with other AI/ML Engineers, Data Scientists, and Data Engineering subject matter experts
• Closely collaborating with Central IT, Cybersecurity, and Engineering teams for on-premises, isolated, and cloud deployments of the platform
• Working with the vendors to advocate for Lockheed Martin’s needs for Machine Learning and Artificial Intelligence platforms
Ideally, you are a highly skilled and dynamic individual with excellent strategic planning, problem-solving, and project management skills. You are self-motivated, resourceful, and adaptable, with a can-do attitude and strong interpersonal skills to effectively collaborate with internal stakeholders in a fast-paced and constantly evolving industry.
What’s In It For You:
From onsite to remote, we offer flexible work schedules to comprehensive benefits investing in your future and security, Learn more about Lockheed Martin’s comprehensive benefits package here.
Lockheed Martin provides the resources and the flexibility to enable inspiration and focus! If you have the passion and courage to dream big, work hard, and have fun doing what you love, then we want to build a better tomorrow with you!
Our Commitment to Diversity and Inclusion:
We Hear You, We See You. At LM Enterprise Operations we invest in people and promoting the sharing of ideas to create incredible solutions. We know that our success depends on the combined efforts of diverse-thinkers like you! At LM Enterprise Operations, we cultivate an inclusive environment that appreciates differences and unique thinking.
Our global commitment to diversity and inclusion reflects our values of doing whats right, respecting others and performing with excellence. Learn more here: Global Diversity and Inclusion.
Further Information About This Opportunity:
This is fully remote position, but requires an active DoD Secret Clearance to start
LMLAIC
Basic Qualifications:
• Leadership experience, especially within IT System Operations
• Experience with diverse IT principles, including Networking, Storage, Computing, Security, and Distributed Services
• Experience with Software Security practices, such as MFA, Least Privilege, Monitoring, Encryption, and Mutual Authentication
• Experience with Kubernetes, including distributions (Openshift, Rancher, GKE)
• Experience with Programming and Scripting, such as Python, Go, Bash
• Strong oral and written communication skills, and ability to collaborate with cross-functional partners
• Active DoD Secret Clearance
Desired Skills:
• Linux experience, including distributions (RHEL (Red Hat Enterprise Linux), Debian, UNIX, etc)
• Experience with Pipeline Automation, such as ArgoCD, Tekton, Gitlab CI/CD
• Knowledge of DevSecOps and Cloud Native software development practices
• Familiarity with Container Storage, including Container Storage Interfaces (CSI) and Persistent Volumes
• Familiarity with Pipeline and GitOps Automation, such as ArgoCD, Tekton, Gitlab CI/CD
• Familiarity with Kubernetes Automation, such as Helm or Kustomize
• Familiarity with Infrastructure Automation, such as Ansible or Terraform
• Experience with Collaboration Tools, such as Slack, Confluence, and Gitlab
• Knowledge of Monitoring and Performance, such as Prometheus, Grafana, and Thanos
• Knowledge of Image Registries, such as Quay or Harbor
• Knowledge of Storage, such as Ceph, NetApp, Object/AWS S3
• Knowledge of Machine Learning Architectures, including GPU Computing, High Performance Computing (HPC)
• Knowledge of AI/ML Orchestration tools, such as Kubeflow or OpenDataHub
Security Clearance Statement: This position requires a government security clearance, you must be a US Citizen for consideration.
Clearance Level: Secret
Other Important Information You Should Know
Expression of Interest: By applying to this job, you are expressing interest in this position and could be considered for other career opportunities where similar skills and requirements have been identified as a match. Should this match be identified you may be contacted for this and future openings.
Ability to Work Remotely: Full-time Remote Telework: The employee selected for this position will work remotely full time at a location other than a Lockheed Martin designated office/job site. Employees may travel to a Lockheed Martin office for periodic meetings.
Work Schedules: Lockheed Martin supports a variety of alternate work schedules that provide additional flexibility to our employees. Schedules range from standard 40 hours over a five day work week while others may be condensed. These condensed schedules provide employees with additional time away from the office and are in addition to our Paid Time off benefits.
Schedule for this Position: 4x10 hour day, 3 days off per week
Pay Rate:
The annual base salary range for this position in most major metropolitan areas in California and New York is $148,200 - $279,300. Please note that the salary information is a general guideline only. Lockheed Martin considers factors such as (but not limited to) scope and responsibilities of the position, candidate’s work experience, education/ training, key skills as well as spanet and business considerations when extending an offer.
Benefits offered: Medical, Dental, Vision, Life Insurance, Short-Term Disability, Long-Term Disability, 401(k) match, Flexible Spending Accounts, EAP, Education Assistance, Parental Leave, Paid time off, and Holidays.
This position is incentive plan eligible.
Pay Rate:
The annual base salary range for this position in California and New York (excluding most major metropolitan areas), Colorado, Hawaii, Maryland, Washington or Washington DC is $128,900 - $247,000. For states not referenced above, the salary range for this position will reflect the candidate’s final work location. Please note that the salary information is a general guideline only. Lockheed Martin considers factors such as (but not limited to) scope and responsibilities of the position, candidates work experience, education/ training, key skills as well as spanet and business considerations when extending an offer.
Benefits offered: Medical, Dental, Vision, Life Insurance, Short-Term Disability, Long-Term Disability, 401(k) match, Flexible Spending Accounts, EAP, Education Assistance, Parental Leave, Paid time off, and Holidays.
(Washington state applicants only) Non-represented full-time employees: accrue at least 10 hours per month of Paid Time Off (PTO) to be used for incidental absences and other reasons; receive at least 90 hours for holidays. Represented full time employees accrue 6.67 hours of Vacation per month; accrue up to 52 hours of sick leave annually; receive at least 96 hours for holidays. PTO, Vacation, sick leave, and holiday hours are prorated based on start date during the calendar year.
This position is incentive plan eligible.
Lockheed Martin is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.
The application window will close in 90 days; applicants are encouraged to apply within 5 - 30 days of the requisition posting date in order to receive optimal consideration.
Join us at , where your mission is ours. Our customers tackle the hardest missions. Those that demand extraordinary amounts of courage, resilience and precision. They’re dangerous. Critical. Sometimes they even provide an opportunity to change the world and save lives. Those are the missions we care about.
As a leading technology innovation company, ’s vast team works with partners around the world to bring proven performance to our customers’ toughest challenges. has employees based in many states throughout the U.S., and Internationally, with business locations in many nations and territories.
Experience Level: Experienced Professional
Business Unit: ENTERPRISE BUSINESS SERVICES
Relocation Available: No
Career Area: Artificial Intelligence
Type: Full-Time
Shift: First
• Linux experience, including distributions (RHEL (Red Hat Enterprise Linux), Debian, UNIX, etc)
• Experience with Pipeline Automation, such as ArgoCD, Tekton, Gitlab CI/CD
• Knowledge of DevSecOps and Cloud Native software development practices
• Familiarity with Container Storage, including Container Storage Interfaces (CSI) and Persistent Volumes
• Familiarity with Pipeline and GitOps Automation, such as ArgoCD, Tekton, Gitlab CI/CD
• Familiarity with Kubernetes Automation, such as Helm or Kustomize
• Familiarity with Infrastructure Automation, such as Ansible or Terraform
• Experience with Collaboration Tools, such as Slack, Confluence, and Gitlab
• Knowledge of Monitoring and Performance, such as Prometheus, Grafana, and Thanos
• Knowledge of Image Registries, such as Quay or Harbor
• Knowledge of Storage, such as Ceph, NetApp, Object/AWS S3
• Knowledge of Machine Learning Architectures, including GPU Computing, High Performance Computing (HPC)
• Knowledge of AI/ML Orchestration tools, such as Kubeflow or OpenDataHub