DevOps Engineer

Full Time
Remote
Posted
Job description

Summary:

Penguin Computing Managed Services provides dedicated, remote, Linux systems administration for complex, integrated environments involving high-performance computing, cloud, and enterprise systems. This position requires the ability to understand, document, modify, configure, administer, troubleshoot, and resolve issues regarding Penguin’s current configuration management code. Successful candidates must have excellent communication skills, a friendly demeanor, the ability to work with others, and to remain calm, focused, and organized.

Essential Duties and Responsibilities:

  • Support, improve, document and write unit tests for all Ansible configuration management code for a Linux-based, high-performance computing (HPC) and artificial intelligence (AI) environment.
  • Understand, monitor and measure customer requirements and project KPIs
  • Encourage and build automated processes.
  • Implement automation and testing tools to transform IT infrastructure
  • Provide technical review, verification, and validation of the software code developed in the project.
  • Provide general Linux system administration.
  • Plan the team’s activities and involvement in project management activities.
  • Manage stakeholders and external interfaces
  • Define development, test, release, update, and support processes for DevOps operation
  • Troubleshoot and remediate tool issues
  • Support and improve AWS s3 automation workflow
  • Identify and deploy cybersecurity measures by continuously performing vulnerability assessment and risk management
  • Incidence management and root cause analysis
  • Coordination and communication within the team and with customers
  • Strive for continuous improvement and build continuous integration, continuous development, and constant deployment pipeline (CI/CD Pipeline)
  • Provide periodic reports on progress to management and the customer.

Preferred Qualifications:

  • Red Hat Certified Systems Administrator (RHCSA) or better.
  • Experience with bare metal provisioning (PXE and kickstart).
  • Experience with KVM virtualization.
  • Experience with use and configuration of Nagios and Prometheus.
  • AWS Datasync or other s3 tools.
  • Some knowledge of C as relates to Linux kernel/driver development.

Experience/Education:

  • Bachelor’s Degree in Computer Science, Computer/Electrical Engineering, or a related field (or equivalent experience).
  • 5 years working on Linux-based infrastructure.
  • Excellent understanding of (Ansible, Chef, or Puppet), Python, Perl, and Bash scripting
  • Proven professional experience with source control management.
  • Working knowledge of various tools, open-source technologies, and cloud services
  • Configuration and managing databases such as MySQL, Mongo
  • Proven professional experience with CI/CD pipelines (GitLab-CI, GitHub Actions, Jenkins, etc.)
  • Knowledge of Nvidia GPU architecture, drivers and monitoring APIs.
  • Practical knowledge of implementation and administration of High-Performance Computing (HPC) technologies, including cluster resource management, job scheduling, etc.
  • Configuration and managing databases such as MySQL, Mongo
  • Understanding of network technologies, architectures, and protocols.
  • Awareness of critical concepts in DevOps and Agile principles
  • Ability to communicate clearly and effectively

Travel:

Up to 30%

Work Authorization:

Must be eligible to work in the U.S. without restriction or sponsorship.

Penguin Computing is an Affirmative Action/Equal Opportunity Employer and is strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to age, national origin, race, ethnicity, creed, gender, disability, veteran status, or any other characteristic protected by law.

Job Type: Full-time

Pay: $125,000.00 - $169,036.70 per year

Benefits:

  • 401(k) matching
  • Dental insurance
  • Health insurance
  • Parental leave
  • Vision insurance

Schedule:

  • Monday to Friday

Supplemental pay types:

  • Bonus pay

Application Question(s):

  • Are you a citizen of the US?

Experience:

  • Linux: 5 years (Required)
  • Ansible: 2 years (Required)

Work Location: Remote

www.arclintfl.com is the go-to platform for job seekers looking for the best job postings from around the web. With a focus on quality, the platform guarantees that all job postings are from reliable sources and are up-to-date. It also offers a variety of tools to help users find the perfect job for them, such as searching by location and filtering by industry. Furthermore, www.arclintfl.com provides helpful resources like resume tips and career advice to give job seekers an edge in their search. With its commitment to quality and user-friendliness, www.arclintfl.com is the ideal place to find your next job.

Intrested in this job?

Related Jobs

All Related Listed jobs