HPC Linux Systems Administrator Job Posting coming soon.

Organization

IC-Information Technology

https://lbl.taleo.net/careersection/2/jobdetail.ftl?lang=en&job=83085

 

Berkeley Lab (LBNL) addresses the world’s most urgent scientific challenges by advancing sustainable energy, protecting human health, creating new materials, and revealing the origin and fate of the universe. Founded in 1931, Berkeley Lab’s scientific expertise has been recognized with 13 Nobel prizes. The University of California manages Berkeley Lab for the U.S. Department of Energy’s Office of Science.


Position Summary:

Berkeley Lab’s Information Technology (IT) Division has an immediate opening for a HPC Linux Systems Administrator with the High Performance Computing Services Group. Under the supervision of the Group Lead or senior team members, you will participate in building, integrating and supporting Linux-based resources and working with end-users to meet the computing needs for various scientific disciplines. You may also support large high performance computing cluster systems depending on the individual's experience, aptitude and technical skillset. This position will be filled at the Computer Systems Engineer (CSE) level 2 or 3, dependent on experience.


Specific Responsibilities for CSE 2:

  • Within defined policies, procedures and practices provide Linux and system administration and user support for LBNL scientific research groups.  This includes:

    • System installations, upgrades, OS and security patches, system security hardening and intrusion detection, file system management, system hardware and peripheral management, security configuration, customization of user group working environment, troubleshooting, network monitoring, and crash recovery.

    • Assist users with program compilation, commercial and public domain software installation, and use of Linux tools.

    • Configure, administer, and troubleshoot desktop, server and storage infrastructures as well as racking, installing, and maintaining systems in a datacenter.

    • Plan, organize, prioritize and complete assigned tasks and projects in a timely manner.

    • Communicate task or project status to customers to either set or negotiate expectations.

    • Market HPCS Group services to the scientific community by providing excellent customer service coupled with competent technical support skill.

    • Participate in developing system administration, security, and network policies, documentation, and tools oriented towards efficient systems management.

In addition to the above responsibilities, the CSE 3 will:

  • Provide cluster support to LBNL and UC researchers at local and remote sites. This includes: travel to remote site if necessary, initial installation, integration and the on-going maintenance of Linux High Performance Computing cluster systems.

  • Lead HPCS technical efforts in one or more areas of HPC technologies such as job schedulers, high performance interconnects, parallel filesystems, cybersecurity, cluster management, VM infrastructure, networking, performance tuning, support of scientific applications, or data center planning.

  • Lead group projects to implement and deploy new technologies and services.


Required Qualifications for CSE 2:

  • Bachelor’s degree or equivalent experience and a minimum of 5 years of full time professional Linux system administration experience in a large distributed computing environment or the equivalent combination of education and experience.

    • Experience providing systems and end-user support for multiple scientific or computational research groups.

  • In-depth knowledge and experience with Red Hat Enterprise Linux (including derivatives such as CentOS and Scientific Linux), Debian, Ubuntu and use of large scale system administration tools such as Kickstart, CFEngine, Puppet, or in-house developed systems management tools. Support of common services such as NFS, LDAP, NIS, CIFS, MySQL, Apache.

  • Moderate knowledge of Linux internals, TCP/IP networking, software programming, and cybersecurity concepts. Must demonstrate technical understanding of Linux internals including the boot process, kernel versions, and the differences between major Linux distributions. Experience with building, patching, and modifying Linux RPMs is required. Able to quickly troubleshoot computer and storage hardware problems such as RAID devices, and be familiar with procedures to expedite or coordinate vendor service and bring resolution to outstanding problems.

  • Demonstrate programming proficiency in a procedural language such as C, C++, Java, and/or Fortran; and scripting languages such as Perl, Python. Must have experience with popular compilers (e.g. GCC, Intel), program debugging tools, use of Makefiles, use of software repositories such as GitHub or Subversion.

  • Experience with implementing solutions based on Virtual Machines (VM) technologies such as KVM, VMWare, VirtualBox etc. as well as container technologies such as Docker and Linux LXC.

  • Excellent interpersonal, communications and customer service skills and exhibit tact and good judgement. Must be able to work with multiple end-user groups where each group may have different needs and requirements.

  • Able to plan, organize, prioritize, and complete assigned tasks and projects with general supervision while providing timely updates on work progress to end-users and co-workers.

  • Ability to physically rack and cable computer equipment in a datacenter.

In addition to the above requirements, the CSE 3 will have:

  • Bachelor’s degree or equivalent experience and a minimum of 8 years of full time professional Linux system administration experience in a large distributed computing environment including 2 years experience providing support for Linux HPC clusters used for scientific research or the equivalent combination of education and experience.

  • Advanced knowledge and experience in one or more areas of HPC technologies such as Linux operating systems, job schedulers, high performance interconnects, parallel filesystems, cybersecurity, cluster management, VM infrastructure, networking, performance tuning, support of scientific applications, or data center planning.

  • Ability to plan, organize and successfully implement group projects for deploying new technologies and services.


Additional Desired Qualifications for Both Levels:

  • Experience supporting HPC systems and HPC end-users.

  • Fluency with HPC Linux clustering technologies (Job schedulers such as SLURM, MPI, Infiniband, parallel file system storage such as Lustre or GPFS).

  • Parallel programming experience.

  • Experience implementing cloud technologies such as Amazon AWS or Google Compute for production use.

  • Scientific computation programming experience.

  • Software development experience and related skills are desirable in order to provide user assistance with their applications or user environment.


This position requires collaboration with Lawrence Livermore National Laboratory (LLNL). Therefore the selected incumbent requires U.S. citizenship. The selected candidate will be subject to a Federal background investigation through LLNL.


LLNL security clearance requirements:

Anticipated Clearance Level:  Q (Position will be cleared to this level). Applicants selected will be subject to a Federal background investigation and must meet eligibility requirements for access to classified information or matter. In addition, all L or Q cleared employees are subject to random drug testing. If you hold multiple citizenships (U.S. and another country), you may be required to renounce your non-U.S. citizenship before a DOE L or Q clearance will be processed/granted.


The posting shall remain open until the position is filled, however for full consideration, please apply by close of business on December 5, 2016.


Notes:

  • This is a full time 2 year term appointment with the possibility of extension or conversion to Career appointment  based upon satisfactory job performance, continuing availability of funds and ongoing operational needs.

  • Classification will depend upon the applicant's level of skills, knowledge, and abilities.

  • Full-time, M-F, exempt (monthly paid) from overtime pay.

  • Salary is commensurate with experience.

  • This position is contingent on the successful completion of a background check.

  • Work will be primarily performed at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA.


Equal Employment Opportunity: Berkeley Lab is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or protected veteran status. Berkeley Lab is in compliance with the Pay Transparency Nondiscrimination Provision under 41 CFR 60-1.4.  Click here to view the poster and supplement: "Equal Employment Opportunity is the Law."