San Antonio, Texas, USA
69 days ago
Cloud Systems Administrator II - TS/SCI w/POLY

Senior Linux System Administrator to perform functions in a dynamic, operational environment supporting cloud-based repositories. A successful candidate has experience working with large Hadoop- and Accumulo-based clusters and interfacing with hardware, software, security, and network teams. Experience with Bash, Python, and configuration management tools such as Puppet is preferred. Responsibilities include monitoring systems and resolving or escalating network and data flow alerts for cloud systems hardware, software, or network issues. Experience with Hadoop- and Accumulo-based clusters and a strong background in troubleshooting operational system issues as they arise are required. The role operates in a fast-paced mission environment where requirements change daily due to operational and world events.

 

Additional responsibilities include the following:

* Monitor system health
* Troubleshoot system problems
* Maintain storage systems
* Interface with external teams for hardware, network, and infrastructure support
* Provide after-hours on-call/call-in support
* Ensure system security requirements are satisfied
* Participate in engineering discussions
* Create/maintain system scripts
* Patch/upgrade system
* Administer user accounts
* Maintain hardware inventory

#dvscyber
#divergent


* At least three (3) years of experience performing system administration and monitoring of a large distributed system consisting of: multiple clusters; clustering implemented across at least 3 racks of equipment; a minimum of 60 nodes per site.
* Experience diagnosing and troubleshooting large-scale cloud computing systems, including familiarity with distributed systems for storage and retrieval of data, e.g. Hadoop, Cassandra, Scality, Swift, Gluster, Lustre, GPFS, Amazon S3, or another comparable technology for big data management or high-performance computing.
* Demonstrated ability to work within a pre-defined, mission-focused team structure, follow SOPs, communicate effectively, accept constructive feedback, and receive technical guidance and advice from senior-level technical resources.
* Demonstrated willingness to learn new technologies and leverage senior-level resources to expand current technical foundation using the team structure.
* Demonstrated ability to work independently on complex tasks and a willingness to educate and train more junior technical resources.
* Demonstrated ability to plan, communicate, lead, and oversee complex technical tasks requiring interaction with multiple groups.
* Five (5) years of experience writing scripts using scripting languages including Bash, Perl, or Python.
* Seven (7) years of experience demonstrating a fundamental understanding and working knowledge of core components of the Linux operating system, including the management of user and group accounts in LDAP and the configuration of DHCP, DNS, and TFTP.
* Demonstrated experience with configuration management tools including Puppet and Salt.
* Expert understanding of the end-to-end Linux PXE/network provisioning process, including familiarity with Anaconda Kickstart configurations, RAID controller utilities, TFTP images, and disk-detect scripts.
* Experience accessing and troubleshooting systems via remote utilities to perform hardware diagnosis and repair, including VNC, serial-over-LAN interfaces, IPMI, and BIOS-level configuration.
* Understanding of overall corporate architecture as well as familiarity with OpenSSL and Java keystore manipulation.
* Expert in troubleshooting commodity hardware platforms, including previous experience with SGI/HP hardware such as SGI's J series.

Preferred:

* Advanced knowledge of SSH tunneling and protocols, including the implementation of dynamic SOCKS proxies, as well as other SSH-based utilities including rsync, pdsh, pdcp, and WinSCP.
* Basic understanding of low-level network concepts including VLANs, port channel bonding, and layer 2/layer 3 switch interactions.
* Familiarity with software load balancers for large-scale web service implementations including HAProxy and NGINX.
* Experience with Kubernetes orchestration services and Docker images.
* Experience with log aggregation and search tools including Elasticsearch, Logstash, Filebeat, Grafana, and rsyslog.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.