Hotline: 678-408-1354

Lead SRE/DevOps Engineer- Seattle, WA

Job Summary and Mission

If you’ve got what it takes to help transform one of the most iconic brands in the world into a lean, efficient and highly automated platform, we’d like to talk to you.

As a Lead Site Reliability Engineer – Starbucks Technology, you will be responsible for leading the day-to-day maintenance and administration of Internet-based enterprise systems and team initiatives. On an on-going basis, this position will identify root causes of operational issues in order to resolve them and training members of the teams on best practice techniques. As required, this position will help develop tools and scripts and define best practices for the team.

This position will also work closely with other teams to document the enterprise infrastructure and monitoring systems. You will also be responsible for planning and leading the execution of small to large-scale projects within the Starbucks Technology teams under the direction of the manager.

This role requires your A-Game: deep technical proficiency in both enterprise-scale systems as well as next gen cloud native applications required. So if you believe, like we do, that a cup of coffee can change a life and change our world, come check us out and help us deliver that same amazing experience to our customers around the globe.

Models and acts in accordance with Starbucks guiding principles.

Required Knowledge, Skills, and Abilities:

  • Experience working in a high capacity, highly scalable mission-critical web serving environment
  • Proven ability to participate with other functional teams in systems integration and design including writing operational specifications, test plans and requirements management with attention to detail
  • UNIX/LINUX and Windows and server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
  • Web (IIS, Apache), .Net & Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
  • Experience in at least two relevant scripting or programming languages (Ruby, Perl, Python, Shell, PowerShell, etc.)
  • Experience with Configuration Management platforms (Chef, Ansible, CFEngine, Puppet, etc.)
  • Database Administration – setup, configuration and basic database troubleshooting skills
  • Understanding of internet standards such as HTTP, DNS, FTP, SSH, HTML, XML, JDBC, ODBC, SNMP and other protocols
  • Understanding of high availability hardware and database systems design and implementation including cluster management, redundancy and failover testing
  • Knowledge of storage systems (SAN, NAS, RAID Array, etc)
  • Experience hardening and maintaining secure systems (Safe Harbor or PCI experience a plus!)
  • Network hardware architecting experience with load balancing equipment, switches, routers, and network troubleshooting
  • Ability to produce system documentation, including writing requirements, operational specifications, system architecture, test plans and as-built documentation, all with attention to detail
  • Experience working with ITIL and Service Management best practices is a plus.
  • Ability to build strong relationships and influence others across the organization
  • Demonstrated knowledge of agile project methodologies
  • 7+ years experience designing, supporting and deploying Internet-based products or services
  • 5+ years operating complex, large-scale Enterprise guest-facing Applications or web sites
  • 2+ years leading project or functional teams
Email Me Jobs Like These
Share this job

Contact Us

Eltas EnterPrises Inc.
3978 Windgrove Crossing
Suite 200A
Suwanee, Georgia
30024, USA
contact@eltasjobs.com

Subscribe to our Newsletter