Senior Manager - Site Reliability Engineering, Cloud Operations

Palo Alto, California

Job ID: R1903301-2. Date posted: Mar. 15, 2019

As a Senior Manager in CPE Cloud Operations, you will work with and manage a team of seasoned systems, network, security, database, automation, and storage specialists responsible for the complex operational issues and tasks found in our global cloud environment. This group is challenged with solving operational issues using code and is adopting a strong DevOps mindset.

This team builds and operates one of the largest VMware private clouds in the world, if not the largest, and is responsible for the full end-to-end lifecycle of the service. Given the scale and high demand, it is essential that the SRE team manages infrastructure efficiently, leveraging practices such as configuration management, Infrastructure as Code, and efficient auto-remediation.

Success in this role requires strong experience managing engineering-focused teams, an aptitude for distributed systems, and attention to minute details. You need well-developed systems and code-level troubleshooting abilities. You are expected to analyze complex system behaviors and performance problems, and to trace issues across multiple systems.


Responsibilities:

  • Directly manage a team of senior-level specialists
  • Manage initiatives across a globally distributed workforce
  • Operate in a dynamic cloud services environment
  • Develop automation, mature processes, and design tools to improve the cloud lifecycle
  • Participate in troubleshooting, capacity analysis and planning, and performance analysis
  • Work with internal engineering, product management, and other strategic teams to create and articulate VMware’s vision for the software-defined data center, including global cloud infrastructure architecture, network, systems and storage, virtualization design, cloud operating models and tool frameworks, and other supporting technologies and processes


Basic Qualifications:

  • Minimum of 7 years of experience managing and developing a highly technical team within large (>$1 billion) companies
  • Familiarity with cloud-based computing services like AWS, Rackspace Cloud, Azure, etc.
  • Hands-on operational experience in a critical production service environment
  • Multiple years of experience with the following technologies: Systems Administration (Linux/Windows), Networking (LAN, WAN), Storage, and Virtualization
  • Proven technical troubleshooting and performance tuning experience
  • Ability to attract, motivate, and retain top talent
  • Excellent verbal and written communication skills
  • Excellent teamwork and leadership skills

Preferred Qualifications:

  • Thorough understanding of the cloud service delivery infrastructure ecosystem, operational processes, and orchestration models
  • Experience in creating and governing reference architecture processes and artifact portfolios that clearly link business and technical requirements to the strategic architectures, solution designs and technology strategies being used to satisfy them
  • Ability to articulate strategy at CxO levels
  • Experience with writing scripts and tools to diagnose and address issues (Python, Ruby, Ansible)
  • Experience with integration tools like Jenkins or StackStorm
  • Relevant technical certifications (MCSE, CCNA, VCP, etc.)