VMware Staff II Site Reliability Engineer (Tech Lead), Tanzu Observability - Opportunity for Working Remotely in Wichita, Kansas
100% Remote Opportunity- can work anywhere in the U.S!
The Elevator Pitch: Why will you enjoy this new opportunity?
The Tanzu Observability by Wavefront Site Reliability Engineering team is growing with a laser focus on ensuring the Tanzu Observability SaaS offering operates reliably and at scale. What differentiates our product in the current observability landscape is our scalable and extremely powerful data platform and UI, and you will play a key role in taking our platform to the next level.
This role offers many opportunities for applying your creativity and skills to a cutting-edge cloud-native observability platform. We are a growing team which delivers a SaaS product that is used 24/7 by development and site-reliability teams at leading enterprises such as Reddit, Snowflake, Intuit, Box, Workday, and many more!
We are looking for someone who will play a lead role in building and operating the world's best real-time data collection and visualization system. You will be responsible for the reliability of the platform, continuously sharing your learnings and best practices with others. You’ll make critical automation and platform architecture decisions, guiding others as they navigate cloud platforms, linux systems, and infrastructure as code, and work closely with developers to ensure reliability is engineered within the product.
The fully-remote SRE team you’ll join is responsible for thousands of cloud instances in multiple regions at scale and our footprint continues to grow! You should be experienced and enjoy working remotely within a fully remote and distributed team.
What is the primary need, technical challenge, and/or problem you will be responsible for?
The Tanzu Observability SRE team is going through a significant transition from a purely operational team to a reliability engineering mindset. This requires the guidance of a strong technical leader who has achieved system scale, who is passionate about automation, infrastructure as a code, configuration as a code, implementing and improving SLIs, and is disciplined at eliminating alert fatigue and toil.
You will partner closely with software engineers, product managers, the release team, and SaaS Value Engineers as you design and implement the necessary automation and infrastructure services that allow us to scale our footprint.
Success in the Role: What are the performance goals over the f irst 6-12 months you will work toward completing?
The Tanzu Observability platform achieves an SLA of 99.95%,
SLIs and SLOs are defined and measured for critical services
A platform that can scale to meet the needs of the business
SREs spend >50% of their time on enduring engineering work
A technical roadmap guiding the teams’ initiatives for the next year.
Additionally, you will participate in the on-call rotation with a commitment to blameless retrospectives and tracking of action items within a fully remote and geographically distributed team.
What type of work will you be doing? What assignments, requirements, or skills will you be performing on a regular basis?
Participate as a key member of the TObs SRE Leadership team.
Act as a technical leader on the team through mentoring others, working collaboratively with the Tanzu Observability product engineering team, and demonstrating strong scoping and project execution skills.
Passionate about learning new technologies and adopting the right tools to manage these services in production, keeping SLOs and MTTR in mind at all times.
Participate in the learning culture at Tanzu Observability and attend and maybe even give tech talks.
Build a deep understanding of the Tanzu Observability architecture, discover failure points, and work with other teams to design solutions to prevent future issues.
Drive reliability improvements within the product by providing feedback to the product management and design teams, influenced by a commitment to using the Tanzu Observability service for monitoring production environments (act as customer zero).
Identify, scope and build tools to reduce the regular operational load on SREs and SWEs.
What is leadership like for this role? What is the structure and culture of the team like?
- The hiring manager for this role is Elisa Binette, Director of Site Reliability Engineering overseeing the Tanzu Observability SaaS SRE group, a critical component of the SaaS Foundations program for Tanzu products. Elisa joined VMware to add her Reliability industry expertise to this fast growing group. Prior to this role, Elisa spent almost two decades leading engineering teams across a broad range of industries and technologies. #tanzu-re
What are the benefits and perks of working at VMware?
You and your loved ones will be supported with a competitive and comprehensive benefits package. Below are some highlights, or you can view the complete benefits package by visiting www.benefits.vmware.com .
Employee Stock Purchase Plan
Medical Coverage, Retirement, and Parental Leave Plans for All Family Types
Generous Time Off Programs
40 hours of paid time to volunteer in your community
Rethink's Neurodiversity program to support parents raising children with learning or behavior challenges, or developmental disabilities
Financial contributions to your ongoing development (conference participation, training, course work, etc.)
Healthy and local inspired snacks in all our pantries when visiting an office
This job may require the candidate to travel and/or work from a facility that requires full vaccination prior to entry.
Category : Engineering and Technology
Subcategory: Site Reliability
Experience: Business Leadership
Full Time/ Part Time: Full Time
Posted Date: 2022-04-29
VMware Company Overview: At VMware, we believe that software has the power to unlock new opportunities for people and our planet. We look beyond the barriers of compromise to engineer new ways to make technologies work together seamlessly. Our cloud, mobility, and security software form a flexible, consistent digital foundation for securely delivering the apps, services and experiences that are transforming business innovation around the globe. At the core of what we do are our people who deeply value execution, passion, integrity, customers, and community. Shape what’s possible today at http://careers.vmware.com.
Equal Employment Opportunity Statement: VMware is an Equal Opportunity Employer and Prohibits Discrimination and Harassment of Any Kind: VMware is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at VMware are based on business needs, job requirements and individual qualifications, without regard to race, color, religion or belief, national, social or ethnic origin, sex (including pregnancy), age, physical, mental or sensory disability, HIV Status, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, past or present military service, family medical history or genetic information, family or parental status, or any other status protected by the laws or regulations in the locations where we operate. VMware will not tolerate discrimination or harassment based on any of these characteristics. VMware encourages applicants of all ages. Vmware will provide reasonable accommodation to employees who have protected disabilities consistent with local law.