As a SRE, you will join our team to help grow our systems into best-in-class for efficiency, stability, observability, velocity, and scale in the e-commerce space, engage with the product and engineering team from Day 1 to design, build and maintain the system / software proactively
Influence the design and architecture of Wayfair system as part of Cloud Enablement journey; collaborate with development teams to design scalable and reliable systems, considering aspects such as fault tolerance, availability and performance
Influence the design and architecture of Wayfair system as part of Cloud Enablement journey; collaborate with development teams to design scalable and reliable systems, considering aspects such as fault tolerance, availability, and performance
Work with both software engineers and fellow SREs to optimize and develop repeatable systems for the two sides to leverage each other. There’s a wide range of opportunities to both guide the broad conversation and dive into the nuance of our code & architecture
Help service owners build realistic SLOs, set SLAs and error budgets, and ensure production services have “reliability” built into their design
What You’ll Need:
3+ years experience working as a SRE Software Engineer in a SRE role, or software development with an understanding of cloud infrastructure
Experience with cloud platforms GCP, AWS, Azure, and containerization technologies (e.g. Docker, Kubernetes)
Experience with server-side software engineering (Java, Go, Perl, Python etc)
Design experience with distributed systems, microservices architecture, and related technologies
Strong understanding of monitoring and alerting, with a focus on performance monitoring and tracing instrumentation & SLI/SLO/SLAs
Experience decoupling monolith services a plus
Knowledge of CI/CD pipelines and version control systems (e.g., Git).
Excellent communication skills across engineers, product managers, and business stakeholders alike