Senior Site Reliability Engineer - Waltham, MA
Dentsply Sirona is the world’s largest manufacturer of professional dental products and technologies, with a 130-year history of innovation and service to the dental industry and patients worldwide. Dentsply Sirona develops, manufactures, and markets a comprehensive solutions offering including dental and oral health products as well as other consumable medical devices under a strong portfolio of world-class brands. Dentsply Sirona’s products provide innovative, high-quality and effective solutions to advance patient care and deliver better and safer dentistry.
Dentsply Sirona's Waltham, MA location is hiring a Sr. Site Reliability Engineer to join a global team that will ensure system reliability and performance. Together, this team will act as 24/7 emergency 2nd/3rd level support for products, restoring services ASAP when downtime occurs. This role is partially remote, providing a mix of working remotely and in the office.
KEY RESPONSIBILITIES
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
-
Partner with development and operations teams to improve services through rigorous testing and release procedures; perform root cause analyses and implement solutions.
-
Partner with architecture teams.
-
Improve existing systems through automation and uplifts.
- Participate in system design consulting and platform management.
- Balance feature development speed and reliability with well-defined service-level objectives.
ACCOUNTABILITIES
-
Run the production environment by monitoring availability and taking a holistic view of system health.
- Build software and systems to manage platform infrastructure and applications.
-
Improve reliability and quality of products in our microservice architecture.
-
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.
-
Act as 24/7 emergency 2nd/3rd level support for products; restore services ASAP when downtime occurs.
EDUCATION AND EXPERIENCE
-
Bachelor's or Master's degree in Computer Science or Software Engineering or relevant experience.
-
At least 5 years' experience in a Site Reliability Engineering / Platform Engineering / DevOps role or similar.
-
Excellent troubleshooting skills and proven experience resolving production downtime with immediate and long-term solutions.
- A deep understanding of algorithms, data structures, complexity analysis and software design.
-
Good analytical skills coupled with excellent communication skills; professional English is required, German is a bonus.
- At least Google Associate Cloud Engineer certification, higher certifications are a bonus.
TECHNICAL SKILLS
- Experience with Kubernetes and GCP cloud both as an admin and user.
- Previous software development experience in one of: Golang, C++, or any other modern programming language; Flutter experience is a bonus.
- Extensive knowledge of relational databases, file systems and Linux.
- Familiarity with monitoring tools (e.g. Datadog) and project tracking software (e.g. Jira).
- Proficiency in building / maintaining CI and CD pipelines.
- Experience working with container orchestration platforms such as Kubernetes.
- Good understanding of systems automation and IT Security.
Dentsply Sirona is an Equal Opportunity/ Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, sexual orientation, disability, or protected Veteran status.
#J-18808-Ljbffr