Site Reliability Engineering (SRE) Architect

Dell

Granger, TX

Full Time

Expires On: 04/01/2026


Join us to do the best work of your career and make a profound social impact as a Site Reliability Engineering (SRE) Architect on our Site Reliability Engineering Team in Austin, Texas.

We are seeking a highly experienced Site Reliability Engineering (SRE) Architect to lead the design, evolution, and reliability of our large-scale distributed systems. The ideal candidate will demonstrate deep expertise in Dynatrace, AIOps platforms, observability engineering, and AI-driven automation, including hands-on development with AI agents and modern coding frameworks.
This is a technical leadership role requiring architecture-level thinking, strong coding ability, and the ability to drive enterprise-wide transformation.

Every Dell Technologies team member brings something unique to the table.

Architecture & Reliability Engineering
~ Design and architect highly reliable, scalable, and self-healing systems across hybrid, multi-cloud, and on-prem environments

~ Establish reliability patterns, guardrails, and architecture standards including SLIs, SLOs, error budgets, and resiliency patterns

~ Lead root cause prevention strategies, chaos engineering practices, and resilience validation frameworks

Application Performance Monitoring (APM) 

~ Infrastructure monitoring

~ Real user monitoring (RUM)

~ Architect deterministic and AI-driven alerting, Davis AI configurations, and service-level dependency mapping

AIOps & Automation
~ Lead adoption and integration of AIOps platforms (Dynatrace Davis AI, ServiceNow AIOps, Moogsoft, or equivalent)

~ Build intelligent automation pipelines for:

~ Auto-remediation

~ Drive automation-first operations to reduce toil and improve operational efficiency

Coding & AI Agents
~ Develop and integrate AI agents capable of:

~ Workflow automation

~ Write high-quality code in languages such as Python, Go, TypeScript, or Java

~ Build internal tools, automation frameworks, and platform APIs

~ Partner with SRE teams, platform engineering, application engineering, cybersecurity, and infrastructure groups

~ Provide architectural governance, participate in design reviews, and influence engineering standards

~ Mentor engineers on reliability, observability, and automation best practices


Bachelor’s degree with 12+ years of experience, Master’s or PhD with 8+ years of experience, or an equivalent combination of education and experience

Benefits and Perks of working at Dell Technologies
You can explore the overall benefits experience that awaits you as a Dell Technologies team member right now at MyWellatDell.

If you’re looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we’re looking for you.
 
Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. Read the full Equal Employment Opportunity Policy here.
 
