dolby9h ago
New
New
Staff Operational Support Engineer
lead
Customer SupportSupport Engineer
0 views0 saves0 applied
Quick Summary
Overview
## Role Summary Dolby OptiView is building a dedicated Operational Support (L2) team responsible for the stability, availability, and operational excellence of our 24/7 live video streaming, ads,
Technical Tools
Customer SupportSupport Engineer
## Role Summary
Dolby OptiView is building a dedicated Operational Support (L2) team responsible for the stability, availability, and operational excellence of our 24/7 live video streaming, ads, player, and realtime delivery platforms.
As an Operational Support Engineer (L2), you take end to end ownership of customerimpacting production incidents once they are triaged by Level 1 support. You operate directly on production systems, lead live incident resolution, and act as the operational bridge between Support, Engineering, DevOps, and customers, particularly during high impact live events.
This is a hands-on, customer facing role focused on incident ownership, production operations, automation, and operational scalability, not just reactive troubleshooting.
* * *
## Key Responsibilities
### Incident & Operational Support
* Take ownership of escalated customer issues from Level 1 Support and drive them to resolution
* Troubleshoot and resolve complex, high-impact production incidents affecting live streams, VOD playback, ad delivery, DRM, and real-time WebRTC services
* Operate directly on production environments, including configuration changes, CDN adjustments, and corrective actions, following established operational procedures, including executing mitigations and emergency changes during live incidents when customer impact requires immediate action
* Lead or actively contribute to live incident bridges involving customers, internal teams, and partners
* Provide clear, timely communication during incidents, including status updates and customer-facing explanations
* * *
### Infrastructure as Code & Production Operations
* Work fluently with Infrastructure as Code (IaC) to understand, troubleshoot, and safely modify production environments
* Leverage tools and frameworks such as:
* Terraform
* Helm
* Kubernetes manifests
* GitOps workflows
* CI/CD and deployment pipelines
* Use IaC as the primary mechanism for safe, auditable, and repeatable operational changes
* Collaborate with Engineering and DevOps to improve deployment reliability and operational safety
* Validate and execute infrastructure or configuration changes through codified workflows
* * *
### AI-Driven Operations & Automation
* Leverage AI tools and automation to enhance operational efficiency and incident response
* Contribute to and use:
* AI-assisted incident triage and classification
* Automated runbook execution
* AI-based pattern detection across incidents
* Intelligent alert correlation and noise reduction
* Use AI to:
* Generate or improve incident communications
* Accelerate troubleshooting workflows
* Identify recurring patterns and systemic issues
* Drive adoption of automation-first and AI-augmented operational practices
* * *
### Pre-Event Planning & Operational Readiness
* Participate in pre-event readiness planning for critical customer events
* Validate system readiness through:
* Runbook checks
* Monitoring coverage validation
* Risk identification and mitigation planning
* Define and rehearse incident response strategies for high-risk scenarios
* Collaborate with customers and internal teams to ensure smooth event execution
* * *
### On-Call & 24/7 Operations
* Participate in a 24/7 on-call rotation, including nights, weekends, and holidays, as part of a global support model
* Ensure smooth handovers between shifts and regions
* Respond to critical alerts within defined SLAs for stream health, player errors, and delivery infrastructure
* * *
### Root Cause & Continuous Improvement
* Perform or contribute to root cause analysis (RCA) for production incidents
* Document findings, corrective actions, and preventive measures
* Identify recurring issues and work with Engineering and Product teams to eliminate them permanently
* Contribute to and improve runbooks, operational playbooks, and knowledge bases for all OptiView products (Player, ads, live and real time streaming)
* * *
### Collaboration & Engineering Feedback Loop
* Work closely with Engineering teams to escalate defects, validate fixes, and support production deployments
* Provide feedback on system observability, tooling gaps, and operational risks
* Act as the operational voice during post-incident reviews
* * *
## Required Skills & Experience
### Technical Skills
* Strong experience supporting production video streaming platforms, OTT services, live systems
* Solid troubleshooting skills across distributed systems (APIs, microservices, cloud infrastructure)
* Familiarity with HLS, DASH, CMAF, WebRTC, DRM and CDN architectures
* Experience working with monitoring, alerting, and logs to diagnose live incidents (Grafana, Kibana/ELK, Prometheus, Loki)
* Correlate backend streaming metrics, player telemetry, and CDN signals to diagnose live customer issues endtoend
* Comfort performing controlled changes in production environments
* Working knowledge of incident management and on-call operations
* * *
### Operational Mindset
* Proven ability to remain calm, structured, and decisive during high-pressure incidents
* Strong sense of ownership and accountability for customer outcomes
* Excellent written and verbal communication skills, including customer-facing communication during incidents
* * *
## Nice to Have
* Experience with video player SDKs (Web, Android, iOS, React Native, Flutter)
* Knowledge of ad insertion technologies (SSAI, CSAI, SGAI)
* Familiarity with real-time streaming protocols (WebRTC, SRT, RTMP)
* Exposure to incident response frameworks
* Scripting or automation skills (Python, Bash) used in troubleshooting or operational tooling
* Prior experience in a Level 2 or Level 3 support role for mission-critical media systems
Location & Eligibility
Where is the job
—
Location terms not specified
Listing Details
- Posted
- May 27, 2026
- First seen
- May 27, 2026
- Last seen
- May 27, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 49%
- Scored at
- May 27, 2026
Signal breakdown
freshnesssource trustcontent trustemployer trust
External application · ~5 min on dolby's site
Please let dolby know you found this job on Jobera.
3 other jobs at dolby
View all →Explore open roles at dolby.
Similar Support Engineer jobs
View all →Support Engineer II
【Process Support Engineer 】地點: 新竹 | 大量招募中
Specialist Support Engineer: DataOps
Internal Support Engineer
Remote
S
SimberoboticsRemoteRobotics Deployment and Support Engineer - Contract
USD 6000–8000
ContractRemote
Escalation Support Engineer
full-timeRemote
Browse Similar Jobs
Customer Support Specialist2kCustomer Service Representative941Technical Support Engineer600Call Center Agent442Technical Support Specialist152Technical Account Manager143Product Support Specialist101Customer Care Specialist47Client Support Specialist43It Support Specialist34Service Desk Manager32Service Desk Analyst25Operations Support Specialist23Customer Support Manager22Application Support Specialist17Billing Support Specialist17NOC Technician16Support Operations Manager15Desktop Support Technician11Helpdesk Specialist8
Newsletter
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
A
B
C
D
No spam. Unsubscribe at any time.