Cision14d ago
Staff Site Reliability & DevOps Engineer - Observability
OtherDevOps & InfrastructureSite Reliability EngineerSite ReliabilityInfrastructure & Cloud
0 views0 saves0 applied
Quick Summary
Key Responsibilities
• Design, build, and operate observability platforms based on Grafana and Prometheus • Define and maintain metrics standards, dashboards, alerts, and SLOs • Improve signal quality: reduce alert noise,
Technical Tools
OtherDevOps & InfrastructureSite Reliability EngineerSite ReliabilityInfrastructure & Cloud
At Cision, we believe in empowering every individual to make an impact. Here, your voice is heard, your ideas are valued, and your unique perspective fuels our collective success. As part of our global team, you'll thrive in an environment that champions curiosity, collaboration, and innovation, all while making meaningful contributions to the brands we accelerate.
Join us in shaping the future of communication and building authentic connections that matter. Whether you're solving complex problems or driving bold innovations, your growth is our success, and together, we’ll create the conversations of tomorrow.
Empower your impact at Cision. Be seen, be understood, be you.
This role focuses on designing, operating, and evolving observability platforms with a strong emphasis on metrics, logging, and alerting. The primary tooling is Grafana and Prometheus, with responsibility for ensuring production systems are observable, reliable, and operable at scale. The role works closely with platform, infrastructure, and application teams.
Key responsibilities:
• Design, build, and operate observability platforms based on Grafana and Prometheus
• Define and maintain metrics standards, dashboards, alerts, and SLOs
• Improve signal quality: reduce alert noise, tune thresholds, and improve runbooks
• Support incident response by providing actionable telemetry and post-incident analysis
• Integrate metrics, logs, and traces across distributed systems
• Work with engineering teams to instrument services correctly
• Automate observability configuration using infrastructure as code
• Contribute to reliability improvements through capacity planning and performance analysis
• Required skills and experience
• Strong experience with Prometheus (scraping, federation, recording rules, alerting)
• Strong experience with Grafana (dashboards, alerting, templating, RBAC)
• Solid Linux and networking fundamentals
• Experience running observability stacks in Kubernetes environments
• Infrastructure as code experience (Terraform preferred)
• Familiarity with incident management and on-call practices
• Ability to debug production systems using metrics and logs
• Design, build, and operate observability platforms based on Grafana and Prometheus
• Define and maintain metrics standards, dashboards, alerts, and SLOs
• Improve signal quality: reduce alert noise, tune thresholds, and improve runbooks
• Support incident response by providing actionable telemetry and post-incident analysis
• Integrate metrics, logs, and traces across distributed systems
• Work with engineering teams to instrument services correctly
• Automate observability configuration using infrastructure as code
• Contribute to reliability improvements through capacity planning and performance analysis
• Required skills and experience
• Strong experience with Prometheus (scraping, federation, recording rules, alerting)
• Strong experience with Grafana (dashboards, alerting, templating, RBAC)
• Solid Linux and networking fundamentals
• Experience running observability stacks in Kubernetes environments
• Infrastructure as code experience (Terraform preferred)
• Familiarity with incident management and on-call practices
• Ability to debug production systems using metrics and logs
Nice to have:
• Experience with logs and traces (e.g. Loki, Tempo, OpenTelemetry)
• Experience operating large-scale or multi-cluster Kubernetes platforms
• Experience with cloud platforms (GCP, AWS, OCI)
• Exposure to SRE concepts such as error budgets and SLO-driven prioritisation
What success looks like
• Engineers trust dashboards and alerts to reflect system health
• Incidents are detected earlier and diagnosed faster
• Alert fatigue is reduced and on-call quality improves
• Observability is treated as a first-class platform capabilit
As a global leader in PR, marketing and social media management technology and intelligence, Cision helps brands and organizations to identify, connect and engage with customers and stakeholders to drive business results. PR Newswire, a network of over 1.1 billion influencers, in-depth monitoring, analytics and its Brandwatch and Falcon.io social media platforms headline a premier suite of solutions. Cision has offices in 24 countries throughout the Americas, EMEA and APAC. For more information about Cision's award-winning solutions, including its next-gen Cision Communications Cloud®, visit www.cision.com and follow @Cision on Twitter.
Cision is committed to fostering an inclusive environment where all employees can be their authentic selves and perform at their best. We believe diversity, equity, and inclusion is vital to driving our culture, sparking innovation and achieving long-term success. Cision is proud to have joined more than 600 companies in signing the CEO Action for Diversity & Inclusion™ pledge and named a “Top Diversity Employer” for 2021 by DiversityJobs.com.
Cision is proud to be an equal opportunity employer, seeking to create a welcoming and diverse environment. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, or other protected statuses.
Cision is committed to the full inclusion of all qualified individuals. In keeping with our commitment, Cision will take the steps to assure that people with disabilities are provided reasonable accommodations. Accordingly, if reasonable accommodation is required to fully participate in the job application or interview process, to perform the essential functions of the position, and/or to receive all other benefits and privileges of employment, please contact hr.support@cision.com
Please review our Global Candidate Data Privacy Statement to learn about Cision’s commitment to protecting personal data collected during the hiring process.
Location & Eligibility
Where is the job
Hungary
Remote within one country
Who can apply
HU
Listed under
Hungary
Listing Details
- Posted
- April 15, 2026
- First seen
- April 15, 2026
- Last seen
- April 29, 2026
Posting Health
- Days active
- 14
- Repost count
- 0
- Trust Level
- 44%
- Scored at
- April 29, 2026
Signal breakdown
freshnesssource trustcontent trustemployer trust

Cision
greenhouse
Cision Ltd. is a public relations and earned media software company and services provider, offering solutions for media monitoring, content distribution, and communications analysis.
View company profileExternal application · ~5 min on Cision's site
Please let Cision know you found this job on Jobera.
3 other jobs at Cision
View all →Explore open roles at Cision.
Similar Site Reliability jobs
View all →Staff Site Reliability & DevOps Engineer - Observability
Remote
O
OrigisenergySite Reliability/Data Engineer
H
HeartflowincStaff/Lead Site Reliability Engineer (SRE)
$201k–$251k/yr
Staff Site Reliability Engineer
USD 196033-245041
Staff Site Reliability Engineer
USD 180000-225000
Staff Site Reliability Engineer
USD 180000-225000
Browse Similar Jobs
Manager2.8kFitness & Wellness2.1kData Collector1.9kAssistant Manager1.8kEngineer1.7kDirector1.6kAssociate1.3kConsultant1.2kBehavioral Health1.1kSocial Work & Counseling1kSocial Worker990Assistant964Social782Technician713Analyst685Operations Associate571Coordinator564Psychiatric Mental Health Nurse Practitioner487Development482Staff Engineer480
Newsletter
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
A
B
C
D
No spam. Unsubscribe at any time.