Site Reliability Engineer
Quick Summary
Experience with monitoring & Observability stacks such as Grafana and Prometheus; Kubernetes, Cloud and Hashicorp experience is valued; Knowledge or experience with AWS or GCP.
Feedzai is the world’s first RiskOps platform for financial risk management, and the market leader in safeguarding global commerce with today’s most advanced cloud-based risk management platform, powered by machine learning and artificial intelligence. Feedzai is securing the transition to a cashless world while enabling digital trust in every transaction and payment type. The world’s largest banks, processors, and retailers trust Feedzai to protect trillions of dollars and manage risk while improving the customer experience for everyday users, without compromising privacy. Feedzai is a Series D company and has raised $282M to date. With a valuation of $2 billion, our technology protects 1 billion consumers and 90 billion transactions each year.
With Cloud at its core, the Platform Engineering area supports our product development life cycle, from development through testing and deployment to operations and maintenance, enabling a DevOps way of working. Formed by engineers and managed by engineers, at Feedzai, you will find one of the most talented teams out there, from junior to senior engineers.
While building the best value for our customers, you will work with a wide range of technical challenges. Such as building distributed systems that need to operate 24/7 with ultra-low latencies, plus cooperating with other teams towards high performance and reliability.
We are fast-paced and provide a safe, open, and collaborative environment that encourages us to lean in, try new things and discover our potential with continuous learning for everyone.
If you are passionate about distributed systems, performance, reliability on cloud environments and like challenges of low latencies and high throughput systems, this may be the job for you.
You’ll be part of Feedzai Platform Engineering Performance & Reliability team. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. As part of this team you’ll have the opportunity to manage the complex challenges of scale which are unique to Feedzai Fraud detection mission, while working with talented platform engineers in complexity analysis and large-scale system design, developing the automation, tooling and platforms that support Feedzai top notch cloud service.
- Provide recommendations about capacity allocation considering cost, resilience and performance.
- Work together with product teams to support best practices and drive improvements on systems performance and reliability before and after they go live;
- Development with Go, Python or similar languages;
- Automate all aspects of cloud infrastructure and incident response;
- Develop playbooks related to actionable alerts;
- Participate in incident response, root cause investigation and resolution;
- Maintain and develop our infrastructure as code (IaC) to manage and operate end-to-end lifecycle operations (monitoring, alerting, security, cost optimization, configuration, backup, etc.) in production environments;
- Utilize your experience and problem solving skills to help prevent and investigate production issues.
- A bachelor's degree in Computer Science, Information Systems, or the equivalent combination of education, experience, and training;
- Programming skills (Go, Python or similar languages);
- 3+ years of experience in data structures, algorithms, programming, asynchronous & multithreaded designs
- 3+ years of experience with building scalable and distributed cloud services
- 3+ years operating production environments
- 2+ years of experience in cross team collaboration within a supportive role
- Self-driven & motivated, with a strong work ethic and a passion for problem solving;
- Systematic problem-solving approach, coupled with effective verbal and written communication skills.
- Experience being oncall.
Requirements
~1 min read- Experience with monitoring & Observability stacks such as Grafana and Prometheus;
- Kubernetes, Cloud and Hashicorp experience is valued;
- Knowledge or experience with AWS or GCP.
#LI-Remote #LI-LS1
You will be immersed in our brand with training, connections, and one-on-one time with your manager. You may shadow your colleagues virtually or onsite at an office depending on where you work as you are supported through your Feedzai journey. In addition, you will have access to a ton of information to give you history, context, and all the knowledge you can handle about Feedzai and the team. Finally, you will start working on projects and collaborating on work currently being done. We can't wait to have you join the team!
Listing Details
- First seen
- April 3, 2026
- Last seen
- April 26, 2026
Posting Health
- Days active
- 23
- Repost count
- 0
- Trust Level
- 31%
- Scored at
- April 26, 2026
Signal breakdown
Feedzai is a global leader in AI-driven fraud prevention, dedicated to protecting financial institutions and their customers from fraud and financial crime.
View company profilePlease let Feedzai know you found this job on Jobera.
4 other jobs at Feedzai
View all →Explore open roles at Feedzai.
Similar Devops Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.