Platform Reliability Engineer
Quick Summary
Ensure the production reliability of the firm’s Linux-based platform as part of a globally distributed engineering team. Provide rapid emergency response to production infrastructure issues.
WorldQuant develops and deploys systematic financial strategies across a broad range of asset classes and global markets. We seek to produce high-quality predictive signals (alphas) through our proprietary research platform to employ financial strategies focused on market inefficiencies. Our teams work collaboratively to drive the production of alphas and financial strategies – the foundation of a balanced, global investment platform.
WorldQuant is built on a culture that pairs academic sensibility with accountability for results. Employees are encouraged to think openly about problems, balancing intellectualism and practicality. Excellent ideas come from anyone, anywhere. Employees are encouraged to challenge conventional thinking and possess an attitude of continuous improvement.
Our goal is to hire the best and the brightest. We value intellectual horsepower first and foremost, and people who demonstrate outstanding talent. There is no roadmap to future success, so we need people who can help us build it.
Technologists at WorldQuant research, design, code, test and deploy projects while working collaboratively with researchers. Our environment is relaxed yet intellectually driven. We seek people who think in code and are motivated by being around like-minded people.
Responsibilities
- 4+ years of experience in SRE, DevOps, or other infrastructure engineering roles, preferably within the financial industry.
- Strong understanding of Linux system internals, including kernel operations, memory management, and performance optimization.
- In-depth knowledge of storage technologies, particularly those used in high-performance computing (GPFS experience is a plus).
- Broad understanding of IT infrastructure components, such as networking, DNS, NTP/PTP, and NIS.
- Proficiency in system automation, monitoring, and self-healing (experience with Salt is a plus).
- Experience with container orchestration and virtualization technologies (e.g., Kubernetes, Nomad, VMware).
- Familiarity with on-premises and cloud-based HPC infrastructure (operational knowledge of Slurm and GPU is a plus).
- Understanding of AI technologies and their applications in infrastructure automation and management. Experience with or a strong interest in implementing AI/ML solutions for infrastructure optimization, anomaly detection, or predictive analytics.
- A passion for technology and automation, with a deep sense of curiosity and ownership.
- A hands-on approach to problem-solving and a demonstrable enthusiasm for technology.
- Excellent verbal and written communication skills.
What We Offer
Listing Details
- Posted: November 20, 2025
- First seen: March 26, 2026
- Last seen: April 21, 2026
Posting Health
- Days active: 25
- Repost count: 0
- Trust Level: 31%
- Scored at: April 21, 2026
