Site Reliability Engineer
Quick Summary
At Grasshopper, you will be working in a diverse and dynamic environment with a flat hierarchy. With over 100 employees and 15 nationalities working in an open office,
Grasshopper is a quantitative trading technology provider based in Singapore, and is the holding company of Grasshopper Asset Management. Our state-of-the-art technology, built from the ground up in-house, puts us at the forefront of developments in electronic trading. An unbroken record of consistency and profitability is underpinned by firm values of curiosity, empowerment and flexibility.
About the Role
~1 min readAs a Site Reliability Engineer on the Infrastructure Team, you will play a key role in strengthening the reliability, scalability, and operational efficiency of our platform. You will work closely with cross-functional teams to design, build, and operate robust systems across our Google Cloud and on-premise infrastructure, with a focus on observability, automation, and production stability.
- Design, implement, and maintain robust observability systems, including monitoring, logging, tracing and alerting, to ensure high availability, rapid incident detection, and deep system visibility across all services.
- Architect, develop and maintain scalable solutions on Google Cloud and on-premise infrastructure
- Advancing and supporting our research platform capabilities
- Investigate infrastructure/application issues on a live production system
- Working together with developers to improve our development environment, including CI/CD, built tools, etc.
- Help drive an SRE mindset within the organisation
- 3–5 years of hands-on experience in Platform, SRE, or Infrastructure Engineering.
- Experience working in a trading, research, or compute-intensive environment (e.g., research platforms, backtesting systems, large-scale batch processing, or AI/ML workloads) is preferred.
- Solid engineering fundamentals in Linux, systems, networking, debugging, and distributed systems.
- Strong problem-solving and analytical skills, with a structured approach to troubleshooting and root cause analysis.
- Practical experience operating and supporting Kubernetes-based systems in production.
- Familiarity with GitOps workflows, using tools such as Argo CD and CI/CD platforms (e.g., GitLab CI).
- Good understanding of cloud infrastructure (GCP or AWS), including deployment, scaling, and basic networking concepts.
- Programming experience in Python or Go, with a focus on automation and reliability tooling.
- Strong collaboration and communication skills, with attention to detail.
- Self-motivated, adaptable, and comfortable working in a fast-moving technical environment.
- Curiosity and willingness to learn new systems, tools, and technologies.
- Previous exposure to kubernetes operators.
- Experience with clean and maintainable Terraform for declarative infrastructure management.
- Prior knowledge and experience in on-premises bare metal environments.
- Familiarity with configuration management tools such as Puppet, Chef, or Ansible.
- Experience with Argo-CD and Argo Workflows for workflow automation.
- Working knowledge of monitoring and observability tools such as Prometheus and or OpenTelemetry (OTel) ecosystem.
- Familiarity with RedHat and CentOS-based Linux distributions.
- Prior contributions to open-source projects..
- Experience working with large-scale workflow, batch, or HPC-style compute workloads, including scheduling and execution reliability.
What We Offer
~1 min readAt Grasshopper, you will be working in a diverse and dynamic environment with a flat hierarchy. With over 100 employees and 15 nationalities working in an open office, communication is essential to performance. To keep our edge as the “small giant” of trading technology, we give employees a high level of autonomy and encourage them to get creative, take risks, make mistakes and learn from them. The sprint is on!
Grasshopper is an equal opportunity employer.
Listing Details
- Posted
- February 25, 2026
- First seen
- March 26, 2026
- Last seen
- April 16, 2026
Posting Health
- Days active
- 20
- Repost count
- 0
- Trust Level
- 23%
- Scored at
- April 16, 2026
Signal breakdown
Please let Grasshopperasia know you found this job on Jobera.
3 other jobs at Grasshopperasia
View all →Explore open roles at Grasshopperasia.
Similar Site Reliability Engineer jobs
View all →Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.