Job details
Location: | Hong Kong |
Salary: | Negotiable |
Job Type: | Permanent |
Discipline: | |
Reference: | 83528_1752138758 |
Posted: | about 22 hours ago |
Job description
A global hedge fund is looking for a Site Reliability Engineer (SRE) to join their dynamic technology team. The SRE will play a critical role in ensuring the reliability, availability, and performance of trading platforms and infrastructure. The role will work closely with software engineers, operations, and quantitative researchers to enhance systems and processes.
The job:
- Design, implement, and maintain scalable and reliable infrastructure.
- Monitor system performance and reliability to proactively address issues.
- Develop and maintain tools for automation of deployment and operations.
- Collaborate with development teams to enhance application performance and reliability.
- Implement and manage CI/CD pipelines for efficient software delivery.
- Conduct post-mortem analyses of outages and incidents to improve system resilience.
- Manage cloud infrastructure and services, optimizing for cost and performance.
- Participate in on-call rotations to provide support for production systems.
The candidate:
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 3+ years of experience in a Site Reliability Engineering, DevOps, or systems engineering role.
- Strong experience with Linux/Unix systems and networking.
- Proficiency in scripting languages (e.g., Python, Bash) and configuration management tools.
- Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Knowledge of database systems (SQL and NoSQL) and data storage solutions.
- Excellent problem-solving skills and the ability to work under pressure.
