Platform Operations Engineer
About the job
About the Platform Operations Engineer role
The Platform Operations Engineer plays a critical role in maintaining and enhancing government agencies’ mission-critical on-premises and cloud infrastructure platforms. This position combines operational excellence with strategic modernisation initiatives, ensuring robust platform reliability while driving the adoption of industry best operational practices. Working within a collaborative environment, you will support infrastructure transformation while maintaining seamless service delivery for Sentosa IT systems.
Key Responsibilities:
- Infrastructure Management & Operations
- Manage and optimise core infrastructure platforms, including compute, storage, virtualisation, and related systems across development, staging, and production environments.
- Ensure consistent platform performance through proactive monitoring, capacity planning, and lifecycle management of virtualisation platforms and GCC 2.0 Azure environments.
- Operational Excellence & Automation
- Implement and uphold platform standards while driving infrastructure automation initiatives to improve efficiency and reliability.
- Champion modern operational practices, including configuration management, to streamline processes and reduce manual effort.
- Security & Compliance
- Execute monthly server patching and update cycles across GCC 2.0 Azure and on-premises environments to maintain a strong security posture.
- Implement and manage security controls such as access frameworks, system hardening, and continuous compliance monitoring.
- Incident Management & Support
- Deliver expert L1/L2 technical support for platform-related incidents, ensuring timely problem diagnosis and resolution.
- Collaborate with application and operations teams to maintain platform stability, optimise performance, and ensure scalability within defined SLAs.
- Modernisation & Innovation
- Support containerisation efforts and manage hybrid cloud solutions for both modern and legacy workloads.
- Contribute to infrastructure modernisation and innovation projects aligned with enterprise architecture and government technology strategies.
- Business Continuity & Documentation
- Maintain comprehensive backup, disaster recovery, and high-availability solutions for critical infrastructure components.
- Develop and update platform documentation, operational runbooks, and SOPs to ensure continuity, standardisation, and effective knowledge transfer.
Requirements:
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or related technical discipline
- Minimum 3-5 years of experience in infrastructure operations, platform engineering, or related technical roles
- Proven track record in supporting large-scale infrastructure modernization initiatives within enterprise environments
- Cloud Platforms: Demonstrated experience with GCC 2.0 Azure services, hybrid cloud architectures, and cloud-native technologies
- Operating Systems: Advanced proficiency in Linux and Windows Server administration, including performance tuning and troubleshooting
- Containerisation: Practical knowledge of container technologies including Docker, Kubernetes, and container orchestration platforms
- Infrastructure as Code: Experience with automation tools and IaC practices using technologies such as Terraform, Ansible, or similar platforms
- Virtualisation: Understanding of Nutanix hyperconverged infrastructure concepts and virtualisation best practices
- Networking: Solid grasp of networking concepts, protocols, and technologies including TCP/IP, DNS, load balancing, and network security
- Scripting & Automation: Proficiency in scripting languages including Python, PowerShell, and Bash for automation and operational tasks
- Monitoring & Observability: Experience with monitoring platforms, logging systems, and observability tools for proactive infrastructure management
- Understanding of Government IM8 compliances requirement and/or best industry practices
- Demonstrated experience in maintaining high-availability systems and managing critical infrastructure platforms
Preferred Qualifications:
- Industry certifications in cloud platforms (Azure, AWS), virtualization technologies, or infrastructure management
- Experience with DevOps practices and CI/CD pipeline implementation
- Knowledge of ITIL framework and service management practices
- Previous experience in government or regulated industry environments
- Understanding of cybersecurity frameworks and compliance requirements
Added Advantage:
- Responsible, dependable, and able to commit to standby duties
- Calm under pressure, with good problem-solving and escalation judgment
- Detail-oriented with strong documentation discipline
- Team player with eagerness to learn

