Platform Operations Engineer

About the job

About the Platform Operations Engineer role

The Platform Operations Engineer plays a critical role in maintaining and enhancing government agencies’ mission-critical on-premises and cloud infrastructure platforms. This position combines operational excellence with strategic modernisation initiatives, ensuring robust platform reliability while driving the adoption of industry best operational practices. Working within a collaborative environment, you will support infrastructure transformation while maintaining seamless service delivery for Sentosa IT systems.

Key Responsibilities:

  • Infrastructure Management & Operations
    • Manage and optimise core infrastructure platforms, including compute, storage, virtualisation, and related systems across development, staging, and production environments.
    • Ensure consistent platform performance through proactive monitoring, capacity planning, and lifecycle management of virtualisation platforms and GCC 2.0 Azure environments.
  • Operational Excellence & Automation
    • Implement and uphold platform standards while driving infrastructure automation initiatives to improve efficiency and reliability.
    • Champion modern operational practices, including configuration management, to streamline processes and reduce manual effort.
  • Security & Compliance
    • Execute monthly server patching and update cycles across GCC 2.0 Azure and on-premises environments to maintain a strong security posture.
    • Implement and manage security controls such as access frameworks, system hardening, and continuous compliance monitoring.
  • Incident Management & Support
    • Deliver expert L1/L2 technical support for platform-related incidents, ensuring timely problem diagnosis and resolution.
    • Collaborate with application and operations teams to maintain platform stability, optimise performance, and ensure scalability within defined SLAs.
  • Modernisation & Innovation
    • Support containerisation efforts and manage hybrid cloud solutions for both modern and legacy workloads.
    • Contribute to infrastructure modernisation and innovation projects aligned with enterprise architecture and government technology strategies.
  • Business Continuity & Documentation
    • Maintain comprehensive backup, disaster recovery, and high-availability solutions for critical infrastructure components.
    • Develop and update platform documentation, operational runbooks, and SOPs to ensure continuity, standardisation, and effective knowledge transfer.

Requirements:

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or related technical discipline
  • Minimum 3-5 years of experience in infrastructure operations, platform engineering, or related technical roles
  • Proven track record in supporting large-scale infrastructure modernization initiatives within enterprise environments
  • Cloud Platforms: Demonstrated experience with GCC 2.0 Azure services, hybrid cloud architectures, and cloud-native technologies
  • Operating Systems: Advanced proficiency in Linux and Windows Server administration, including performance tuning and troubleshooting
  • Containerisation: Practical knowledge of container technologies including Docker, Kubernetes, and container orchestration platforms
  • Infrastructure as Code: Experience with automation tools and IaC practices using technologies such as Terraform, Ansible, or similar platforms
  • Virtualisation: Understanding of Nutanix hyperconverged infrastructure concepts and virtualisation best practices
  • Networking: Solid grasp of networking concepts, protocols, and technologies including TCP/IP, DNS, load balancing, and network security
  • Scripting & Automation: Proficiency in scripting languages including Python, PowerShell, and Bash for automation and operational tasks
  • Monitoring & Observability: Experience with monitoring platforms, logging systems, and observability tools for proactive infrastructure management
  • Understanding of Government IM8 compliances requirement and/or best industry practices
  • Demonstrated experience in maintaining high-availability systems and managing critical infrastructure platforms

Preferred Qualifications:

  • Industry certifications in cloud platforms (Azure, AWS), virtualization technologies, or infrastructure management
  • Experience with DevOps practices and CI/CD pipeline implementation
  • Knowledge of ITIL framework and service management practices
  • Previous experience in government or regulated industry environments
  • Understanding of cybersecurity frameworks and compliance requirements

Added Advantage:

  • Responsible, dependable, and able to commit to standby duties
  • Calm under pressure, with good problem-solving and escalation judgment
  • Detail-oriented with strong documentation discipline
  • Team player with eagerness to learn