Executive recruitment company Monroe Consulting Group's Technology Division is partnering with a leading software company that specializes in advanced security solutions.
Our distinguished client is currently seeking a highly skilled Senior Platform Engineer to contribute to the development of their cutting-edge software products and collaboration with a team of engineers.
You will be responsible for the R&D/platform management and site reliability of secure and scalable software solutions. You will play a crucial role in working closely with a team of engineers, ensuring best practices in platform engineering and system design.
This position offers an exciting opportunity to work with the latest technologies in cybersecurity in both on premise and cloud computing.
Key Responsibilities
- Design, implement, and maintain highly available Kubernetes clusters for mission-critical applications in both public cloud and on-premise infrastructure.
- Develop and enhance cloud-native platforms that support cybersecurity software products.
- Architect scalable, resilient, and secure containerized environments.
- Conduct performance tuning, resource optimization, and cost analysis across Kubernetes workloads.
- Troubleshoot cluster and container runtime issues; perform root cause analysis and implement long-term fixes.
- Collaborate with software engineers to containerize applications and ensure smooth CI/CD workflows.
- Implement monitoring, logging, and alerting solutions for proactive cluster health management.
- Mentor engineers on Kubernetes, DevOps practices, and cloud-native architecture.
Requirements
- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- Minimum 5-7 years of professional software development and/or DevOps engineering experience.
- Strong expertise with Kubernetes (production deployments, scaling, upgrades, troubleshooting).
- Experience with container technologies such as Docker and Containerd.
- Familiarity with cloud platforms (AWS, GCP, Azure) and Kubernetes distributions (Rancher, EKS, GKE, AKS, OpenShift).
- Hands-on experience with infrastructure as code (Terraform, Helm).
- Hands-on experience with Rancher on Proxmox for on-premise Kubernetes cluster.
- Knowledge of networking, service meshes (e.g., Istio, Calico), and Kubernetes security best practices.
- Experience with monitoring and logging tools (Prometheus, Grafana, ELK, OpenTelemetry).
- Strong problem-solving skills, with ability to optimize system reliability and performance.
- Excellent communication and teamwork skills.
- Prior experience in cybersecurity, observability, or large-scale distributed systems is a plus.