The challenge
We're at a pivotal stage in the evolution of our cloud platform. To continue scaling efficiently and strengthening reliability, we are evolving our virtualization platform within our IaaS environment. Our infrastructure supports mission-critical services, where performance, stability and continuous improvement are key.
As a Virtualization Engineer, your mission will be to design, deploy, operate and evolve virtualization infrastructures based on KVM in production environments. You will contribute directly to the evolution of the virtualization layer, participate in architecture decisions, and help improve infrastructure-related products.
You will play a key role in resolving critical incidents, analyzing high-impact escalations from the operations team, and optimizing performance, stability and efficiency of the platform. You will also contribute to reducing operational friction through automation and continuous improvement.
You'll be part of a highly collaborative engineering environment, working closely with Network, Systems, Storage and Operations teams to solve cross-functional issues and ensure the platform evolves in a scalable and reliable way.
Collaboration will be essential. You will support infrastructure decisions, participate in architecture changes, migrations and capacity expansions, proactively identify improvements, and ensure the platform continues to evolve with a strong technical foundation.
Requirements that are important for us
Experience in administration and support of virtualization environments based on KVM in production.
Proven experience in implementation, operation and troubleshooting of virtualization platforms.
Advanced knowledge of Linux system administration.
Strong understanding of KVM virtualization stack, including QEMU and libvirt.
Experience diagnosing complex issues related to performance, capacity, availability and stability.
Practical knowledge of networking in virtualized environments: bonding (link aggregation), VLANs or VXLANs, firewall rules (iptables).
Key skills and expected impact
Strong documentation practices and contribution to technical procedures and operational best practices.
Optimization of performance, stability and efficiency of the virtualization platform.
Ability to lead root cause analysis and propose technical improvements to reduce incident recurrence.
Resolution of complex and critical incidents escalated by the operations team.
Contribution to automation to reduce manual and repetitive operational tasks.
Technical judgment in validating new solutions, designs and changes impacting the virtualization layer.
Acting as a technical reference, contributing to raising the overall technical level with practical knowledge and global vision.
Nice to have
Experience with infrastructure orchestration platforms such as OpenStack, CloudStack or similar.
Experience in cloud providers, IaaS platforms or large-scale infrastructure environments.
Knowledge of centralized storage (NFS, iSCSI) and distributed storage (Ceph).
Experience in high availability, capacity planning and resource optimization.
Familiarity with observability, monitoring and performance analysis tools.
Experience with automation tools such as Ansible.
Participation in architecture evolution, standardization or platform improvement projects.
Ability to document technical decisions and communicate clearly in multidisciplinary environments.
Tools
Virtualization: KVM, QEMU, libvirt
Networking: bonding, VLANs, VXLANs, iptables
Storage: NFS, iSCSI, Ceph
Orchestration: OpenStack, CloudStack or similar
Automation: Ansible or similar
Observability & performance: monitoring and performance analysis tools
Collaboration & documentation: technical procedures and operational documentation
