nPloy Logo

Service Reliability Engineer

Logo of Endava

Endava

Hybrid

Hybrid

Regular employment

1 - 6 years of experience

Full Time

Bucharest, Romania

Responsibilities

Company Description

Technology is our how. And people are our why. For over two decades, we have been harnessing technology to drive meaningful change.
By combining world-class engineering, industry expertise and a people-centric mindset, we consult and partner with leading brands from various industries to create dynamic platforms and intelligent digital experiences that drive innovation and transform businesses.
From prototype to real-world impact - be part of a global shift by doing work that matters.

Job Description

Infrastructure Support: 

  • Maintain and manage the organization's IT infrastructure, networks, and storage systems. 
  • Monitor system performance and troubleshoot issues to ensure high availability and performance. 
  • Implement and manage cloud infrastructure services, ensuring scalability and reliability. 

Application Support: 

  • Provide technical support for business-critical applications, addressing issues and ensuring minimal downtime. 
  • Collaborate with development teams to deploy new applications. 
  • Perform root cause analysis for application failures and implement preventative measures. 

Reliability Engineering: 

  • Apply Site Reliability Engineering (SRE) principles to improve system reliability and performance. 
  • Develop and maintain automated monitoring, alerting, and reporting tools. 
  • Utilise infrastructure as code (IaC) practices to automate and manage infrastructure deployments. 
  • Conduct regular availability and capacity planning and performance tuning. 

Qualifications

  • Proven experience in IT infrastructure management and support; particularly with Windows Server and VMware.  
  • Working within an ITIL framework. 
  • Strong understanding of application support processes and tools.  
  • Knowledge of cloud services (e.g., Azure) and cloud infrastructure management.  
  • Proficiency in scripting and automation (e.g., Bash, Python, Powershell, Kusto Query Language (KQL), Terraform).  
  • Experience with monitoring and logging tools (e.g., Dynatrace, VeeamOne, Azure Monitor).  
  • Strong problem-solving skills and ability to work in a fast-paced environment. 

Additional Information

Discover some of the global benefits that empower our people to become the best version of themselves:

  • Finance: Competitive salary package, share plan, company performance bonuses, value-based recognition awards, referral bonus;   
  • Career Development: Career coaching, global career opportunities, non-linear career paths, internal development programmes for management and technical leadership;
  • Learning Opportunities: Complex projects, rotations, internal tech communities, training, certifications, coaching, online learning platforms subscriptions, pass-it-on sessions, workshops, conferences;
  • Work-Life Balance: Hybrid work and flexible working hours, employee assistance programme;
  • Health: Global internal wellbeing programme, access to wellbeing apps;
  • Community: Global internal tech communities, hobby clubs and interest groups, inclusion and diversity programmes, events and celebrations.

Our diversity makes us stronger - it drives meaningful change and enables us to build innovative technology solutions. We are committed to creating an inclusive community where all of us, regardless of background, identity, or personal characteristics, feels valued, respected, and free from discrimination. As an equal opportunity employer, we welcome applications from all individuals and base hiring decisions on merit, skills, qualifications, and potential.

Required skills

Automation
Development Tools
Monitoring
Support
VMware
Windows Server
Scripting
SRE principles
IT Infrastructure
Cloud Services
ITIL framework
English
Romanian
Job posted 2 days ago

or

to apply.