IRESS

Senior Support & Reliability Engineer

Johannesburg Full time

See yourself being part of a large, transformational change? This could be the role for you!


At Iress, we make things happen

We believe technology should help people perform better every day. Since our beginning in 1993, people across financial services have trusted us to take their performance to the next level. From the world’s most established financial brands to new and disruptive players, we help improve every aspect of our clients’ businesses so that they can work better, every day. Iress is one of Australia’s largest technology companies and employs more than 1900 people across Asia Pacific, North America, Africa and UK & Europe.

Build your career at Iress!

The Support Reliability Engineer plays a vital role in ensuring the stability, performance, and reliability of Iress systems used by clients and internal teams. Success in this role means proactively identifying and resolving production issues, automating processes to improve service resilience, and collaborating across teams to reduce incidents and downtime. This role bridges support and engineering, ensuring smooth operation and rapid recovery when issues arise. The ideal candidate is a problem-solver with strong technical acumen, communication skills, and a continuous improvement mindset

Some of the awesome things you would involved with:

Accountabilities & Deliverables

  • Monitor and maintain the health and performance of production systems and environments.
  • Investigate, troubleshoot, and resolve complex system and application issues.
  • Collaborate with product and platform engineering teams to identify root causes and drive permanent fixes.
  • Implement automation to improve incident response, monitoring, and reporting.
  • Develop and maintain tools, scripts, and dashboards that enhance system observability.
  • Participate in incident management and post-incident reviews to ensure continuous improvement.
  • Contribute to system performance tuning, capacity planning, and reliability improvements.
  • Document operational procedures, playbooks, and best practices for system reliability.
  • Support service transition and release activities to ensure smooth deployments.
  • Identify recurring issues and implement sustainable solutions to reduce technical debt

Key Relationships

  • Product Engineering teams
  • Service Operations and Application Support teams
  • Site Reliability Engineering (SRE) and Platform Engineering teams
  • Client Service and Delivery teams
  • Information Security and Risk teams

What you will bring:

Qualifications and Certifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (required)
  • Minimum of 3–5 years’ experience in a production support, DevOps, or reliability engineering role (required)
  • Proficiency in Windows Server administration (preferred)
  • Proficiency in Linux Server Administration (desirable)
  • Knowledge of Networking
  • Certification in AWS, Azure, or Google Cloud Platform (desirable)
  • Proficiency in databases, certification in MSSQL SQL  (desirable)
  • Experience with monitoring tools such as Datadog, Prometheus, or Grafana 

Professional Skills and Competencies

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (required
  • Incident Management: Manages service incidents and ensures timely resolution.
  • Problem Management : Identifies root causes and implements corrective actions.
  • Automation : Develops scripts and tools to automate operational tasks.
  • Systems Integration and Build : Contributes to integrating systems and maintaining deployment pipelines.
  • Service Level Management: Monitors and reports on service performance against SLA
  • Collaborates: Works effectively across teams to solve problems and share knowledge.
  • Manages Self: Demonstrates accountability, prioritisation, and attention to detail.
  • Adapts: Responds flexibly to change and operational challenges.
  • Thinks Analytically: Identifies patterns, root causes, and opportunities for improvement.
  • Communicates Effectively: Shares information clearly and timely with technical and non-technical stakeholders.

Why work with us?

At Iress, we believe technology should help people perform better every day. Since our beginning in 1993, people across financial services have trusted us to take their performance to the next level. From the world’s most established financial brands to new and disruptive players, we help improve every aspect of our clients’ businesses so that they can work better, every day. Iress is one of Australia’s largest technology companies and employs more than 1900 people across Asia Pacific, North America, Africa and UK & Europe.

Our employee benefits are some of the best in the industry, supporting our people at every life stage. It means we work smarter, happier, and healthier with long weekends, flexible ways of working, generous parental leave, opportunities to participate in our local community initiatives and a connected team culture.

Iress is committed to fostering a welcoming and inclusive culture. We strongly believe that diversity is what makes our teams and our products succeed. Our people have different experiences, skills, perspectives and beliefs and everyone’s uniqueness is valued and celebrated. Even if you feel you don’t meet all of the requirements of the role, we would still like to hear from you!

#LI-Hybrid


Employment Type

Employee

Time Type

Full time