
Quality and Reliability Engineer
- Gauteng
- Permanent
- Full-time
The Quality & Reliability Engineer (QRE) ensures the company delivers stable, secure, and high-quality software at pace. As a bridge between development, QA, and operations, the QRE owns our CI/CD pipelines, enforces quality gates, monitors reliability, and safeguards the platform against regressions, downtime, and technical debt.
Key Responsibilities
1. CI/CD & Deployment
✓ Maintain and improve continuous integration and delivery pipelines (GitHub Actions / Heroku).
✓ Automate test, linting, type-checking, and coverage enforcement gates.
✓ Deploy approved changes to staging and production environments.
✓ Own rollback procedures and ensure safe release practices (e.g., feature flags, canary rollouts).
2. Observability & Incident Response
✓ Monitor production health (Sentry, logs, performance dashboards).
✓ Track SLIs/SLOs (e.g., error rates, mean time to recovery (MTTR), change failure rate (CFR)) and report reliability metrics monthly.
✓ Prepare runbooks for common incidents; support the incident commander rotation.
3. Test Infrastructure & Quality Gates
✓ Maintain and evolve automated test suites (unit, integration, acceptance).
✓ Own the flaky-test backlog and the tooling for test fixture and data generation.
✓ Collaborate with developers to ensure code is testable, isolated, and regression-safe.
4. Security & Platform Hardening
✓ Integrate security scans, dependency checks (e.g., Dependabot), and audit trail features into CI/CD.
✓ Partner with the Security Champion to ensure releases meet internal standards and SOC 2 requirements.
✓ Contribute to infrastructure security posture (e.g., IAM scoping, key management, release audit logs).
5. Developer Experience & Release Tooling
✓ Improve developer productivity through tooling, sandbox environments, and local parity.
✓ Reduce cycle time from PR to production by identifying bottlenecks in the build or test stages.
Requirements:
✓ Strong experience with CI/CD tooling (e.g., GitHub Actions, CircleCI, GitLab CI).
✓ Familiarity with cloud platform pipelines (e.g., Heroku Pipelines).
✓ Proficient in Python and shell scripting (bonus: Django familiarity).
✓ Experience maintaining test automation frameworks.
✓ Comfort with observability tools (e.g., Sentry).
✓ Track record of delivering reliable, secure systems in a SaaS or FinTech environment.
Knowledge/Experience in the following areas will be beneficial, but not required:
✓ Experience with feature flag systems (e.g., Unleash, LaunchDarkly).
✓ Understanding of SOC 2 / ISO 27001 controls in CI/CD.
✓ Exposure to data privacy and multi-tenant architectures.
✓ Experience running post-mortems and tracking incident action items.
Metrics you will own:
✓ CI duration and reliability.
✓ Change failure rate (CFR) and mean time to recovery (MTTR).
✓ Defect escape rate and flaky test volume.
✓ Sentry alert volume and resolution latency.
✓ Release cadence and deployment success rate.
Traits we value:
✓ Relentlessly pragmatic: You optimise for stability over perfection.
✓ Systems thinker: You see failure modes before they happen.
✓ Quality advocate: You know tests are not overhead; they're leverage.
✓ Security conscious: You think in terms of blast radius and defence-in-depth.
✓ Quietly powerful: You make every engineer faster without becoming a bottleneck.