Service Reliability Engineer

Job Locations LT-Remote
Posted Date 12 hours ago(26/08/2025 11:08)
Job ID
2025-3253
# of Openings
1
Position Class
Employee
Position Type
Full time
Org Level 2
Product Engineering

Who we are

Join the fintech revolution with Mambu, the leading SaaS cloud banking platform. We're on a mission to make banking better for a billion people. Explore exciting career opportunities and help shape the future of financial services. Learn more here.

About the team

 

Reliability is at the core of our product, and our Support Engineering & Reliability Team (SERT) ensures our customers experience a stable, scalable, and resilient platform every day.

 

As SRE at Mambu, you'll be on the front lines of reliability, working at the intersection of engineering, operations, and customer success. You'll make systems reliable, scalable, and observable, while continuously improving the way we work. This is a role for a resilient, customer-obsessed professional who thrives on solving complex, live production incidents.

What you’ll do

  • Own Live Production Incidents: You'll be the first responder for production issues impacting Mambu's mission-critical workloads. Your excellent troubleshooting skills will be crucial for investigating and resolving issues quickly, ensuring minimal disruption and fast recovery for our customers.

  • Design and Define Observability: Design, maintain, and evolve monitoring, alerting, and logging to catch issues before customers do. You'll have a strong understanding of what "good" looks like for monitoring API performance, and you'll define and implement the standards that ensure our platform's health.

  • Lead Incident Communication: You will be the direct point of contact for customers during critical incidents. Your exceptional communication and analytical skills will be essential for balancing technical priorities with customer impact and providing clear, confident updates.

  • Empower and Automate: You will build and document robust knowledge bases, lead training sessions and automate repetitive support processes with an AI-first mindset.

  • Champion Resilience and Operational Excellence: Partner with product and infrastructure teams to embed reliability, capacity management, and operational excellence into how we build software. You will advocate for best practices like blameless post-mortems, SLO/SLI design, and incident command.

What you’ll bring

  • Exceptional troubleshooting skills and hands-on experience resolving complex production incidents in a mission-critical environment.

  • Deep understanding of observability stacks (Prometheus, Grafana, ELK, OpsGenie, Datadog, etc.) and demonstrated experience defining and implementing alerting and monitoring setups.

  • Excellent communication and customer-facing skills, with the ability to manage direct customer conversations during high-stakes situations.

  • Experience with public cloud services (AWS, GCP, or Azure), distributed systems, and cloud-native applications.

  • Proficiency in scripting/programming (Bash, Python, Go, or Java), along with software engineering skills in Java for troubleshooting and debugging production issues.

  • An AI-first mindset and experience leveraging AI to proactively identify problems, automate support processes, and optimize workflows.

  • SQL knowledge for querying, troubleshooting, and performance tuning.

  • Familiarity with the software delivery lifecycle, CI/CD practices, and a DevOps culture.

Nice to Have (or Grow Into)

  • Knowledge of version control systems (Git/GitHub).

  • Passion for automation, resilience engineering, and scaling operations.

  • Certification with one of the cloud providers (AWS, Google Cloud or Azure).

What you’ll get

Join us to shape the future of banking, where your professional growth is equally as valued as your personal well-being. 

  • Competitive base salary

  • Company equity for all

  • Learning and development opportunities

  • Hybrid/Remote working (location dependant)

  • 30 day working abroad 

  • 4 week paid sabbatical after 5 years service

  • Additional benefits based on location

Let's connect!

Follow Mambu on LinkedIn for the latest Fintech trends and success stories. Connect with us on Facebook, Instagram, and YouTube to experience our vibrant culture. Explore our mission, values, and the world we're building at mambu.com/careers. Check out our Insights Hub for industry insights, Mambu blogs, webinars, and upcoming events.

 

As part of the recruitment (or HR onboarding) process, you will be required to obtain authorized criminal background and credit screening results, as well as be queried against a sanctions/anti-money-laundering/counter terrorism financing/politically exposed persons screening service and your employment is conditional upon approval of these results.

 

At Mambu, we encourage all interested candidates to apply, even if they don't meet every listed qualification, as we value diversity and recognize that experience doesn't always perfectly align with job descriptions. We are committed to providing equal opportunities for applicants with disabilities; if you need assistance during the application process, please contact talent.acquisition@mambu.com.

 

Options

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed

Need help finding the right job?

We can recommend jobs specifically for you! Click here to get started.