CDI2 days ago

Observability Lead

Greenmist
Accra (Ghana)

Job Description

Job Decription: We are seeking an experienced and proactive Observability Lead to take ownership of the visibility, reliability, and performance monitoring of all production systems across the organisation. This role is responsible for ensuring that infrastructure, applications, databases, and critical services are fully monitored in real time, enabling early issue detection, rapid incident response, and continuous service improvement. The ideal candidate will build a strong observability culture by implementing best-in-class monitoring, alerting, logging, and performance management practices. You will work closely with Engineering, DevOps, Security, Product, and Support teams to maintain highly available and resilient systems in a fast-paced fintech environment. Responsibilities 1. Observability Strategy & Ownership Develop and lead the company-wide observability strategy across infrastructure, applications, cloud environments, databases, and internal services. Establish monitoring standards, frameworks, and governance for all production workloads. Ensure real-time visibility into system health, performance, availability, and capacity. Build a proactive reliability culture through data-driven monitoring practices. 2. Monitoring & Alerting Management Ensure 100% monitoring coverage across all critical production services. Design, configure, and maintain dashboards, alerts, logs, metrics, and distributed tracing systems. Continuously optimise alert thresholds to reduce noise and eliminate false positives. Maintain centralised monitoring systems accessible to relevant teams. 3. Incident Detection & Operational Response Ensure incidents are detected internally before customer impact whenever possible. Lead operational response during outages, degradations, and system anomalies. Coordinate cross-functional teams during incident resolution. Drive post-incident reviews, root cause analysis (RCA), and corrective action plans. 4. Performance Monitoring & Optimization Software Track system latency, throughput, resource utilization, and application performance metrics. Identify performance bottlenecks and collaborate with engineering teams on remediation. Support load readiness, scaling decisions, and capacity planning. Improve platform stability and service responsiveness over time. 5. Reporting & Insights Produce weekly and monthly reports on system health, uptime, incident trends, and risk areas. Provide executive dashboards for leadership visibility into platform performance. Use operational data to recommend improvements and investment priorities. 6. Collaboration & Leadership Partner with Engineering, DevOps, Security, and Product teams to embed observability into all deployments. Support teams with troubleshooting, diagnostics, and production readiness reviews. Mentor engineers on monitoring best practices and observability tooling. Act as the subject matter expert for reliability monitoring and operational intelligence. Requirements Education & Experience Data Management Bachelor’s degree in Computer Science, Information Technology, Engineering, or related field. 5+ years of experience in Observability, Site Reliability Engineering (SRE), DevOps, Infrastructure Monitoring, or Production Operations. Experience in fintech, payments, telecom, banking, or mission-critical environments preferred. Technical Skills Hands-on experience with observability tools such as Grafana, Prometheus, Datadog, New Relic, Signoz, ELK Stack, Splunk, AppDynamics, or similar. Strong understanding of metrics, logs, traces, and alerting systems. Experience with Linux servers, cloud platforms (AWS, Azure, GCP), and container environments. Knowledge of networking, databases, APIs, and distributed systems. Scripting skills in Python, Bash, or similar languages are an advantage. Soft Skills Strong analytical and troubleshooting ability. Calm under pressure during incidents and outages. Strong communication and stakeholder management skills. Leadership mindset with ownership and accountability. Close attention to detail and a continuous improvement focus. What Success Looks Like in This Role Network Monitoring & Management Production issues are identified before customers experience disruption. Leadership has real-time confidence in platform health and uptime. Engineers rely on strong dashboards and actionable alerts. System performance continuously improves through data-driven action. Downtime and recurring incidents reduce significantly over time.

---

**

[Click the Apply button below to see the contact details]

Expert Application Advice

Technical proficiency — Greenmist expects hands-on mastery of Python, DevOps. Don't just list them: describe a concrete project where you used them and the outcome delivered.

Positioning — Your cover letter must answer one question: why YOU for THIS specific role right NOW? Avoid generic templates — one sentence on what you specifically bring beats three generic paragraphs.

Measurable achievements — At this level, back every skill with a precise example and a quantified result. Prepare 2-3 strategic questions about this organization's current challenges — candidates who anticipate difficulties are consistently rated higher.

🎯 Make your application ATS-ready

ATS (Applicant Tracking Systems) are the software recruiters use to automatically filter CVs before any human reads them. Our CV builder is specifically designed to pass these filters — and it takes under 3 minutes.

Create my ATS CV →
Career advice powered by Taf4All

Ready to apply?

Safety Reminder

Never pay money to get an interview. Taf4All will never contact you to request application fees.

You might also be interested in

JO

QUALITY CONTROL LEAD

JobPilotEastern Region, Ghana

QUALITY CONTROL LEAD chez JobPilot à Eastern Region, Ghana.

Freelance
il y a 27 jours
JO

HR LEAD

Role Overview HR Lead – for someone passionate about building highperforming teams, driving operational excellence, and

CDI
il y a 16 jours
A

OPERATIONS LEAD

Auj.

🧠 Profil recherché Job Description ROLE PURPOSE: The Operations Lead is the operational nerve center of FON Packaging V

CDI
il y a environ 23 heures
EN

Operations Lead

🧠 Profil recherché Operations Lead Stratum Poultry Group LTD Management & Business Development 1 week ago Easy appl

CDI
il y a 3 jours