Role Expectations :
- Execute and maintain SOPs for production operations, onboarding, and integration support.
- Handle incident response, troubleshoot system and data issues, and ensure timely resolution.
- Support partner configurations, API setups, and access management processes.
- Monitor systems proactively to identify and prevent outages or performance degradation.
- Collaborate with engineering to analyze root causes and build long-term automation solutions.
- Develop and maintain internal tools, scripts, and dashboards to improve operational efficiency.
- Ensure proper documentation, audit trails, and process compliance.
Must Have Skills : Technical Foundation
Strong hands-on experience with SQL (PostgreSQL / MySQL) for data queries and reconciliation.Good understanding of REST APIs, JSON, and using tools like Postman / Insomnia for testing and debugging.Proficiency in Linux command line, shell operations, and basic networking (DNS, IPs, routing)Scripting Automation
Working knowledge of Python or Bash for automation and scripting small utilities.Experience in automating repetitive tasks such as data syncs, API triggers, and report generation.Monitoring and Incident Management
Practical exposure to New Relic, Grafana, Coralogix, CloudWatch, or equivalent for observability and alerting.Familiarity with incident management tools like Jira Service Management, PagerDuty, Zenduty, or similar.Ability to interpret logs, analyze system behavior, and identify production issues quicklyCloud and InfraExperience working on AWS (EC2, S3, CloudWatch, IAM) or GCP environments.Basic understanding of CI / CD workflows using Jenkins, GitHub Actions, or GitLab CI.Exposure to Kubernetes / Docker for troubleshooting and containerised environments.Soft SkillsStrong ownership mindset with analytical problem-solving.Effective communication and coordination skills with cross-functional teams.Ability to manage priorities under operational pressure.Good to have skills
System and Knowledge IntegrationExperience with Kong Gateway, Cloudflare, or similar API gateway tools.Exposure to message queues like Kafka, RabbitMQ, or SQS.Familiarity with fintech or insurance platforms, including claims or financial reconciliation systems.Automation and Deployment ProcessKnowledge of Git, CI / CD pipelines, and version control best practices.Experience building Ops automation scripts or tools using Python frameworks (e.g., FastAPI, Flask).Awareness of security best practices, API authentication (JWT, OAuth2), and secret management tools (AWS Secrets Manager, Vault).Reporting and DataExperience with reporting automation, data validation scripts, or ETL workflows.Familiarity with Excel / Google Sheets automation and data visualization tools.Process and CollaborationExposure to ITIL processes (incident, problem, and change management).Ability to document SOPs, RCA reports, and create internal knowledge base entriesQualifications
Bachelor’s degree in Computer Science, Information Technology, or equivalent experience.2–4 years of experience in Technical Operations, Platform Support, or DevOps roles.Experience working in Fintech, Insurtech, SaaS, or other large-scale distribute systems preferred.