Job descriptionAbout the Company GoCharting is a next-generation financial analytics and trading platform built for data-driven traders and institutions. It delivers institutional-grade tools through a seamless, cloud-native web experience, enabling users to analyze, strategize, and execute trades across asset classes including equities, futures, options, commodities, forex, and cryptocurrencies. With a strong focus on real-time market data, order flow analytics, and performance, GoCharting operates at the intersection of trading infrastructure and distributed systems—bringing low-latency, high-throughput capabilities to the browser. Role Overview: As a Senior DevOps Engineer, you will be the architect of our infrastructure's reliability, scalability, and security. You will transition our operations toward a pure GitOps model, ensuring that our systems are self-healing, observable, and highly automated. You will be responsible for the entire "plumbing" of the engineering organization, from low-level Linux administration and networking to high-level Kubernetes orchestration and multi-environment deployment strategies. Key Responsibilities - Infrastructure as Code (IaC): Architect and maintain global infrastructure using Terraform, ensuring all cloud resources are version-controlled and reproducible. - Configuration Management: Utilize Ansible for automated server provisioning and application configuration management across various environments. - Kubernetes Orchestration: Manage production-grade Kubernetes clusters and Docker containerization strategies, focusing on resource optimization and security hardening. - Continuous Delivery & GitOps: Design and implement automated CI/CD pipelines. Lead the transition to GitOps workflows using tools like ArgoCD or Flux to automate application deployments. - Deployment Strategies: Implement and manage advanced deployment patterns, including Blue-Green and Canary releases, to ensure zero-downtime updates and easy rollbacks. - Observability Mastery: Build and maintain a comprehensive observability stack: - Metrics: Prometheus, Signoz & Grafana. - Logging: Loki or ELK Stack. - Tracing: OpenTelemetry (OTel) for distributed system insights. - Secure Networking: Manage secure remote access and inter-site connectivity using WireGuard-based VPNs. - SRE & Reliability: Establish Service Level Objectives (SLOs) and Error Budgets. Lead Incident Response and post-mortem analyses to drive systemic improvements. - Disaster Recovery: Own the backup and recovery strategy, implementing automated tools to ensure data integrity and rapid restoration capabilities. Core Requirements - Automation: Expert-level Shell Scripting (Bash/Zsh) and a deep understanding of Linux Administration (Process management, networking stack, storage). - Orchestration & Tooling: Proven experience with Kubernetes, Helm, and Terraform. - Networking: Solid understanding of TCP/IP, DNS, Load Balancing, and modern VPN protocols like WireGuard. - Observability: Hands-on experience configuring exporters, dashboards, and alerting rules in a cloud-native environment. - Cloud Environments: Experience managing infrastructure across Multiple Environments (Dev, Staging, Prod) ensuring parity and isolation. - Chaos Engineering: Experience testing system resilience by injecting controlled failures. - Toil Reduction: A mindset focused on "automating yourself out of a job" by eliminating repetitive manual tasks. - Security First: Implementation of Secret Management (e.g., HashiCorp Vault) and "Shift Left" security in pipelines. Preferred Skills - Certifications such as CKA (Certified Kubernetes Administrator) or HashiCorp Certified: Terraform Associate. - Familiarity with database administration and high-availability (HA) configurations for PostgreSQL/Redis/Cassandra. - Experience with Infrastructure-as-Code testing frameworks (e.g., Terratest). Compensation & Benefits - Competitive salary aligned with experience and capability - Opportunity to work on high-performance, real-time systems at scale - High ownership, minimal bureaucracy, and direct impact on product GoCharting is an equal opportunity employer. We are committed to building a diverse and inclusive workplace and welcome applicants from all backgrounds.