Architecting Test Systems :
- Architect test frameworks and infrastructure for validating microservices and infrastructure components in multi-cluster and hybrid-cloud environments.
- Oversee the design of complex test scenarios simulating production-like workloads, resource scaling, failure injection, and recovery across distributed clusters.
Automation & Scalability :
Spearhead the development of scalable and maintainable test automation integrated with CI / CD (Jenkins, GitHub Actions, etc.).Leverage Kubernetes APIs, Helm, and service mesh tools to build comprehensive automation coverage, including system health, failover behavior, and network resilience.Promote test infrastructure-as-code and drive IaC forward on the team making sure the infrastructure code is repeatable, extensible and reliable.Technical Proficiency :
Deep understanding of Kubernetes internals, cluster lifecycle management, Helm, service meshes (e.g., Istio or Linkerd), and network policies.Strong scripting and automation capabilities (Python, Pytest, Bash, etc.).Familiarity with observability stacks (Prometheus, Grafana, Jaeger), Kubernetes security (RBAC, secrets management), and performance benchmarking tools (e.g., K6).Solid grounding in cloud architecture (AWS, Azure, GCP), infrastructure provisioning, and containerized CI / CD.Moderate to advanced linux knowledge and proficiency is required : Bash scripting and debugging, systemd / logs, networking / firewalling / routing, certificate / PKI management, containers (Docker / containerd), and Kubernetes tooling (kubectl / Helm with OCI registries, GitOps / Flux) to install, test, and troubleshoot multi-cluster & Scalability :Spearhead the design and development of highly scalable, maintainable test automation systems, seamlessly integrated into CI / CD pipelines (Jenkins, GitHub Actions, GitLab, ArgoCD).Leverage Kubernetes APIs, Helm charts, and service mesh frameworks (Istio, Linkerd) to enable full automation coverage for system health monitoring, network resilience testing, failover validation, and scaling scenarios.Advocate and implement Test Infrastructure-as-Code (IaC), ensuring all test systems are repeatable, auditable, extensible, and reliable. Drive the adoption of GitOps practices for test environments.Technical Proficiency :
Kubernetes & Cloud Expertise : Deep understanding of Kubernetes internals, cluster lifecycle management, networking policies, Helm, and service meshes, combined with hands-on experience across major cloud platforms (AWS, Azure, GCP).Observability & Monitoring : Skilled in designing observability pipelines using Prometheus, Grafana, Jaeger, with strong focus on proactive monitoring, tracing, and alerting for distributed systems.Scripting & Automation : Proficient in Python (Pytest, automation frameworks), Bash scripting, and DevOps toolchains, ensuring streamlined automation and test orchestration.Security & Reliability : Familiar with RBAC, PKI management, secrets handling, and network security policies for production-grade Kubernetes clusters.System Benchmarking : Experienced in load and performance benchmarking using K6, Locust, and custom benchmarking harnesses to validate system throughput, latency, and resilience under stress.Linux & System Proficiency : Solid grounding in Linux administration, including systemd management, firewalling / routing, container runtimes (Docker, containerd), networking diagnostics, and log debugging.(ref : hirist.tech)