Position Description :
Works independently under limited supervision and applies subject-matter knowledge in Applications Development. Possesses sufficient knowledge and skills to deal effectively with issues and challenges within the field of specialization and to develop simple application solutions. Second-level professional with direct impact on results and outcomes.
Your future duties and responsibilities :
Technical & Behavioral Competencies
Qualification & Experience :
5+ years in architecture or senior engineering roles, with 3+ years designing and operating workloads on Cloud in production environments.
Mandatory Qualifications
- Strong knowledge of Cloud services and constructs, such as VPC, VSI, ROKS / OpenShift, Kubernetes, Cloud Databases, Object Storage (COS), Monitoring and Log Analysis.
- Proven experience designing for high availability, disaster recovery, and continuity, including multi-zone / region architectures, backup / restore, and RTO / RPO planning.
- Hands-on experience with infrastructure-as-code and automation (Terraform, IBM Cloud Schematics), CI / CD pipelines, and GitOps workflows.
- Solid foundation in Linux, containers, networking (firewalls, load balancers, DNS, TLS) and platform operations.
- Strong understanding of security-by-design : IAM / RBAC, least privilege, workload identity, encryption, key management, vulnerability management, and policy enforcement.
- Demonstrated ability to perform architecture reviews, document decisions (ADR), assess risks, and present trade-offs to senior stakeholders.
- Excellent communication and facilitation skills; able to conduct training sessions and influence cross-functional teams without direct authority.
Preferred Qualifications
- Experience with SRE practices, reliability engineering, capacity planning, cost optimization, and performance tuning.
- Familiarity with compliance frameworks and internal control environments (e.g., ISO, SOC 2) and aligning solutions to internal group rules and standards.
- Exposure to API gateways, eventing / queues, and integration patterns.
- Experience implementing observability stacks and defining SLOs / SLIs.
- Background in incident management, postmortems, and resilience testing.
- Certifications such as IBM Certified Solution Architect – Cloud, Red Hat OpenShift, CKA / CKAD, TOGAF, or ITIL.
Responsibilities
- Review and provide actionable feedback on proposed solution architectures, ensuring compliance with internal policies, security baselines, observability standards, and production-readiness criteria.
- Validate designs against application continuity objectives, including RTO / RPO, HA, backup, DR strategy, failover / failback, chaos testing readiness, and runbook completeness.
- Publish and maintain architecture principles, standards, guardrails, and reference patterns for developers and infrastructure teams; drive adoption across squads.
- Guide implementation teams on provisioning and automation using infrastructure-as-code and GitOps practices (e.g., Terraform / Schematics, pipelines), including environment strategy, naming, tagging, secrets, and access control.
- Define production non-functionals and acceptance criteria across reliability, performance, capacity, cost efficiency, security, compliance, networking, and operability.
- Partner with Architecture, Security, SRE, and Application teams to resolve design gaps; propose constructive alternatives and trade-offs with clear rationale.
- Represent Production in application design discussions, vendor evaluations, change advisory boards, and architecture review committees; clearly articulate Production's position, risks, and recommended mitigations.
- Develop and deliver information sessions, brown bags, and enablement on key initiatives, new standards, and platform capabilities.
- Create and evolve application reference architectures with policy enforcement and controls.
- Ensure observability by design, including logging, metrics, tracing, alerting, SLOs / SLIs, and dashboards; drive readiness checks prior to go-live.
- Contribute to incident postmortems and problem management with architectural remediations and resilience patterns.
- Track emerging internal cloud features and industry best practices; incorporate learnings into standards and roadmaps.
Skills :
- English
- Bitbucket
- Git
- Kubernetes
- Oracle
- Python