Description
Were hiring a handson (80% build / operate) AWS Platform Engineer to design, harden, and run our cloud platform with an emphasis on Amazon Connect contact centers, Lex V2 conversational bots, and Lambda (Python)based integrations.
In addition to core platform ownership, youll bring software engineering skills to deploy and host Node.js applications on AWS and to build, package, and ship Salesforce apps through modern CI / CD.
Youll own reliability and scalability end to endinstrumenting, automating, and governing a multiaccount AWS environment via Infrastructure as Codewhile enabling new capabilities (including Bedrockpowered AI) safely and at speed.
Key Responsibilities
Platform Engineering & Infrastructure :
- Design, provision, and operate AWS foundations (multiaccount landing zone, VPCs, subnets, routing, security controls, IAM guardrails) using Terraform (and / or CDK) and automated change management.
- Build a paved road of reusable Terraform modules, CI / CD templates, and runbooks for serverless and containerized workloads (Lambda, API Gateway, Step Functions, EventBridge, ECS / EKS as needed).
- Implement robust secrets, encryption, and config management (KMS, Secrets Manager, SSM Parameter Store) and enforce leastprivilege IAM.
- Stand up secure, scalable runtimes for application hostingALB / NLB, CloudFront, S3 static hosting, ECR, EKS / ECS, Aurora / RDS, DynamoDB, ElastiCache, WAF / Shield, and Cognito / identity patterns.
Application Platform & Software Engineering (Node.js & Salesforce)
Design, deploy, and operate Node.js services and web apps on AWS using serverless (Lambda) and containerized approaches (ECS / EKS), including packaging, versioning, health checks, autoscaling, and blue / green or canary deployments.Establish engineering standards for Node.js (TypeScript / JavaScript), including linting, unit / integration tests (e.g. , Jest), dependency management (npm / yarn), and performance profiling / optimization.Build service interfaces and integrations (REST / GraphQL, eventdriven with EventBridge / SQS / SNS), implement resilient retries / backoff, idempotency, and robust error handling.Ship Salesforce apps : manage sourcedriven development with SFDX and / or DevOps Center; handle metadata / unlocked packages, scratch orgs / sandboxes, and automated deployments to higher environments.Implement secure, auditable integrations between AWS and Salesforce (OAuth, Named Credentials, Platform Events, Salesforce Functions / Heroku where appropriate).SRE, Reliability & Observability
Define and manage SLIs / SLOs for platform, applications, and contact flows; implement error budgets and lead incident response, postmortems, and game days.Build endtoend visibility with CloudWatch, Datadog, and Operata (dashboards, traces, logs, RUM / synthetics, alerting), including custom Connect / Lex metrics, Node.js APM, and business KPIs.Engineer high availability and disaster recovery (multiAZ, crossRegion failover where needed); manage Lambda concurrency, cold's tart mitigation, throttling, and backoff patterns.Amazon Connect & Conversational AI
Architect and implement Amazon Connect solutions : contact flows, queues / routing profiles, hours, quick connects, CTR pipelines, Contact Lensbased analytics, recordings / retention, and realtime / nearrealtime reporting.Build and optimize Lex V2 bots (intents, slots, multiturn, locales), including Lambda (Python) fulfillment / validation hooks, session attributes, and strong error / exception observability.Integrate Connect & Lex with CRMs (e., Salesforce Service / Experience Cloud), ticketing, knowledge bases, and data platforms; instruments and continuously tune AHT, containment, and CSAT.
Leverage AWS Bedrock (where appropriate) for generative call / chat assist, summarization, and routingvia safe, auditable Lambda / Step Functions patterns.Automation, CI / CD & Quality
Own and improve CI / CD (GitHub Actions, Azure DevOps; optionally CodeBuild / CodePipeline) with automated tests, security scans (e.g. , Sonarqube, AIkido), and progressive delivery for IaC and app code.Build pipelines for Node.js (install / test / lint / build / package, image builds to ECR, deploy to Lambda / ECS / EKS) and for Salesforce apps (SFDX / DevOps Center flows, org spinup / teardown, data seeding, test runs).Write clean, testable Python and Node.js for Lambdas, services, tooling, and platform automation (unit / integration tests, packaging, type hints / TypeScript), and codify operational tasks (runbooks, CLI tools).Security, Compliance & Cost
Implement PCIaware integrations (e.g. , payment flows, DTMF redaction / tokenization, vaulting) and ensure logging, retention, and audit readiness (CloudTrail, Config).Embed applicationlevel security : secret handling, dependency / vulnerability scans (npm audit / CodeQL / SAST), leastprivilege roles, and secure egress / ingress controls.Drive FinOps : tagging, budgets, rightsizing, lifecycle policies, and data egress controlsespecially for Connect recordings / CTR / Contact Lens in S3, Athena / Glue analytics, and Kinesis streams.CrossFunctional Impact :
Partner with product, data, and engineering teams to ship reliable features; communicate clearly with both technical and business stakeholders on tradeoffs, incidents, and roadmaps.Support scalable integrations with thirdparty platforms (e.g. , Vtex, Shopify) and PCIcompliant payment providers (e.g. , Stripe).Participate in an oncall rotation with welldocumented runbooks and automated remediation.Requirements : Have Qualifications :
Deep, handson AWS expertise : Amazon Connect, Lex V2, Lambda (Python), API Gateway, Step Functions, EventBridge, S3, DynamoDB / RDS, CloudWatch, IAM, VPC networking, WAF / Shield, KMS.Proven experience designing and operating production Connect environments (contact flows, CTR / Contact Lens analytics, call recording / retention, reporting, routing, telephony integration).Node.js engineering in production : building, deploying, and hosting Node.js services / apps on AWS (Lambda and / or ECS / EKS), with CI / CD, testing, and observability (Datadog / CloudWatch).Salesforce application lifecycle management : SFDX and / or DevOps Center, packaging (metadata / unlocked), scratch orgs / sandboxes, automated deployments, and secure AWSSalesforce integrations.Lex V2 bot design and operation, including Lambdabased fulfillment / validation in Python (boto3), robust error handling, and conversation analytics.Strong Infrastructure as Code with Terraform (modules, workspaces, policy as code, drift detection) and GitOps workflows.SRE skills : SLIs / SLOs, incident management / oncall, postmortems, capacity / perf tuning, high availability, and DR patterns.Observability at scale with Datadog, CloudWatch, and / or Operata (metrics, traces, logs, alerts, synthetics).CI / CD automation (GitHub Actions, Azure DevOps) for both application and IaC pipelines.Experience integrating AWS services with Salesforce Service / Experience Cloud and with PCIcompliant payment platforms (e.g. , Stripe).Strong debugging / rootcause skills across distributed systems; clear written and verbal communication with technical and nontechnical audiences.Nice To Have
Architectural leadership for large's cale, multiRegion AWS platforms; AWS Control Tower / Organizations experience.Exposure to callcenter tech beyond Connect (SIP, SBCs, carrier management) and analytics tooling for contact centers.Experience with AI assistants / agentic patterns leveraging Bedrock; realtime agent assist and summarization in Connect.Containers / Kubernetes (ECS / EKS) and service mesh (where appropriate).Experience with Salesforce Functions and / or Heroku; Copado / Gearset pipelines.TypeScript, NestJS / Express, GraphQL, and featureflag / experiment frameworks.AWS certifications (Solutions Architect, DevOps Engineer, SysOps).Fault Injection Simulator / chaos engineering, and security / compliance frameworks (SOC 2, ISO 27001, PCI DSS).(ref : hirist.tech)
Skills Required
Salesforce, S3, Rds, Step Functions, Dynamodb, Node.js, CDK, Vpc, Datadog, Lambda, Kms, Cloudwatch, Terraform, Amazon Connect, Iam, Waf, Python, Api Gateway, Aws, Azure Devops