Talent.com
AI Tech Architect
AI Tech ArchitectRecro • gurugram, uttar pradesh, in
No longer accepting applications
AI Tech Architect

AI Tech Architect

Recro • gurugram, uttar pradesh, in
21 days ago
Job description

Overview

Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic / GenAI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms.

Responsibilities

  • Define target architectures for agentic systems (planning / reasoning / tool-calling), GenAI / RAG pipelines, and evaluation loops; produce clear design documents with Flow / UML / sequence diagrams and AWS deployment topologies.
  • Size and optimize infrastructure for cost and performance : model throughput / latency, concurrency, autoscaling policies, CPU / GPU needs, memory footprints, vector index sizing, storage / egress, and token budgets.
  • Lead deep-dive debugging and incident resolution : profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar.
  • Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; LangGraph / AutoGen / CrewAI acceptable), tool / function schemas, validation, memory, grounding, and multi-step planning.
  • Architect retrieval and hybrid search systems : ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk.
  • Productionize on AWS using Amazon EKS, S3, SQS / SNS, and AWS Bedrock; integrate identity (Okta / IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs / SLOs and error budgets.
  • Make systems observable : distributed tracing, metrics, and logs using OpenTelemetry and Datadog; standardize dashboards, alerts, and tool / trace replay.
  • Build evaluation and promotion workflows : prompt / flow tests, golden sets, offline batch runs, A / B experiments, regression suites, and rollout gates.
  • Design security and safety controls : threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII / data governance, and audit trails.
  • Define platform standards : reusable SDKs, connectors, CI / CD templates, runbooks, and architecture review checklists.
  • Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs.
  • Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability.

Must Have

  • 7–10 years in software / AI engineering with at least 4+ years building GenAI applications and 2+ years architecting production agentic systems.
  • Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code, fix bugs, and optimize performance-critical paths.
  • Experience with one or more agent frameworks (Semantic Kernel, LangGraph, AutoGen, CrewAI) and function / tool calling with schema and argument validation.
  • Proven design of GenAI / RAG / hybrid retrieval systems using AWS Bedrock, OpenSearch Serverless, or other vector databases; grounding and retrieval evaluation experience.
  • Deep knowledge of AWS architecture : Amazon EKS, Bedrock, S3, SQS / SNS, RDS (SQL Server / PostgreSQL), ElastiCache (Redis), Secrets Manager, IAM / Okta, Kong API Gateway, OpenSearch Serverless, and Datadog.
  • Observability expertise : distributed tracing (OpenTelemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices.
  • Cost and performance engineering mindset : capacity modeling, GPU / CPU sizing, autoscaling (HPA), batching / streaming, caching, and FinOps discipline.
  • Security and safety fundamentals : least privilege, data isolation, policy enforcement, content moderation, jailbreak / PII defenses, and compliance awareness.
  • Excellent technical communication : clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews.
  • Good to Have

  • Multi-agent orchestration patterns : task decomposition, coordinator-worker, human-in-the-loop, graph-based planning.
  • Deep expertise with vector databases and retrieval : OpenSearch Serverless, Pinecone, pgvector, Redis.
  • Evaluation frameworks : red teaming, automated guardrails, regression testing, rollout gates, canary deployments.
  • Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and AuthN / Z best practices.
  • Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering).
  • Knowledge of Kong API Gateway, LaunchDarkly / Flipt for feature management, and NeMo Guardrails for runtime safety.
  • CI / CD exposure (build / test with GitHub Actions, deployments via Terraform / AWS IaC templates).
  • Core Tech Stack (our core; equivalents welcome)

  • Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.x, Alembic, pytest.
  • Amazon EKS, AWS Bedrock, Amazon SQS / SNS, Amazon RDS (SQL Server / PostgreSQL), ElastiCache (Redis).
  • Amazon S3 for storage, Amazon ECR for container images, OpenSearch Serverless for vector storage.
  • AWS Secrets Manager, Okta IAM, NeMo Guardrails, Kong API Gateway.
  • OpenTelemetry + Datadog for observability and monitoring.
  • Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.
  • Create a job alert for this search

    Ai Architect • gurugram, uttar pradesh, in

    Related jobs
    Ai Architect

    Ai Architect

    EvoluteIQ • Gurgaon, Republic Of India, IN
    We at EvoluteIQ believe in the power of transformation.We are committed to building an industry leading technology that will revolutionize the way enterprises conduct business.To make that happen, ...Show more
    Last updated: 14 days ago • Promoted
    Gen AI Architect

    Gen AI Architect

    IGT Solutions • Gurgaon, Haryana, India
    Job Title : Architect - Gen AI, LLM, Big Data Experience : 18+ Years Location : Pune / Gurugram Employment Type : Full-Time / Permanent Job Summary : We are looking for an experienced Architect with ...Show more
    Last updated: 22 days ago • Promoted
    Senior Architect, Generative AI and AI Agent Factory

    Senior Architect, Generative AI and AI Agent Factory

    PepsiCo • Gurugram, Haryana, India
    The Senior Technical Architect – Generative AI and Agent Factory is responsible for leading the end-to-end architecture, design, and strategic enablement of PepsiCo's enterprise-grade GenAI platfor...Show more
    Last updated: 4 days ago • Promoted
    GenAI Architect

    GenAI Architect

    AIONOS • gurugram, uttar pradesh, in
    Job Description We are seeking an experienced GenAI Architect to design and build highly scalable and reliable systems that leverage cutting-edge Generative AI technologies.This role demands expert...Show more
    Last updated: 2 days ago • Promoted
    Data and Ai Solution Architect

    Data and Ai Solution Architect

    Aventra Group • Gurgaon, Haryana, India
    Job Title : Data & AI Solution Architect Work Experience : 12-15 Years Work Location : Bengaluru Project Role Description The Data & AI Architect role is accountable for designing high-level, end-...Show more
    Last updated: 20 hours ago • Promoted • New!
    AI Agent Architect

    AI Agent Architect

    Luxoft • Gurgaon, Haryana, India
    Project Description : We are seeking a hands-on and qualified AI Agent Architect to design and deploy advanced Agentic AI systems—comprising task-specific autonomous tools governed by a master agent...Show more
    Last updated: 16 days ago • Promoted
    AI Tech Architect

    AI Tech Architect

    Recro • Gurugram, Haryana, India
    Own the end-to-end architecture of production AI systems with a strong hands-on bias.You’ll design robust, cost-efficient, and secure agentic / GenAI solutions on AWS. Part of your job will be to unbl...Show more
    Last updated: 4 days ago • Promoted
    Technical Architect - AI

    Technical Architect - AI

    Cvent • Gurgaon, Haryana, India
    Cvent is a leading meetings, events, and hospitality technology provider with more than 4,800 employees and ~22,000 customers worldwide, including 53% of the Fortune 500. Founded in 1999, Cvent deli...Show more
    Last updated: 29 days ago • Promoted
    AI Architect

    AI Architect

    Mulya Technologies • gurugram, uttar pradesh, in
    We are a US based Stealth mode Start-up.Hyderabad / Bangalore / Remote ( any where in India ).We unify the processes used in Semiconductor and Hardware Systems design - thus reducing bugs, improvin...Show more
    Last updated: 30+ days ago • Promoted
    AI Architect

    AI Architect

    EvoluteIQ • Gurgaon, Haryana, India
    Life at EvoluteIQ We at EvoluteIQ believe in the power of transformation.We are committed to building an industry leading technology that will revolutionize the way enterprises conduct business.To...Show more
    Last updated: 14 days ago • Promoted
    Data Architect – AI Platform

    Data Architect – AI Platform

    Birdeye • Gurugram, Haryana, India
    Job Title : Data Architect – AI Platform.Department : Engineering / Data Platform.This role is central to building the.You’ll architect and implement the systems that transform raw data into.You won’...Show more
    Last updated: 4 days ago • Promoted
    AI Architect

    AI Architect

    KPMG India • Gurugram, Haryana, India
    KPMG Delivery Network India (KDNI) is a diverse entity spread across multiple cities in India.We are an important part of the KPMG Delivery Network (KDN), a global organization that supports KPMG m...Show more
    Last updated: 4 days ago • Promoted
    AI Architect

    AI Architect

    KPIT • Gurgaon, Haryana, India
    Experience-8 to 11 years Notice Period- Immediate joiner Architecture & design of scalable AI platforms Expertise in agentic AI frameworks (LangChain, AutoGPT, CrewAI, AWS) Deep understanding of LL...Show more
    Last updated: less than 1 hour ago • Promoted • New!
    AI Solution Architect

    AI Solution Architect

    Oracle • Gurgaon, Haryana, India
    The Oracle AI Centre of Excellence empowers individuals and organizations of all sizes across India with cutting-edge cloud and AI technologies, world-class training, and a dynamic environment for ...Show more
    Last updated: 20 days ago • Promoted
    Lead IT Architect, AI Architecture, Platinion

    Lead IT Architect, AI Architecture, Platinion

    Boston Consulting Group • Gurgaon, Haryana, India
    Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy whe...Show more
    Last updated: 10 days ago • Promoted
    AI Architect

    AI Architect

    Tata Consultancy Services • Gurgaon, Haryana, India
    Role : - AI Architect Experience- 8-15years Location- PAN India Job Description We are seeking a visionary AI Architect with 8 –15 years of overall experience to lead the strategic design and implem...Show more
    Last updated: 30+ days ago • Promoted
    AIML Architect

    AIML Architect

    ValueLabs • Gurgaon, Haryana, India
    Dear Aspirants, We at ValueLabs have an Opening for AI / ML Architect role.Role : AI / ML Architect Experience 7+ Years 6months-1year Remote Responsibilities At least 7+ years of relevant AI / ML experi...Show more
    Last updated: 5 days ago • Promoted
    Technical Architect - AI

    Technical Architect - AI

    Confidential • Gurugram, Gurgaon / Gurugram, India
    Cvent is a leading meetings, events, and hospitality technology provider with more than 4,800 employees and 22,000 customers worldwide, including 53% of the Fortune 500. Founded in 1999, Cvent deliv...Show more
    Last updated: 12 days ago • Promoted