AI Tech Architect

Recro
Delhi, India
1 day ago
Job description

AI Tech Architect (7–10 yrs) — Agentic & Gen AI Platforms

Location : Bengaluru / Gurugram

Team : AI Platforms & Architecture

Employment : Full-time

Key Skills : Python, FastAPI, AWS (EKS, Bedrock, OpenSearch, S3, RDS), GenAI & RAG Architecture, Agent Frameworks (Semantic Kernel, LangGraph, AutoGen), Vector Databases, Observability (OpenTelemetry, Datadog), Security & Scalability Design.

Overview

Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic / GenAI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms.

Responsibilities

  • Define target architectures for agentic systems (planning / reasoning / tool-calling), GenAI / RAG pipelines, and evaluation loops; produce clear design documents with Flow / UML / sequence diagrams and AWS deployment topologies.
  • Size and optimize infrastructure for cost and performance : model throughput / latency, concurrency, autoscaling policies, CPU / GPU needs, memory footprints, vector index sizing, storage / egress, and token budgets.
  • Lead deep-dive debugging and incident resolution : profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar.
  • Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; LangGraph / AutoGen / CrewAI acceptable), tool / function schemas, validation, memory, grounding, and multi-step planning.
  • Architect retrieval and hybrid search systems : ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk.
  • Productionize on AWS using Amazon EKS, S3, SQS / SNS, and AWS Bedrock; integrate identity (Okta / IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs / SLOs and error budgets.
  • Make systems observable : distributed tracing, metrics, and logs using OpenTelemetry and Datadog; standardize dashboards, alerts, and tool / trace replay.
  • Build evaluation and promotion workflows : prompt / flow tests, golden sets, offline batch runs, A / B experiments, regression suites, and rollout gates.
  • Design security and safety controls : threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII / data governance, and audit trails.
  • Define platform standards : reusable SDKs, connectors, CI / CD templates, runbooks, and architecture review checklists.
  • Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs.
  • Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability.

Must Have

  • 7–10 years in software / AI engineering, including at least 4 years building GenAI applications and at least 2 years architecting production agentic systems.
  • Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code, fix bugs, and optimize performance-critical paths.
  • Experience with one or more agent frameworks (Semantic Kernel, LangGraph, AutoGen, CrewAI) and function / tool calling with schema and argument validation.
  • Proven design of GenAI / RAG / hybrid retrieval systems using AWS Bedrock, OpenSearch Serverless, or other vector databases; grounding and retrieval evaluation experience.
  • Deep knowledge of AWS architecture : Amazon EKS, Bedrock, S3, SQS / SNS, RDS (SQL Server / PostgreSQL), ElastiCache (Redis), Secrets Manager, IAM / Okta, Kong API Gateway, OpenSearch Serverless, and Datadog.
  • Observability expertise : distributed tracing (OpenTelemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices.
  • Cost and performance engineering mindset : capacity modeling, GPU / CPU sizing, autoscaling (HPA), batching / streaming, caching, and FinOps discipline.
  • Security and safety fundamentals : least privilege, data isolation, policy enforcement, content moderation, jailbreak / PII defenses, and compliance awareness.
  • Excellent technical communication : clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews.

Good to Have

  • Multi-agent orchestration patterns : task decomposition, coordinator-worker, human-in-the-loop, graph-based planning.
  • Deep expertise with vector databases and retrieval : OpenSearch Serverless, Pinecone, pgvector, Redis.
  • Evaluation frameworks : red teaming, automated guardrails, regression testing, rollout gates, canary deployments.
  • Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and AuthN / Z best practices.
  • Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering).
  • Knowledge of Kong API Gateway, LaunchDarkly / Flipt for feature management, and NeMo Guardrails for runtime safety.
  • CI / CD exposure (build / test with GitHub Actions, deployments via Terraform / AWS IaC templates).

Core Tech Stack (our core; equivalents welcome)

  • Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.x, Alembic, pytest.
  • Amazon EKS, AWS Bedrock, Amazon SQS / SNS, Amazon RDS (SQL Server / PostgreSQL), ElastiCache (Redis).
  • Amazon S3 for storage, Amazon ECR for container images, OpenSearch Serverless for vector storage.
  • AWS Secrets Manager, Okta IAM, NeMo Guardrails, Kong API Gateway.
  • OpenTelemetry + Datadog for observability and monitoring.
  • Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.