Talent.com
AI Tech Architect
AI Tech ArchitectRecro • Delhi, India
AI Tech Architect

AI Tech Architect

Recro • Delhi, India
21 days ago
Job description

Overview

Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic / GenAI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms.

Responsibilities

  • Define target architectures for agentic systems (planning / reasoning / tool-calling), GenAI / RAG pipelines, and evaluation loops; produce clear design documents with Flow / UML / sequence diagrams and AWS deployment topologies.
  • Size and optimize infrastructure for cost and performance : model throughput / latency, concurrency, autoscaling policies, CPU / GPU needs, memory footprints, vector index sizing, storage / egress, and token budgets.
  • Lead deep-dive debugging and incident resolution : profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar.
  • Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; LangGraph / AutoGen / CrewAI acceptable), tool / function schemas, validation, memory, grounding, and multi-step planning.
  • Architect retrieval and hybrid search systems : ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk.
  • Productionize on AWS using Amazon EKS, S3, SQS / SNS, and AWS Bedrock; integrate identity (Okta / IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs / SLOs and error budgets.
  • Make systems observable : distributed tracing, metrics, and logs using OpenTelemetry and Datadog; standardize dashboards, alerts, and tool / trace replay.
  • Build evaluation and promotion workflows : prompt / flow tests, golden sets, offline batch runs, A / B experiments, regression suites, and rollout gates.
  • Design security and safety controls : threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII / data governance, and audit trails.
  • Define platform standards : reusable SDKs, connectors, CI / CD templates, runbooks, and architecture review checklists.
  • Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs.
  • Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability.

Must Have

  • 7–10 years in software / AI engineering with at least 4+ years building GenAI applications and 2+ years architecting production agentic systems.
  • Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code, fix bugs, and optimize performance-critical paths.
  • Experience with one or more agent frameworks (Semantic Kernel, LangGraph, AutoGen, CrewAI) and function / tool calling with schema and argument validation.
  • Proven design of GenAI / RAG / hybrid retrieval systems using AWS Bedrock, OpenSearch Serverless, or other vector databases; grounding and retrieval evaluation experience.
  • Deep knowledge of AWS architecture : Amazon EKS, Bedrock, S3, SQS / SNS, RDS (SQL Server / PostgreSQL), ElastiCache (Redis), Secrets Manager, IAM / Okta, Kong API Gateway, OpenSearch Serverless, and Datadog.
  • Observability expertise : distributed tracing (OpenTelemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices.
  • Cost and performance engineering mindset : capacity modeling, GPU / CPU sizing, autoscaling (HPA), batching / streaming, caching, and FinOps discipline.
  • Security and safety fundamentals : least privilege, data isolation, policy enforcement, content moderation, jailbreak / PII defenses, and compliance awareness.
  • Excellent technical communication : clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews.
  • Good to Have

  • Multi-agent orchestration patterns : task decomposition, coordinator-worker, human-in-the-loop, graph-based planning.
  • Deep expertise with vector databases and retrieval : OpenSearch Serverless, Pinecone, pgvector, Redis.
  • Evaluation frameworks : red teaming, automated guardrails, regression testing, rollout gates, canary deployments.
  • Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and AuthN / Z best practices.
  • Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering).
  • Knowledge of Kong API Gateway, LaunchDarkly / Flipt for feature management, and NeMo Guardrails for runtime safety.
  • CI / CD exposure (build / test with GitHub Actions, deployments via Terraform / AWS IaC templates).
  • Core Tech Stack (our core; equivalents welcome)

  • Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.x, Alembic, pytest.
  • Amazon EKS, AWS Bedrock, Amazon SQS / SNS, Amazon RDS (SQL Server / PostgreSQL), ElastiCache (Redis).
  • Amazon S3 for storage, Amazon ECR for container images, OpenSearch Serverless for vector storage.
  • AWS Secrets Manager, Okta IAM, NeMo Guardrails, Kong API Gateway.
  • OpenTelemetry + Datadog for observability and monitoring.
  • Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.
  • Create a job alert for this search

    Ai Architect • Delhi, India

    Related jobs
    Data and Ai Solution Architect

    Data and Ai Solution Architect

    Aventra Group • Ghaziabad, IN
    Job Title : Data & AI Solution Architect.The Data & AI Architect role is accountable for designing high-level, end-to-end data platform solutions that address complex business problems.You will lead...Show more
    Last updated: 23 hours ago • Promoted
    AI Architect

    AI Architect

    Yanthraa Information Systems • Delhi, India
    Company Description Yanthraa Information Systems is a prominent technology solutions provider specializing in advanced AI applications, comprehensive web and mobile development, robust cloud servic...Show more
    Last updated: 21 days ago • Promoted
    AI / ML - Tech Lead / Architect

    AI / ML - Tech Lead / Architect

    Armakuni • Delhi, India
    Armakuni (AWS Premier Partner) , a trusted partner in helping organizations leverage cloud-native technologies to achieve agility, scalability, and resilience. Guiding businesses through adopting cl...Show more
    Last updated: 7 days ago • Promoted
    TC AI Architect Solution Architect Manager

    TC AI Architect Solution Architect Manager

    EY Studio+ Nederland • Delhi, Delhi, India
    At EY were all in to shape your future with confidence.Well help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go.Join EY and help to ...Show more
    Last updated: 4 days ago • Promoted
    AI Architect

    AI Architect

    Black Box • Delhi, India
    In a technical services company that builds leveraged.The Senior Innovation Architect ensures all innovative solutions align with the client’s security requirements and enterprise architecture stan...Show more
    Last updated: 30+ days ago • Promoted
    Technical Architect AI [T500-21190]

    Technical Architect AI [T500-21190]

    McDonald's Global Office in India • Delhi, India
    About McDonald’s : One of the world’s largest employers with locations in more than 100 countries, McDonald’s Corporation has corporate opportunities in Hyderabad. Our global offices serve as dynamic...Show more
    Last updated: 1 day ago • Promoted
    Ai Architect

    Ai Architect

    EvoluteIQ • Noida, Republic Of India, IN
    We at EvoluteIQ believe in the power of transformation.We are committed to building an industry leading technology that will revolutionize the way enterprises conduct business.To make that happen, ...Show more
    Last updated: 13 days ago • Promoted
    AI Architect (LLMs, RAG, Vertex AI)

    AI Architect (LLMs, RAG, Vertex AI)

    BIG Language Solutions • Noida, Uttar Pradesh, India
    AI Architect (LLMs, RAG, Vertex AI).Experience Required : 12–15 Years.Location : Noida(Work from Office).We are seeking a highly skilled AI Architect with deep expertise in Large Language Models (LLM...Show more
    Last updated: 22 days ago • Promoted
    AI Cloud Architect

    AI Cloud Architect

    HCLTech • Noida, Uttar Pradesh, India
    We are looking for a AI Cloud Architect + developer who can design, develop, and deploy cloud-based solutions for our clients. You will be responsible for creating scalable, secure, and cost-effecti...Show more
    Last updated: 19 days ago • Promoted
    Agentic AI Architect

    Agentic AI Architect

    Infinit-O • Delhi, India
    Role Overview : This role is deeply technical, requiring hands-on expertise in automation platforms, system integration, and architectural leadership to deliver intelligent, scalable, and future-rea...Show more
    Last updated: 5 days ago • Promoted
    AI Architect

    AI Architect

    EvoluteIQ • Ghaziabad, Uttar Pradesh, India
    Life at EvoluteIQ We at EvoluteIQ believe in the power of transformation.We are committed to building an industry leading technology that will revolutionize the way enterprises conduct business.To...Show more
    Last updated: 13 days ago • Promoted
    AI Agent Architect

    AI Agent Architect

    Luxoft • Ghaziabad, IN
    We are seeking a hands-on and qualified AI Agent Architect to design and deploy advanced Agentic AI systems—comprising task-specific autonomous tools governed by a master agent—to support complex t...Show more
    Last updated: 16 days ago • Promoted
    Technical Architect - Data & AI

    Technical Architect - Data & AI

    HCLTech • Delhi, India
    Meet regularly with other team members to discuss progress and find.Generate progress reports for clients and senior leaders. Manage organizational deliverables by using.Required skills and qualific...Show more
    Last updated: 30+ days ago • Promoted
    Gen AI Architect

    Gen AI Architect

    Polestar Analytics • Noida, Uttar Pradesh, India
    Your Role and Responsibilities.Design and implement scalable Gen AI solutions that align with business objectives.Develop system architectures for real-time and batch processing of AI models.Ensure...Show more
    Last updated: 22 days ago • Promoted
    AI Architect

    AI Architect

    Recro • Delhi, India
    As the AI Systems Architect, you’ll own the end-to-end design and delivery of production-grade agentic and Generative AI systems. This is a highly hands-on role requiring deep architectural insight,...Show more
    Last updated: 21 days ago • Promoted
    AI Architect

    AI Architect

    TekPillar® • Delhi, India
    Cloud platforms (AWS or equivalent) - Artificial Intelligence / Large Language Models (LLMs).Architect and design a scalable and robust agentic AI platform for automotive domains - Evaluate and imp...Show more
    Last updated: 23 hours ago • Promoted
    AI Architect

    AI Architect

    Movate • Delhi, India
    Job Description – AI Architect (Generative & Agentic AI) Position : AI Architect Experience : 10–12 Years Employment Type : Full-time. About the Role We are seeking an experienced.Agentic AI framewo...Show more
    Last updated: 13 days ago • Promoted
    Generative AI Architect

    Generative AI Architect

    Leading Healthcare Industry • Delhi, India
    Overview We are seeking a highly skilled and experienced.This role goes beyond building AI models—it involves architecting robust, scalable, and secure AI-powered applications and services across t...Show more
    Last updated: 17 days ago • Promoted