Location : Remote | Job Type : Full-Time | Experience Level : 3+ years
About Us
Cloud202 Limited is a leading technology consulting company dedicated to helping businesses transform and innovate through cutting-edge technology solutions. We specialize in cloud migration, AI / ML, and application development, providing our clients with the expertise they need to stay ahead in a rapidly evolving digital landscape.
Position Overview
We are seeking an innovative AI Engineer to lead the development and implementation of enterprise-grade agentic AI solutions. This role requires deep expertise in the Gen-AI ecosystem, including Amazon Bedrock, Amazon Bedrock AgentCore, SageMaker AI, and emerging AI agent frameworks. The ideal candidate will drive enterprise AI transformation initiatives and build next-generation intelligent applications using cutting-edge agentic platforms and protocols.
Required Qualifications
Experience
- Minimum 3+ years of hands-on experience with AWS cloud services and machine learning infrastructure
- 2+ years of specific experience with generative AI, large language models (LLMs), and foundation models
- Proven track record of building and deploying production-scale AI / ML applications on AWS
Certifications
Preferred : AWS Certified AI Practitioner or AWS Machine Learning SpecialtyCore Technical Skills
Amazon Bedrock AgentCore Platform (Critical)
AgentCore Runtime : Deploy and operate AI agents securely at scale with serverless infrastructure, session isolation, and support for 8-hour execution windowsAgentCore Memory : Implement intelligent session and long-term memory with episodic learning capabilities for context-aware agent interactionsAgentCore Gateway : Build secure, centralized access to tools and APIs with minimal code transformationAgentCore Identity : Implement seamless agent authentication across AWS services and third-party applications (Slack, Zoom, GitHub, Salesforce) using OAuth, Okta, Entra, or Amazon CognitoAgentCore Tools : Utilize Code Interpreter for secure code execution and Browser Tool for enterprise-grade web automation within managed sandbox environmentsAgentCore Observability : Implement end-to-end tracing, debugging, and monitoring through unified CloudWatch dashboards with OTEL compatibilityAgentCore Policy : Set fine-grained boundaries on agent actions with real-time deterministic controlsAgentCore Evaluations : Continuously assess agent quality and behavior for production readinessGen-AI Services & Foundation Models
Amazon Bedrock : Comprehensive experience with foundation model access, fine-tuning, and deploymentSageMaker AI : Model hosting, endpoints, auto-scaling, A / B testing, and deployment pipelinesAmazon Q Developer : AI-powered development automation and code transformation capabilitiesFoundation Models : Hands-on experience with Claude (Anthropic), Llama (Meta), GPT models (OpenAI), Mistral, and Amazon Nova modelsAI Agents Development & Frameworks
Strands Agents SDK : Build production-ready AI agents with model-driven approach, supporting single agents, multi-agent systems, and swarm architecturesFramework Expertise : Experience with CrewAI, LangGraph, LlamaIndex, Google ADK, OpenAI Agents SDK, or custom agent frameworksMulti-Agent Orchestration : Design complex workflows with hierarchical delegation, agent-as-tools patterns, and dynamic capability discoveryAgentic Workflows : Build autonomous agents that reason, plan, use tools, and maintain context across long-running tasksTool Integration : Develop custom tools using Python decorators and integrate external APIs and servicesAgent Protocols & Interoperability (Essential)
Model Context Protocol (MCP) : Implement MCP servers and clients to provide standardized context and tool access to AI agents. Deploy MCP servers in AgentCore Runtime with OAuth authenticationAgent-to-Agent (A2A) Protocol : Build inter-agent communication systems using A2A protocol for peer-to-peer agent collaboration, capability negotiation, and task coordinationAgent Discovery : Implement agent cards and capability manifests for dynamic agent discovery and routingProtocol Integration : Deploy agents supporting both MCP and A2A protocols for maximum interoperability across enterprise systemsAdvanced Technical Skills
Vector Databases : Amazon OpenSearch, Pinecone, or similar for RAG implementationsProgramming : Expert-level Python and JavaScript / TypeScript, with focus on AI / ML libraries and async programmingAPIs & Integration : RESTful APIs, GraphQL, JSON-RPC 2.0, Server-Sent Events (SSE), real-time streaming, webhook integrationPrompt Engineering : Advanced prompt flows, few-shot learning, chain-of-thought reasoning, and structured output generationKnowledge Bases : RAG implementation with enterprise data integration and semantic searchGuardrails & Safety : Bedrock Guardrails, content filtering, bias detection, and responsible AI practicesCustom Model Fine-tuning : Adapting foundation models for domain-specific use casesAdvanced GenAI Applications
Retrieval-Augmented Generation (RAG) : Enterprise search, document Q&A, knowledge managementContent Generation : Text, image, code, and multimedia content creationConversational AI : Chatbots, virtual assistants, customer service automation with memory retentionCode Generation & Analysis : Automated code review, documentation, refactoring, and software modernizationData Analysis & Insights : Natural language to SQL, automated reporting, business intelligenceKey Responsibilities
Solution Architecture & Design
Design end-to-end generative AI solutions using Amazon Bedrock AgentCore as the primary agentic platformArchitect scalable, cost-effective AI pipelines leveraging AgentCore Runtime for serverless deploymentImplement MCP and A2A protocols for agent interoperability and tool integrationDesign multi-agent architectures with proper orchestration, memory management, and observabilityCreate technical documentation and best practices for AgentCore implementationsDevelopment & Implementation
Build production-ready agentic applications using Amazon Bedrock AgentCore services (Runtime, Memory, Gateway, Identity, Observability)Develop AI agents using Strands Agents SDK and other framework-agnostic approachesImplement MCP servers for tool and data access across enterprise systemsDeploy A2A-compliant agents for cross-platform agent collaborationImplement RAG systems with vector databases and AgentCore Gateway for secure data accessCreate automated workflows for model deployment, monitoring, and evaluationIntegrate AI capabilities into existing enterprise applications with proper authentication and governanceModel & Agent Management
Evaluate and select appropriate foundation models for specific use casesImplement AgentCore Policy for fine-grained control over agent actions and permissionsUse AgentCore Evaluations for continuous quality assessment and optimizationOptimize agent performance, cost, and latency using AgentCore Observability insightsEnsure compliance with data privacy, security requirements, and responsible AI practicesInnovation & Research
Stay current with latest AWS AI service releases, AgentCore capabilities, and agentic AI protocolsExperiment with emerging AI techniques, multi-agent patterns, and protocol enhancementsPrototype new use cases and proof-of-concepts using AgentCore platformContribute to internal AI strategy, AgentCore best practices, and community open-source projectsPreferred Experience & Skills
Industry-Specific Knowledge
Experience with industry-specific AI applications (healthcare, finance, retail, manufacturing)Understanding of compliance requirements (GDPR, HIPAA, SOX, PCI-DSS)Knowledge of AI ethics, bias mitigation, and responsible AI governanceAdvanced Technical Skills
MLOps : Model lifecycle management, automated retraining, drift detection with SageMaker PipelinesReal-time AI : Streaming data processing, low-latency inference, event-driven architecturesMultimodal AI : Text, image, audio, and video processing with Amazon Nova modelsEdge AI : Model optimization for edge deploymentCustom Training : Fine-tuning foundation models with proprietary dataInfrastructure as Code : CloudFormation, AWS CDK, or Terraform for AgentCore deploymentsLeadership & Collaboration
Experience leading AI transformation initiatives and AgentCore adoptionAbility to communicate complex agentic AI concepts to non-technical stakeholdersCross-functional collaboration with product, engineering, and business teamsMentoring junior engineers and data scientists on AgentCore best practicesRecent Technology Awareness (2025)
Amazon Bedrock AgentCore GA release with VPC support, A2A protocol, and enhanced observability (October 2025)Strands Agents SDK 1.0 with multi-agent orchestration, session management, and A2A supportAgent-to-Agent (A2A) protocol under Linux Foundation governance with enterprise adoptionModel Context Protocol (MCP) enhancements for agent-to-agent communication and tool integrationLatest Amazon Nova models (Nova Premier, Nova Sonic) for multimodal and conversational AILatest Anthropic Claude models (Claude 4 Sonnet) with extended context and enhanced capabilitiesAgentCore Policy and Evaluations for production-grade agent governanceAWS Q Developer CLI integration with MCP for agentic development workflowsEducation & Background
Bachelor's degree in Computer Science, AI / ML, Mathematics, or related fieldContinuous learning mindset with active participation in AI communities and open-source contributionsStrong understanding of distributed systems, microservices, and serverless architecturesIndustry : IT Services and IT Consulting
Employment Type : Full-time