Talent.com
Site Reliability Engineer (SRE) - AWS

techolution • Thiruvananthapuram, IN
10 hours ago
Job description

We are seeking a highly skilled Site Reliability Engineer - AWS to enhance the reliability, scalability, and security of our cloud infrastructure. The ideal candidate will be responsible for designing, implementing, and maintaining high-availability systems, automating processes, and ensuring seamless operations on AWS. This role requires expertise in DevOps, cloud automation, monitoring, and incident response.

Title : Site Reliability Engineer - AWS

Location : Remote Work

Employment Type : Full-time

Work timings : 24x7 rotational shifts

Responsibilities :

  • Design and maintain highly available, scalable, and fault-tolerant AWS infrastructure to ensure system reliability and performance.
  • Proactively monitor and troubleshoot system issues, minimizing downtime and optimizing system performance.
  • Develop and maintain Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK to automate deployments and infrastructure management.
  • Implement and optimize continuous integration and deployment (CI/CD) pipelines using tools like Jenkins, GitLab CI/CD, or AWS CodePipeline.
  • Ensure AWS environments meet security best practices, including IAM policies, network security configurations, and compliance requirements.
  • Set up and manage monitoring and logging solutions using tools such as Prometheus, AWS CloudWatch, ELK Stack, and Datadog.
  • Identify and address performance bottlenecks through load balancing, caching strategies, and system optimizations.
  • Work closely with developers, security teams, and product managers to enhance system architecture and operational efficiency.

Required Skills & Experience :

  • Strong experience in AWS services such as EC2, Lambda, EKS, S3, SageMaker, DynamoDB, and IAM.
  • Expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Proficiency in CI/CD pipelines using GitHub Actions, Jenkins, or AWS CodePipeline.
  • Experience with containerization and orchestration (Docker, Kubernetes, Helm).
  • Strong knowledge of monitoring, logging, and alerting tools (CloudWatch, Prometheus, ELK, Datadog).
  • Solid Python, Bash, or Golang scripting skills for automation.
  • Experience working with ML models in production environments is a plus.
  • Familiarity with security best practices (IAM, VPC security, encryption, WAF).
  • Strong problem-solving and troubleshooting skills.

Preferred Qualifications :

  • Experience with MLOps frameworks and AI model deployment.
  • Knowledge of AWS AI/ML services like SageMaker, Bedrock, or AI pipelines.
  • Hands-on experience with Kafka, Spark, or other big data technologies.

About Techolution :

    Techolution is a next-gen consulting firm on track to become one of the most admired brands in the world for "innovation done right". Our purpose is to harness our expertise in novel technologies to deliver more profits for our enterprise clients while helping them deliver a better human experience for the communities they serve.

    With that, we are now fully committed to helping our clients build the enterprise of tomorrow by making the leap from Lab Grade AI to Real World AI. Our other focus areas are Enterprise Cloud, Product Innovation (IoT, 3D printing, Robotics), and Real World AI Services (CV, LLM, CNN).

    We are honored to have recently received the prestigious Inc 500 Best In Business award, a testament to our commitment to excellence. We were also named AI Solution Provider of the Year by The AI Summit 2023 and were a Platinum sponsor at the Advantage DoD 2024 Symposium, among other recognitions. While we are big enough to be trusted by some of the greatest brands in the world, we are small enough to care about delivering meaningful ROI-generating innovation at a guaranteed price for each client that we serve.

    Our thought leader, Luv Tulsidas, wrote and published a book in collaboration with Forbes, “Failing Fast? Secrets to succeed fast with AI”. For more details on the content, see https://www.luvtulsidas.com/

    Let's explore further!

    Uncover our unique AI accelerators with us :

    1. Enterprise LLM Studio : Our no-code DIY AI studio for enterprises. Choose an LLM, connect it to your data, and create an expert-level agent in 20 minutes.

    2. AppMod.AI : Modernizes ancient tech stacks quickly, achieving over 80% autonomy for major brands!

    3. ComputerVision.AI : Offers customizable Computer Vision and Audio AI models, plus DIY tools and a Real-Time Co-Pilot for human-AI collaboration!

    4. Robotics and Edge Device Fabrication : Provides comprehensive robotics, hardware fabrication, and AI-integrated edge design services.

    5. RLEF AI Platform : Our proven Reinforcement Learning with Expert Feedback (RLEF) approach bridges Lab-Grade AI to Real-World AI.

    6. AI Center of Excellence : Establishes an AI Center of Excellence to maximize AI potential and ROI.

    7. FaceOpen : AI-powered user identification system using image recognition and deep neural networks, eliminating the need for keys, badges, or fingerprint scanners!

    Some videos you wanna watch!

  • Computer Vision demo at The AI Summit New York 2023
  • Life at Techolution
  • GoogleNext 2023
  • Ai4 - Artificial Intelligence Conferences 2023
  • WaWa - Solving Food Wastage
  • Saving lives - Brooklyn Hospital
  • Innovation Done Right on Google Cloud
  • Techolution featured on Worldwide Business with Kathy Ireland
  • Techolution presented by ION World’s Greatest

    Visit us @ www.techolution.com to learn more about our revolutionary core practices and how we enrich the human experience with technology.
