DevOps Site Reliability Engineer

LotusFlare, Pune, India
4 hours ago
Job description

Responsibilities

  • Monitoring backend services (cloud-based infrastructure)
  • Supporting, troubleshooting, and investigating issues and incidents (supporting developers and the infra team by analyzing system metrics, logs, traffic, configuration, deployment changes, etc.)
  • Supporting and improving monitoring / alerting systems (researching, testing, and deploying new functionality for existing tools)
  • Creating new features to automate the troubleshooting and investigation process
  • Creating new tools to improve the support process
  • Drafting reports and summarizing information after investigations and incidents

Requirements :

  • At least 1 year of work experience with similar responsibilities
  • Strong knowledge of and practical experience with Linux (Ubuntu) command-line use and administration
  • Understanding of network protocols and troubleshooting (TCP/IP, UDP)
  • Strong scripting skills (Bash, Python)
  • Critical thinking and problem solving
  • Understanding of containerization (Docker, container)
  • Experience with troubleshooting API-driven services
  • Experience with Kubernetes
  • Experience with Git
  • Background in release management processes
  • English — Professional written and verbal skills

Good to have :

  • Prometheus, Grafana, Kibana (query languages)
  • Experience with Nginx / OpenResty
  • Experience with telco protocols (CAMEL, MAP, Diameter) is an advantage
  • Software development / scripting skills
  • Basic knowledge of Cassandra and PostgreSQL
  • Experience with using AWS cloud services (EC2, Redshift, S3, RDS, ELB / ALB, ElastiCache, Direct Connect, Route 53, Elastic IPs, etc.)
  • CI / CD : Jenkins
  • Terraform

Recruitment Process :

HR interview followed by 4-5 levels of technical interviews

About :

At LotusFlare, we attract and keep amazing people by offering two key things :

  • Purposeful Work : Every team member sees how their efforts make a tangible, positive difference for our customers and partners.
  • Growth Opportunities : We provide the chance to develop professionally while mastering cutting-edge practices in cloud-native enterprise software.

From the beginning, our mission has been to simplify technology to create better experiences for customers. Using an “experience down” approach, which prioritizes the customer's journey at every stage of development, our Digital Network Operator™ Cloud empowers communication service providers to achieve valuable business outcomes. DNO Cloud enables communication service providers to innovate freely, reduce operational costs, monetize network assets, engage customers on all digital channels, drive customer acquisition, and increase retention.

With headquarters in Santa Clara, California, and five major offices worldwide, LotusFlare serves Deutsche Telekom, T-Mobile, A1, Globe Telecom, Liberty Latin America, Singtel, and other leading enterprises around the world.
