Greetings from TCS!!!!!!!
TCS Hiring for Observability(Prometheus , Grafana , ELK Stack)
Job Location : Chennai
Experience Range : 6-10 Years
Job Description :
Strong hands-on experience with observability tools :
Expertise in distributed tracing and metrics collection .
Familiarity with cloud-native observability (AWS CloudWatch, Azure Monitor, GCP Operations Suite).
Proficiency in scripting (Python, Bash) for automation with exposure to UNIX / Linux Shell Scripting
Experience with containerized environments (Docker, Kubernetes) and observability in microservices.
Strong understanding of SRE principles , SLIs / SLOs , and alerting strategies .
Responsibilities :
Lead design and development teams in Fault Monitoring, Performance Management and Configuration Management in Network Management and Service Assurance Domain.
Design and implement observability frameworks across applications, microservices, and infrastructure.
Deploy and manage monitoring, logging, and tracing tools (e.g., Prometheus, Grafana, ELK, Open Telemetry).
Define and implement metrics, dashboards, and alerts for proactive monitoring.
Collaborate with development, DevOps, and SRE teams to embed observability best practices in CI / CD pipelines.
Ensure end-to-end visibility across distributed systems and cloud environments.
Troubleshoot and optimize application performance using observability insights.
Drive root cause analysis (RCA) and improve incident response through observability data.
Maintain documentation, standards, and governance for observability practices.
Observability • dombivli, maharashtra, in