Core Responsibilities :
AI / ML Integration
- Develop and implement AI / ML models for anomaly detection in software systems.
- Collaborate with data scientists, telemetry and observability engineers to refine models and improve accuracy.
- Integrate AI / ML solutions into existing software engineering workflows to enhance reliability and auto triage capabilities.
- Conducts deep statistical analysis, including predictive and prescriptive modeling, to mature and scalre reliability engineering capabilities
- Maintains knowledge of techniques, technology, industry trends, best practices, and emerging methodologies and applies it to projects.
System Reliability
Design and implement strategies to improve the reliability and performance of software systems.Works with reliability engineers, telemetry and observability engineers and data scientists to translate requirements into an analytical approachMonitor system health and performance, identifying and addressing issues proactively.Develop automated triage processes to quickly resolve incidents and minimize downtime.Communication & Collaboration
Collaborate with cross-functional partners to implement AI / ML solutions that scale and integrate seamlessly with existing technology stacksDeliver rapid insights on short timelines when urgent business questions arise, using advanced analytical methods in SQL, Python, or R.Summarize complex analyses into clear and concise presentations or reports for senior leadership decision-making.Education Qualification & Certifications
Required Minimum Qualifications :
List the education, certification, and work experience for an incumbent in the job. Enter the Minimum Qualifications and Preferred Qualifications as directed, and delete the areas not used.
List the education, certification, work experience and skills required to minimally qualify an individual for the job.
B.E / BTech
MTech
Skill Set Required
Primary Skills (must have)
5 Years of experience in programming and scripting languages (eg : Python, Java, JavaScript) to develop and maintain custom solutions.2 years of experience executing and deploying data science, machine learning, deep learning, and generative AI solutions, preferably in a large-scale enterprise setting (fewer years may be accepted with a masters or doctorate degree)3 years of experience in SQL and NoSQL databases, Hadoop ecosystem, Druid, Trino, Big QueryAgile project experienceSecondary Skills (desired)
Strong written and verbal communication skillsStrong interpersonal skills and time management skillsSkills Required
Hadoop Ecosystem, Oracle NoSQL Database, Sql