About Albertsons Companies Inc. :
As a leading food and drug retailer in the United States, Albertsons Companies, Inc. operates over 2,200 stores across 35 states and the District of Columbia. Our well-known banners across the United States, including Albertsons, Safeway, Vons, Jewel-Osco and others, serve more than 36 million U.S. customers each week.
We build and shape technology solutions that solve customers’ problems every day, making things easier for them when they shop with us online or in a store. We have made bold, strategic moves to migrate and modernize our core foundational capabilities, positioning ourselves as the first fully cloud-based grocery tech company in the industry.
Our success is built on a one-team approach, driven by the desire to understand and enhance the customer experience. By constantly pushing the boundaries of retail, we are transforming shopping into an experience that is easy, efficient, fun and engaging.
About Albertsons Companies India :
At Albertsons Companies India, we're not just pushing the boundaries of technology and retail innovation, we're cultivating a space where ideas flourish and careers thrive. Our workplace in India is a vital extension of the Albertsons Companies Inc. workforce and important to the next phase in the company’s technology journey to support millions of customers’ lives every day.
At Albertsons Companies India, we are raising the bar to grow across Technology & Engineering, AI, Digital and other company functions, and transform a 165-year-old American retailer. Here, associates collaborate directly with international teams, enhancing decision-making processes and organizational agility through exciting and pivotal projects. Your work will make history and help millions of lives each day come together around the joys of food and inspire their well-being.
Position Title : Staff Engineer AIOps
Job Description :
Roles & responsibilities :
- Design, implement, and manage ML pipelines on Databricks, leveraging Unity Catalog for data governance and managing ML workflows using Genie and Mosaic Gateway.
- Automate CI / CD processes for ML workflows, ensuring reproducible and reliable experiments.
- Deploy and monitor model serving architectures across multi-cloud environments, specifically running Databricks on GCP while supporting model invocations on OpenAI’s cloud.
- Integrate LangChain and other framework connectors to enable agent-based architectures that call external APIs (e.g., OpenAI APIs for generative models) within a Databricks context.
- Implement end-to-end model tracking and versioning using MLflow, ensuring clear lineage and reproducibility.
- Develop and maintain shared tooling and workflows to support data scientists, focusing on robust infrastructure and streamlined deployment across GCP and Azure.
- Design and maintain mechanisms for secure and efficient data access and movement across cloud boundaries, using Databricks’ cross-cloud features.
- Configure and monitor performance metrics and logging across environments, ensuring observability and compliance for training and serving workloads.
- Partner with compliance and security teams to ensure ethical and secure model deployments while managing secrets and keys for API access.
- Collaborate with other engineering teams to create best practices around Unity Catalog, governance, and cross-cloud deployment.
Experience :
Required Qualifications :
- Bachelor’s / Master’s degree in Computer Science, Engineering, or a related field.
- 10+ years of professional experience in MLOps / ML engineering, including large-scale deployment on Databricks.
- Proven expertise with Databricks components like Unity Catalog, Genie, Mosaic Gateway, and MLflow.
- Experience orchestrating multi-cloud architectures, particularly using Databricks on GCP for development and training, with inference performed via external services such as OpenAI’s APIs.
- Strong proficiency in Python and integration of external libraries such as LangChain and OpenAI’s API within Databricks notebook workflows.
- Experience designing and automating CI / CD pipelines for ML workloads with tools like Jenkins, GitHub Actions, or Databricks Workflows.
- Strong understanding of inter-cloud data movement, secure API access, and best practices for compliance and governance.
- Hands-on experience with container orchestration (Docker, Kubernetes) and infrastructure-as-code tools (Terraform).
- Familiarity with model monitoring, observability tools, and performance profiling on GPU and CPU resources.
Preferred Qualifications :
- Experience deploying ML applications for pricing or optimization algorithms.
- Contributions to open-source projects related to Databricks, MLflow, or MLOps frameworks.
- Understanding of AI safety, compliance, and governance frameworks.
- Experience mentoring junior engineers and establishing MLOps best practices across an organization.
Must Have Skills :
Databricks expertise, Multi-Cloud Architecture & Deployment, CI / CD for Machine Learning, Python Programming & Framework Integration, Model Monitoring, Compliance