About Albertsons Companies Inc. :
As a leading food and drug retailer in the United States, Albertsons Companies, Inc. operates over 2,200 stores across 35 states and the District of Columbia. Our well-known banners across the United States, including Albertsons, Safeway, Vons, Jewel-Osco and others, serve more than 36 million U.S customers each week.
We build and shape technology solutions that solve customers’ problems every day, making things easier for them when they shop with us online or in a store. We have made bold, strategic moves to migrate and modernize our core foundational capabilities, positioning ourselves as the first fully cloud-based grocery tech company in the industry.
Our success is built on a one-team approach, driven by the desire to understand and enhance the customer experience. By constantly pushing the boundaries of retail, we are transforming shopping into an experience that is easy, efficient, fun and engaging.
About Albertsons Companies India :
At Albertsons Companies India, we're not just pushing the boundaries of technology and retail innovation, we're cultivating a space where ideas flourish and careers thrive. Our workplace in India is a vital extension of the Albertsons Companies Inc. workforce and important to the next phase in the company’s technology journey to support millions of customers’ lives every day.
At the Albertsons Companies India, we are raising the bar to grow across Technology & Engineering, AI, Digital and other company functions, and transform a 165-year-old American retailer. At Albertsons Companies India associates collaborate directly with international teams, enhancing decision-making processes and organizational agility through exciting and pivotal projects. Your work will make history and help millions of lives each day come together around the joys of food and inspire their well-being.
Position Title : Senior Engineer Site Reliability
Job Description :
Roles & responsibilities :
- Provide technical support for applications and programs currently in production.
- Analyze moderately complex incidents quickly or escalate to next level application support engineer.
- Create appropriate alerts, dashboards, KB articles, Confluence pages and knowledge sharing.
- Monitor dashboards daily to detect anomalies to share and work on corrections with appropriate teams and team members.
- Frequently check for alerts and respond appropriately.
- Work with Development Engineering and DevOps Engineering partners to maintain approvement agendas for services and act on them.
- Support the improved health of Production services.
- Automate both new and existing remediations where possible.
- Utilize basic SRE concepts such as SLI, SLO, SLA and selected types of V.A.L.E.T. SRE dashboards.
- Perform supportability testing, simulations and analysis.
- Determine whether to reuse existing code through the use of program development software alternatives or integrate purchased solutions.
- Coordinate incident management activities with other support teams.
- Resolve issues and / or escalate as needed to meet established service level agreements.
- Provide technical consultation and support in the development of new and currently used computer applications and programs.
- Act as a liaison between clients and applications area.
- Analyze business requirements, designs and writes technical specifications to design or redesign computer solutions.
- Ensure logic and design is in alignment with core architecture of the system / application.
- Develop original and / or complex code or provide coding guidance to less experienced staff.
- Perform modeling, simulations and analysis efforts.
- Verify program logic by overseeing the preparation of test data, testing and debugging of programs.
- Act as an escalation point for application support and troubleshooting.
- Coordinate the migration of applications to production.
- Participate in the development new documentation, participate in the development of department technical procedures and design user guides.
- Contributes to the development of production system documentation standards, procedures and approval hierarchies.
- Assure quality, security and compliance requirements are met for supported area.
- Provide support to less experienced staff in resolution of escalated issues and / or complex production, application or system problems.
Experience Required :
5+ years of programming experience using various standard scripting languages and high level programming languages.Specialized knowledge of troubleshooting skills with an ability to quickly diagnose complex production issuesSpecialized knowledge of application servers (WebSphere, WebLogic, and / or JBoss) and database technologies (Oracle, DB2, UDB, and / or SQL Server).Specialized knowledge of UI / Web 2.0 Development (JavaScript, CSS, Ajax, Adobe Flash / Flex, Dojo, YUI, and / or jQuery).Specialized knowledge of UNIX and Windows operating systemsExperience creating and maintaining application processes and documentationSpecialized knowledge of current monitoring toolsSpecialized knowledge of at least one major cloud platform and Service Container / Instance conceptsSpecialized knowledge of querying and inspection techniques for service and other types of logsSpecialized knowledge of basic NoSQL DB concepts and experience with querying information from sameSpecialized knowledge of Kafka concepts and health confirmation techniquesExposure to network concepts and technologiesSpecialized knowledge of the full software development lifecycle and software development methodologies (Agile).Strong experience creating and maintaining application processes and documentationAbility to understand client expectations and to resolve issues that may affect service.Strong interpersonal skills with the ability to work effectively across multiple levels of the organization.Ability to mentor, coach and train other engineersSelf-starter, with a demonstrated ability to learn beyond formal training with a strong aptitude for delivering quality products.Competencies :
Compassionate and kind, showing courtesy, dignity, and respect. They show sincere interest and empathy for all others.Show integrity in what is done and how it is done - without sacrificing personal / business ethics.Embrace an inclusion-focused mindset, seeking input from others on their work and encouraging the open expression of diverse ideas and opinionsTeam-oriented, positively contributing to team morale and willing to help.Learning-Focused, finding ways to improve in their field and use positive constructive feedback to grow personally and professionallyThink strategically and proactively anticipate future problems, needs or changes in the workSkills Required :
Site Reliability (SRE)Azure / GCPGrafana and PrometheusJavaNoSql / SQLAdditional Skills Required :
SalesforceNetworking skills like TCP / IP, HTTP , routing, switching.