Senior Databricks Administrator -AWSSonatype • Hyderabad, Telangana, India
Senior Databricks Administrator -AWS
Sonatype • Hyderabad, Telangana, India
2 days ago
Job description
Role Summary
The Databricks Administrator will be responsible for the overall health, security, and performance of the Databricks platform. This includes managing user access, implementing and enforcing data governance policies, optimizing cluster resources, and ensuring data sensitivity policies are effectively applied across the data lakehouse. The administrator will also be crucial in identifying, reporting, and resolving discrepancies within the platform's operation and configuration.
Key Responsibilities
User Provisioning and Management :
Onboard and offboard users, groups, and service principals within Databricks, including integration with identity providers (IdPs) like Azure Active Directory or Okta via SCIM.
Manage user roles and entitlements at both the account and workspace levels (Account Admins, Workspace Admins, Metastore Admins, etc.).
Implement and maintain role-based access control (RBAC) and attribute-based access control (ABAC) to ensure appropriate data and resource access.
Data Lake Governance (Unity Catalog focus) :
Configure and manage Unity Catalog metastores, catalogs, schemas, and tables.
Define and enforce data access policies (e.g., table-level, column-level, row-level security) using Unity Catalog.
Manage data lineage and auditing capabilities to track data flow and usage.
Collaborate with data owners and stakeholders to define data quality standards and ensure data integrity.Implement data retention and lifecycle management policies.
Aligning Data Sensitivity Policy to Enforceable Data Governance :
Translate organizational data classification and sensitivity policies into technical controls within Databricks.
Utilize features like data masking and encryption to protect sensitive information.
Ensure compliance with regulatory requirements (e.g., GDPR, HIPAA, CCPA) by implementing appropriate security measures.
Conduct regular security audits and vulnerability assessments.
Managing Cluster and Budget Policies :
Define and implement compute policies to control cluster creation, configuration, and resource usage, ensuring cost optimization.
Monitor and manage serverless budget policies to attribute usage to specific teams or projects.
Optimize cluster configurations for performance and cost-effectiveness, leveraging features like auto-scaling and auto-termination.
Manage cluster pools to reduce startup times and improve resource allocation.
Reporting and Addressing Discrepancies :
Monitor Databricks platform health, performance, and resource utilization.
Identify and troubleshoot issues related to user access, data availability, cluster performance, and policy violations.
Generate reports on platform usage, costs, security incidents, and compliance.
Investigate and resolve discrepancies in data, reports, or system behavior in collaboration with data engineers, data scientists, and other teams.
Develop and maintain comprehensive documentation of configurations, procedures, and best practices.
Collaboration and Support :
Provide technical support and guidance to Databricks users, data engineers, and data scientists.
Collaborate with cloud infrastructure teams (AWS, Azure, GCP) to manage underlying cloud resources.
Stay up-to-date with the latest Databricks features, best practices, and industry trends.
Technical Skills :
Databricks Platform Expertise :
Deep understanding of Databricks architecture, workspaces, and key components (Unity Catalog, Delta Lake, Spark, SQL Analytics).Proficiency in Databricks administration console and APIs.
Experience with Databricks Workflows, Jobs, and Delta Live Tables (DLT) for orchestration and pipeline management.
Cloud Platform Knowledge :
Strong experience with AWS and its relevant services.
Data Governance & Security :
Solid understanding of data governance principles, data classification, and data lifecycle management.
Experience implementing security controls, access policies (RBAC), and encryption.
Familiarity with compliance standards (GDPR, HIPAA, CCPA) and auditing practices.
Programming & Scripting :
Proficiency in SQL for data querying and access control.
Deep expertise in Terraform is essential, extending beyond basic knowledge to managing complex, multi-project infrastructure. This includes hands-on experience with custom Terraform modules crucial for Data Mesh orchestration.
Scripting skills (e.g., Python, Terraform) for automation and administrative tasks.
Familiarity with Spark and PySpark concepts for troubleshooting and optimization.
Identity and Access Management (IAM) :
Experience with enterprise identity providers (e.g., Azure AD, Okta, Active Directory) and SCIM provisioning.
Networking Concepts :
Understanding of network security, VPNs, VPCS, private links, VPC peering, and connectivity within cloud environments.
Monitoring & Logging Tools :
Experience with monitoring tools (e.g., Datadog, Observe, cloud-native monitoring) for platform health and performance.
Soft Skills
Problem-Solving and Troubleshooting : Ability to diagnose and resolve complex technical issues efficiently.
Communication : Excellent verbal and written communication skills to interact with technical and non-technical stakeholders.
Attention to Detail : Meticulous in configuring policies, managing access, and ensuring data integrity.
Proactive and Self-Driven : Ability to anticipate issues, recommend solutions, and continuously improve the platform.
Collaboration : Work effectively with cross-functional teams (data engineers, data scientists, security teams).
Analytical Thinking : Ability to analyze data and system logs to identify trends and discrepancies.
Create a job alert for this search
Aws Administrator • Hyderabad, Telangana, India
Related jobs
Senior Databricks Administrator -Aws
Sonatype • Hyderabad, Republic Of India, IN
The Databricks Administrator will be responsible for the overall health, security, and performance of the Databricks platform.
This includes managing user access, implementing and enforcing data gov...Show more
Last updated: 2 days ago • Promoted
AWS Database Eng
Tata Consultancy Services • Hyderabad, Telangana, India
Desired Competencies (Technical / Behavioral Competency).Should have expertise in creating data warehouses in AWS utilizing the following tools : EC2, S3, EMR, Athena, Sagemaker, Aurora and Snowflake....Show more
Last updated: 19 days ago • Promoted
Senior AWS Data Engineer_Exp : 5+ Years
Atyeti Inc • Hyderabad, Telangana, India
Required Skills & Qualifications.Bachelor’s or Master’s degree in Computer Science or equivalent experience.Application Developer or in similar software engineering roles.Python; strong SQL and clo...Show more
Last updated: 19 days ago • Promoted
Azure Databricks
Tata Consultancy Services • Hyderabad, Telangana, India
TCS has been a great pioneer in feeding the fire of young Techies like you.We are a global leader in the technology arena and there’s nothing that can stop us from growing together.Your role is of ...Show more
Last updated: 19 days ago • Promoted
Senior Aws Data Engineer_exp : 5+ Years
Atyeti Inc • Hyderabad, Republic Of India, IN
Required Skills & Qualifications.Bachelor’s or Master’s degree in Computer Science or equivalent experience.Application Developer or in similar software engineering roles.SQL and cloud-native devel...Show more
Last updated: 19 days ago • Promoted
Azure Databricks Architect
Spot Your Leaders & Consulting LLP • Hyderabad
Role : Databricks Architect - Lead architecture, design, and implementation of data lakehouse solutions using Databricks, Delta Lake, Unity Catalog, and Apache Spark.Define an...Show more
Last updated: 3 days ago • Promoted
Senior Data Engineer - Snowflake / AWS
Resourcetree • Hyderabad
About the Role : A "Senior Data Engineer" is mid-level professional leading the design, build and evolution of the inhouse data platforms.You lead the const...Show more
Last updated: 30+ days ago • Promoted
Senior Databricks Administrator -AWS
Sonatype • Hyderabad, Telangana, India
Role Summary - The Databricks Administrator will be responsible for the overall health, security, and performance of the Databricks platform.
This includes managing user access, implementing and en...Show more
Last updated: 1 day ago • Promoted
Azure Databricks Specialist
KPI Partners • Hyderabad, Telangana, India
KPI Partners are seeking highly skilled and experienced Senior Data Engineers to join our dynamic team at KPI, working on challenging and multi-year data transformation projects for our clients.Thi...Show more
Last updated: 3 days ago • Promoted
AWS Data Engineer
Tata Consultancy Services • secunderabad, telangana, in
TCS is Hiring AWS Data Engineer Bangalore location.Strong hands-on experience in Python programming and PySpark.Experience using AWS services (RedShift, Glue, EMR, S3 & Lambda).Experience working w...Show more
Last updated: 28 days ago • Promoted
AWS Data Solutions Engineer
Tata Consultancy Services • Hyderabad, Republic Of India, IN
TCS is looking for AWS data engineer.Location : Kolkata, Hyderabad, Bangalore, Chennai, Pune, Gurgaon.Strong hands-on experience with AWS Data Services : .
Amazon S3, Glue, Redshift, Athena, Kinesis, E...Show more
Last updated: 30+ days ago • Promoted
Senior Data Engineer - AWS & Python
Egen • Hyderabad, Telangana, India
Design, develop, and maintain ETL / ELT data pipelines using Python and AWS native services (Glue, Lambda, EMR, Step Functions, etc.
Build and manage data lakes and data warehouses using Amazon S3, Re...Show more
Develop and optimize data processing jobs using PySpark to handle complex data transformations and aggregations efficiently.
Design and implement robust data pipelines on the AWS platform, ensuring ...Show more
Last updated: 30+ days ago • Promoted
Data Engineer - AWS
Hirelo • Hyderabad
Responsibilities : - Design and implement data pipelines using Databricks and AWS services (e.Architect and manage the Medallion architecture (Bronze, Silver...Show more
Looking for 10+ Y / highly experienced and deeply hands-on Data Architect to lead the design, build, and optimization of our data platforms on AWS and Databricks.
This role requires a strong blend o...Show more
Detailed job description - Skill Set : .The ideal candidates will have 5+ years of experience in Data Engineering, with a strong focus on Python and SQL programming.
The role requires proficiency in l...Show more
Last updated: 30+ days ago • Promoted
AWS Data Architect
ACL Digital • Hyderabad, IN
AWS (S3, Redshift, Glue, Lake Formation, IAM).Proficient in data modeling, performance tuning, and security best practices.
.AWS Certified Solutions Architect preferred.Show more
Last updated: 10 days ago • Promoted
AWS Cloud Architect
TrueID • Hyderabad, Telangana, India
TrueID is at the forefront of digital transformation in biometric-based identity management, delivering secure and reliable digital identity solutions for fintech, banking, and e-commerce.Our team ...Show more