About the Role :
Join LeadSocs storage validation team to ensure our Enterprise / Datacenter NVMe SSD solutions meet rigorous quality, performance, and reliability standards. Youll own lab setup & calibration, build Python-based automation, execute validation plans, and distill results into actionable insights for firmware and hardware teams.
Key Responsibilities :
- Lab Setup & Calibration : Bring up, configure, and calibrate eSSD validation benches (hosts, HBAs, enclosures, power analyzers, protocol analyzers, thermal chambers).
- Test Planning & Execution : Run functional, performance, stress, endurance, and reliability tests for NVMe SSD firmware; contribute to test method improvements for enterprise / datacenter workloads.
- Automation : Develop / maintain Python scripts and frameworks (pytest / CLI tools) for test orchestration, log parsing, and report generation.
- Data Analysis : Analyze large test datasets to uncover trends, anomalies, and regressions; drive root-cause with firmware / ASIC teams.
- Reporting : Prepare clear technical reports (KPIs, pass / fail, defect trends, power / latency / throughput) and maintain validation documentation.
- Cross-Functional Coordination : Work with procurement / IT on timely equipment availability and installations; collaborate with FW, HW, and QA for release readiness.
- Compliance & Safety : Follow lab safety, EHS, and quality protocols across all validation phases.
Must-Have Qualifications :
35 years in validation / testing / instrumentation for storage or embedded systems.Hands-on with NVMe SSD validation or strong exposure to storage device testing.Solid Python scripting for test automation and data processing.Strong analytical / debug skills with logs, counters, SMART / telemetry, and performance metrics (IOPS, latency, bandwidth).Clear communication and documentation habits; ability to summarize complex results for diverse audiences.Good to Have :
Knowledge of NVMe protocol, PCIe fundamentals; exposure to SATA / SAS a plus.Experience with enterprise / datacenter test methods (QoS, tail latency, power-loss protection, thermal throttling, endurance).Familiarity with Linux host tools (fio, smartctl, nvme-cli), Jenkins / CI, Git, pytest / Robot Framework.Use of lab instruments : protocol analyzers (e.g., Teledyne / LeCroy), oscilloscopes, power meters, thermal chambers.Basics of firmware concepts (FTL, garbage collection, wear leveling, NAND characteristics).Education : B.E. / B.Tech / M.E. / M.Tech in ECE / EE / CS or equivalent.
What Success Looks Like (90 Days)
30 Days : Lab bench operational; core Python utilities running; baseline performance suite executed.60 Days : Automated nightly runs with stable reporting; first set of defects triaged with FW team.90 Days : Coverage expanded to stress / endurance; trend dashboards in place; measurable reduction in escape bugs.Interview Process
Technical Screen : Storage basics, NVMe concepts, Python scripting.Hands-On / Case Study : Log analysis or small automation task.System & Collaboration : Test strategy, lab safety, cross-team communication.Managerial Fit : Ownership, prioritization, and reporting cadence.(ref : hirist.tech)