
Job Description


Lead - Windows Engineer

27-01-2026 11:34:12

Job_303408

5 - 10 years

  • Chennai, Tamil Nadu, India (CHN)

Core HCI Administration and Lifecycle Management

These activities focus on the central control plane, which manages the entire integrated stack.

• Cluster Initial Setup and Configuration:

  o Node Discovery and Formation: Using the vendor's specialized tool (e.g., Nutanix Foundation, VxRail Manager) to automatically detect new commodity x86 hardware nodes and logically add them to the cluster.

  o Software-Defined Storage (SDS) Configuration: Defining the global policies for the distributed data plane, including setting up the storage pools that aggregate local drives (flash/disk) across all nodes into a single logical resource.

  o Network Spine/Leaf Integration: Configuring the software layer's view of the physical network, ensuring proper VLAN tagging, trunking, and jumbo frame support across the physical NICs for high-speed inter-node communication.

• Non-Disruptive Lifecycle Management:

  o Orchestrated Upgrades (One-Click): Performing rolling upgrades of the entire stack (hypervisor, storage controller, management software, and often firmware) through a single console, which automatically migrates VMs off each node in turn, updates the node, and moves the VMs back.

  o Software and Firmware Validation: Verifying all update packages against the vendor-maintained Hardware Compatibility List (HCL) and Software Compatibility Matrix before deployment to eliminate compatibility risks.

Copyright©2025 Neurealm Private Limited. All rights reserved.

• Centralized Monitoring and Reporting:

  o Unified Health Dashboard: Monitoring all components (CPU, memory, storage latency, node health, data reduction ratios) from one interface.

  o Predictive Analytics: Utilizing built-in AI/ML features to forecast resource consumption and trigger alerts based on predicted saturation rather than just threshold breaches (e.g., predicting when the storage pool will run out of space).
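As a rough illustration of saturation-based alerting, the sketch below fits a straight line to daily storage-usage samples and extrapolates to the pool capacity. The function name, the daily sampling interval, and the linear-growth assumption are all hypothetical; vendor analytics engines use their own models.

```python
from typing import Optional, Sequence


def days_until_full(used_tb: Sequence[float], capacity_tb: float) -> Optional[float]:
    """Estimate days remaining before a storage pool fills up.

    used_tb holds one usage sample per day (oldest first). A least-squares
    line is fitted to the samples and extrapolated to capacity_tb.
    Returns None when usage is flat or shrinking (no exhaustion predicted).
    """
    n = len(used_tb)
    if n < 2:
        raise ValueError("need at least two samples")

    # Day indices are 0..n-1, so their mean is (n - 1) / 2.
    mean_x = (n - 1) / 2
    mean_y = sum(used_tb) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(used_tb))

    slope = sxy / sxx  # estimated growth in TB per day
    if slope <= 0:
        return None

    intercept = mean_y - slope * mean_x
    day_full = (capacity_tb - intercept) / slope
    # Report time remaining relative to the most recent sample.
    return max(0.0, day_full - (n - 1))


if __name__ == "__main__":
    samples = [40, 42, 44, 46, 48]  # illustrative numbers only
    print(days_until_full(samples, capacity_tb=60))
```

An alert driven by this estimate fires when the remaining days drop below a planning horizon, even while current usage is still under any fixed threshold.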


Virtual Machine (VM) & Resource Provisioning

The process of managing VMs is streamlined and integrated with the underlying storage capabilities.

• Intelligent VM Placement and Migration:

  o Initial Placement Logic: The control plane determines the optimal host node for a new VM based on factors like current resource load and Data Locality (placing the VM on the node that already holds its storage blocks for high performance).

  o Automated Load Balancing: Configuring and managing automated features that use live migration (e.g., Distributed Resource Scheduler/DRS) to continuously move VMs between nodes to ensure balanced resource consumption and avoid performance hot spots.
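The placement trade-off described above can be pictured with a toy scoring function. This is only a sketch under stated assumptions: the `Node` fields, the `pick_node` helper, and the `locality_weight` parameter are hypothetical and do not correspond to any vendor's scheduler.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    name: str
    cpu_free: float       # fraction of CPU currently free, 0.0 to 1.0
    mem_free_gb: float    # memory available for new VMs
    local_data_gb: float  # portion of the VM's data already stored on this node


def pick_node(nodes: List[Node], vm_mem_gb: float, vm_data_gb: float,
              locality_weight: float = 0.5) -> Optional[Node]:
    """Pick a host by blending spare CPU with data locality.

    Nodes without enough free memory are excluded. The remaining nodes are
    scored as a weighted sum of free CPU and the fraction of the VM's data
    that is already local. Returns None if no node can host the VM.
    """
    candidates = [n for n in nodes if n.mem_free_gb >= vm_mem_gb]
    if not candidates:
        return None

    def score(n: Node) -> float:
        locality = min(n.local_data_gb / vm_data_gb, 1.0) if vm_data_gb else 0.0
        return (1 - locality_weight) * n.cpu_free + locality_weight * locality

    return max(candidates, key=score)


if __name__ == "__main__":
    cluster = [
        Node("node-a", cpu_free=0.9, mem_free_gb=64, local_data_gb=0),
        Node("node-b", cpu_free=0.6, mem_free_gb=64, local_data_gb=100),
        Node("node-c", cpu_free=0.95, mem_free_gb=4, local_data_gb=100),
    ]
    print(pick_node(cluster, vm_mem_gb=16, vm_data_gb=100).name)
```

With equal weighting, the busier node that already holds the VM's data wins over the idle node that holds none, which is the behavior Data Locality is meant to produce.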

• Integrated Data Protection:

  o Policy-Based Snapshotting: Scheduling highly efficient, space-saving snapshots for individual VMs or protection groups directly through the hypervisor management console, with no reliance on external array snapshot tools.

  o Replication Setup: Configuring integrated replication policies to asynchronously copy VM data to a remote HCI cluster or to a cloud endpoint (DRaaS), defining Recovery Point Objective (RPO) targets per application.

• Granular Quality of Service (QoS):

  o I/O Prioritization: Setting minimum or maximum IOPS and throughput limits for specific VMs to prevent "noisy neighbors" (high I/O VMs) from monopolizing the shared storage pool, ensuring critical applications maintain performance.


Software-Defined Storage (SDS) Management

This is the most specialized area, focusing on managing the pooled, resilient storage fabric.

• Storage Resiliency and Efficiency Management:

  o Failure Domain Configuration: Defining how many concurrent node or drive failures the cluster can withstand (Fault Tolerance/Replication Factor or Erasure Coding settings).

  o Data Reduction Services: Monitoring and managing the efficiency features, including inline deduplication, compression, and thin provisioning, and reporting on the achieved capacity savings.
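The resiliency settings above trade capacity for fault tolerance, which a little arithmetic makes concrete. The helper names below are hypothetical, and the figures ignore the space real clusters reserve for rebuilds and metadata, so treat the output as an upper bound rather than a sizing guide.

```python
def replication_profile(raw_tb: float, replication_factor: int) -> dict:
    """Replication keeps `replication_factor` full copies of every block.

    The cluster survives (replication_factor - 1) concurrent failures and
    usable capacity is raw capacity divided by the number of copies.
    """
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    return {
        "failures_tolerated": replication_factor - 1,
        "usable_tb": raw_tb / replication_factor,
    }


def erasure_coding_profile(raw_tb: float, data_strips: int, parity_strips: int) -> dict:
    """Erasure coding stores `data_strips` data strips plus `parity_strips` parity strips.

    Any `parity_strips` strips can be lost, and the usable share of raw
    capacity is data_strips / (data_strips + parity_strips).
    """
    if data_strips < 1 or parity_strips < 0:
        raise ValueError("invalid strip counts")
    return {
        "failures_tolerated": parity_strips,
        "usable_tb": raw_tb * data_strips / (data_strips + parity_strips),
    }


if __name__ == "__main__":
    raw = 120  # illustrative raw capacity in TB
    print("RF2:", replication_profile(raw, 2))
    print("RF3:", replication_profile(raw, 3))
    print("EC 4+2:", erasure_coding_profile(raw, 4, 2))
```

In this example a 4+2 erasure-coding layout tolerates the same two failures as three-way replication while leaving twice the usable capacity, at the cost of extra compute during writes and rebuilds.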

• Dynamic Scaling:

  o Non-Disruptive Node Expansion: Adding or removing a node from the cluster while VMs are running, and automatically rebalancing the distributed storage and compute resources across the entire cluster.

• Integrated Storage Auditing:

  o Verifying data integrity and consistency across the distributed pool, relying on built-in software tools that continuously monitor the health of all data blocks.


Virtual Networking and Security

These tasks involve defining the network topology and security boundaries in software.

• Software-Defined Networking (SDN) Configuration:

  o Virtual Switch Management: Configuring and managing the virtual distributed switches that span all physical nodes, defining port groups, and ensuring proper uplink and failover configuration.

  o Network Isolation: Creating logical networks (often abstracted from physical VLANs) to isolate traffic between different application tiers or tenants within the cluster.

• Microsegmentation and Distributed Security:

  o Policy Creation: Implementing VM-centric firewall rules (Microsegmentation) that define who/what can talk to a VM, based on security groups or application context rather than static IP addresses.

  o Security Policy Mobility: Ensuring that security policies follow the VM during live migrations, guaranteeing consistent protection regardless of the physical host.