Principal Cloud Consultant
This role has been designed as ‘’Onsite’ with an expectation that you will primarily work from an HPE office.
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.
Job Description:
A Principal Consultant within the HPE GTS Private Cloud AI Center of Excellence is responsible for delivering advanced technical leadership and white-glove support to enterprise and mission-critical customers leveraging HPE PCAI solutions. The role requires deep expertise in NVIDIA GPUs, CUDA programming, and the broader NVIDIA AI software stack, along with a solid understanding of container platforms and underlying infrastructure.
The Principal Consultant works closely with data scientists, software engineers, and product teams to enable efficient execution of AI workflows and high-performance computing workloads. Acting as a technical leader, they take end-to-end ownership of complex customer issues, collaborating with multiple HPE engineering teams to drive timely and sustainable resolutions.
In addition to customer-facing responsibilities, the role includes driving technical readiness across the CoE, representing the support organization in leadership forums, and providing strategic input into the product and services roadmap. This includes influencing design and operational decisions to improve supportability, scalability, and overall customer experience.
Key responsibilities include.
- Advanced Issue Resolution: Drive complex problem management and resolution by leveraging deep expertise across multiple technology domains.
- Executive & Expert Engagement: Interact seamlessly with customer senior executives and HPE business unit technology experts, demonstrating exceptional communication and stakeholder management.
- Analytical Problem-Solving: Apply advanced analytical skills to identify, dissect, and creatively solve challenging technical problems.
- Cross-Functional Collaboration: Work closely with customers, partners, and internal teams to translate business challenges into robust technical solutions.
- Trusted Advisor Role: Serve as a strategic technical advisor in high-impact customer engagements, executive briefings, and key strategic initiatives.
- Assess and appreciate technical issues and customers' business impact, and manage technical communication with customers and other stakeholders.
- Providing leadership in complex technical problem management, working closely with end customers and HPE remote and field support staff
- Identifying and resolving customer issues, particularly with NVIDIA GPUs and related infrastructure components critical to AI processing.
- Work closely with stakeholders to identify opportunities for enhancing product capabilities, drive targeted improvements, and influence future roadmap decisions.
- Troubleshoot and optimize AI applications and infrastructure for maximum efficiency and minimal downtime.
- Fault isolation, Problem reproduction, interacting with the engineering teams, QA, development engineers, and Escalation management
- Technical Readiness Leadership: Lead NPI/NSI technical readiness initiatives to ensure the team is prepared for new solutions and innovations.
- Mentorship & Team Development: Mentor junior engineers, foster knowledge transfer, and inspire high-performing teams to scale HPE’s cloud business for the future.
- Development of knowledge content and runbooks
Knowledge & Skills Required
AI/ML Skills
- Good understanding of AI/ML and Analytics applications such as
- Kubeflow & MLflow
- Apache Sparck and Superset
- NVIDIA AI Enterprise NIM Microservices, Models
- NVIDIA Neural Modules (NeMo)
- Excellent knowledge on below platform components
- Linux operating system (RHEL 8/Rocky)
- Kubernetes, container runtimes and Container networking
- Ezmeral-specific Kubernetes: ezkube, ezfab etc.
- Morpheus software, Morpheus Virtual Machines
- Single Sign-on and IAM
- Postgres database
- Helm, Istio and Spire
- Storage and CSI
NVIDIA GPU, NVIDIA AI and related software’s
- Good Knowledge of GPU technologies, NVIDIA GPU operator, NVIDIA vGPU technology
- Strong GPU Understanding and troubleshooting skills at the HW, OS, SW and Application layers.
- Experience with NVIDIA SDKs (e.g., DeepStream, Jetson, etc.) and GPU performance tuning.
- Familiarity with NVIDIA’s AI software stack
- Experience with cloud platforms such as AWS, Azure, or Google Cloud for NVIDIA GPU-based AI model deployment.
- performance profiling, tuning, and optimization of AI applications on NVIDIA GPUs.
OS, Networking & Virtualization
- Excellent understanding of
- Redhat / Ubuntu Linux
- Linux clustering and Virtualization
- NFS storage configuration and troubleshooting would be desired
- Other skills
- Good knowledge and hands-on experience with at least two various Linux distributions like RHEL, SLES, Ubuntu, and Debian.
- Advanced knowledge of microservices architectures like databases, message queues, indexing and Java applications
- Ability to read/write complex MySQL queries
- Ability to read/write non-trivial scripts and programs in two or more of the following: Bash, PowerShell, Python, Java, Groovy, JavaScript
- Strong ability to read/write basic templates in one or more of the following: CloudFormation, Terraform, Ansible
- Networking troubleshooting skills including IPv4, load balancers, and proxies
- Working knowledge of on-premises hypervisor technologies like VMware vSphere, OpenStack, or KVM
- Knowledge and experience with Linux System Administration, package management, scheduling, boot procedures/troubleshooting, performance optimization, and networking concepts.
- Windows AD administration (user management for EZ authentication integration)
- IPV6 + SLAAC
Common skills and qualifications
- Education: A bachelor's or master's degree in computer science, information technology, or a related field is preferred.
- Problem-Solving Skills: Excellent problem-solving skills and the ability to diagnose and resolve complex technical issues.
- Communication Skills: Effective communication skills to collaborate with other teams, including development, security, and compliance teams.
- Collaboration Skills: The ability to work effectively in a team environment and to coordinate efforts with other teams to resolve issues and implement new solutions. IT Service Management Experience: Familiarity with IT service management (ITSM) frameworks, such as ITIL, and experience with incident, problem, and change management processes.
Accountability, Accountability, Action Planning, Active Learning, Active Listening, Bias, Business Growth, Business Planning, Cloud Computing, Cloud Migrations, Coaching, Commercial Acumen, Creativity, Critical Thinking, Cross-Functional Teamwork, Customer Experience Strategy, Data Analysis Management, Data Collection Management (Inactive), Data Controls, Design Thinking, Empathy, Follow-Through, Growth Mindset, Hybrid Clouds, Infrastructure as a Service (IaaS) {+ 10 more}
Health & Wellbeing
We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
Personal & Professional Development
We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.
Unconditional Inclusion
We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.
Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.
Job:
Services
Job Level:
TCP_05
HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.
Hewlett Packard Enterprise is EEO Protected Veteran/ Individual with Disabilities.
HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.
No Fees Notice & Recruitment Fraud Disclaimer
It has come to HPE’s attention that there has been an increase in recruitment fraud whereby scammer impersonate HPE or HPE-authorized recruiting agencies and offer fake employment opportunities to candidates. These scammers often seek to obtain personal information or money from candidates.
Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process. The credentials of any hiring agency that claims to be working with HPE for recruitment of talent should be verified by candidates and candidates shall be solely responsible to conduct such verification. Any candidate/individual who relies on the erroneous representations made by fraudulent employment agencies does so at their own risk, and HPE disclaims liability for any damages or claims that may result from any such communication.