Data Center Manager

Houston, TX, USA

Job Type

Full-time

About the Role

Denvr Dataworks is seeking a qualified individual for the role of Data Center Manager/Lead Technician. As a rapidly expanding startup, we rely on versatile team members who are comfortable wearing many hats, learn continuously, and grow alongside the company. The ideal candidate has expertise in server repair and a comprehensive understanding of GPU servers, including NVIDIA, Habana Gaudi, and AMD, with the proficiency to operate and repair them efficiently.

Our ideal candidate will champion and foster a culture of security and safety, with a keen focus on customer needs, emphasizing uptime and availability, and ensuring the data center operates at peak performance.

Responsibilities

Duties and Responsibilities: 


  • Demonstrate proficiency in standard processes and procedures for equipment preparation, installation, diagnostics, troubleshooting, replacement, and decommissioning.

  • Understand the functions of and interactions between network and server equipment.

  • Coordinate, assign, or execute break/fix tasks, providing direction as needed to ensure work is prioritized appropriately and meets KPI and SLA targets.

  • Offer break/fix hardware installation, deployment, and replacement services.

  • Assist in network installation activities and collaborate with Service teams, ODMs, and OEMs to apply advanced diagnostic and troubleshooting expertise for swift issue identification.

  • Manage third-party vendors to support the completion of work and special projects.

  • Develop and document procedures essential for the safe and effective completion of tasks by others.

  • Exercise judgment and discretion to contribute to issue tracking, follow-up, resolution, and overall service quality.

  • Communicate clearly in writing and verbally, including executive-level reporting (internal and external).


Preferred Qualifications: 


  • Associate degree in Computer Science or related field, or equivalent work experience.

  • 5+ years of experience in server systems.

  • 3+ years of leadership and management experience.

  • Strong problem-solving and software engineering skills, coupled with a commitment to high-quality work.

  • Proven experience in server architectures, CPU baseboards, and GPU technology for the productization of new GPU boards and GPU-accelerated server architectures.

  • Knowledge of server systems, including SBIOS, BMC, network, power, rack layouts, cabling, and experience with compute, storage, and GPU servers in both air- and water-cooled environments.

  • Familiarity with IPMI/SNMP/Redfish.

  • Experience with machine learning and deep learning frameworks (PyTorch, TensorFlow) and proficiency in benchmarking tools (DeepSpeed, MLPerf).


Expected Hours of Work

This is a full-time position at 40 hours per week.


Travel

Little to no travel is expected for this position.


If this is you, please send your resume to careers@denvrdata.com. We would love to hear from you and learn more about your skills and capabilities.

About the Company

Denvr Dataworks is an Alberta-based company that delivers High Performance Cloud Services (HPCaaS/PaaS/SaaS). Denvr operates first-of-its-kind, ultra-efficient, modular, liquid-immersion-cooled data centers, with high-density GPU- and CPU-based compute clusters and proprietary cloud services software. The Denvr cloud is designed for customers running data- or processor-intensive applications inherent to advanced technologies such as Artificial Intelligence, Machine Learning, Deep Neural Networks, Data Rendering, Big Data, and related Data Science applications, with seamless support for hybrid cloud and edge computing scenarios.

Joining the Denvr Dataworks team means that you are a dynamic individual who is responsible yet forward-thinking, and who is energized by continuous learning and innovation. You have practical and effective communication and interpersonal skills, you lead by example, and you are mindful of building a culture of health in all aspects of the business. You are a self-motivated and effective problem solver who takes pride in doing a good job and achieving great results. You are highly collaborative, transparent with information, and open to learning, and you enjoy learning by “doing”. You are motivated to use your knowledge, experience, relationships, and abilities to help drive an exciting business forward, and you love the idea of being part of an exceptional team that works together to compete hard in the dynamic, cutting-edge world of high-performance computing and cloud services.
