Senior Data Center Hardware Engineer
Tehran | Engineering | Full-time
About Us
We are an established company with over eight years of experience providing exceptional cloud computing and data center solutions. Our Data Center team is responsible for designing, deploying, and maintaining a robust hardware infrastructure that underpins our critical services and internal operations. As a Senior Hardware Engineer, you will play a pivotal role in ensuring the reliability and availability of our data center hardware while collaborating with various internal teams to troubleshoot hardware issues and contribute to future innovations.
Position Overview
As a Senior Data Center Hardware Engineer، you will oversee the management, maintenance, and optimization of hardware infrastructure within our data centers. Your role will focus on ensuring high availability and quality of server hardware, troubleshooting issues, and driving continuous improvements in the data center's hardware performance. You will also be responsible for managing the physical infrastructure within the data center, ensuring secure and compliant installations.
Key Responsibilities
- Monitor & Troubleshoot: Proactively monitor hardware health and detect, investigate, and resolve issues in the data center's server hardware.
- Preventive & Corrective Actions: Execute corrective and preventive measures to ensure uninterrupted service and hardware reliability.
- Vendor Management: Collaborate with hardware vendors and manufacturers, following up on support and SLA contracts until issues are resolved.
- Firmware & Hardware Testing: Qualify, test, validate, and manage firmware stack updates and maintain hardware-related documentation.
- Cross-team Collaboration: Work closely with product, customer support, infrastructure, network, and sales teams to resolve hardware-related challenges.
- Technical Documentation: Create and maintain detailed documentation of technical procedures, action plans, and troubleshooting steps.
- Physical Infrastructure Management: Install and manage racks, shelving, power strips, rails, and cabling within the data center. Ensure all installations are secure, properly labeled, and meet industry standards.
- Continuous Learning: Stay informed on the latest technology trends to enhance the hardware landscape in the data center.
Requirements
- Expertise in Data Center Hardware: Deep knowledge and hands-on experience with data center hardware including servers, storage systems, network equipment, and other large-scale hardware components.
- Installation & Maintenance: Proficient in installing, racking, and maintaining hardware such as servers, switches, and routers.
- Physical Infrastructure Experience: Skilled in installing and managing racks, power strips, shelving, rails, and cabling, ensuring proper configuration and compliance with industry standards.
- Troubleshooting Skills: Strong problem-solving abilities in diagnosing and repairing hardware issues, including memory replacements, storage media failures, and server breakdowns.
- Firmware & Server Admin Skills: Expertise in server administration, including firmware updates, UEFI, BMC, PXE, and IPMI protocols.
- Collaboration & Communication: Ability to work collaboratively with cross-functional teams and document technical knowledge effectively.
- Incident Management: Experience managing hardware incidents and ensuring timely resolution, adhering to best practices.
- Self-motivated: Strong initiative to drive improvements within the data center and a proactive approach to solving challenges.
Preferred Qualifications
- Cloud & Data Center Enthusiast: Passion for cloud computing, hardware, and data center infrastructure.
- Scripting & Automation Skills: Experience with Python, BASH, and DevOps automation tools like Ansible, and Git.
- Linux System Administration: Solid understanding of Linux environments and container technologies like Docker.
- Monitoring & Observability: Familiarity with monitoring tools such as Prometheus, Grafana, and Alertmanager.
- Vendor Experience: Hands-on experience with leading vendors such as HPE for servers, storage, and network equipment.
Our Technical Stack
- Hardware: Large-scale environment with a mix of HPE servers, storage, and networking equipment.
- Tools & Platforms: Linux administration, scripting languages (Python/BASH), Automation (Ansible).
- Observability & Monitoring: Prometheus, Grafana, Alertmanager, and SNMP for monitoring and observability.
Benefits
- Supplementary health insurance for you and your family (supports most treatments, including psychotherapy).
- Competitive salary with regular promotion opportunities.
- Reimbursement for educational courses, internet, and even programs for self-development. (like art classes or learning a new language, etc.)
- Flexible working hours, including remote work opportunitties.
- An exciting work environment with talented colleagues, cultural diversity, with an open environment for new ideas.
- We provide everything you need to work comfortably, such as laptops, equipment for remote work, etc.
- Launch and various on-site meals and snacks.