Job Description
Company: Cerebras
about the job
cerebras systems builds the world’s largest ai chip, 56 times larger than gpus. our novel wafer-scale architecture provides the ai compute power of dozens of gpus on a single chip, with the programming simplicity of a single device. this approach allows cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ml applications, without the hassle of managing hundreds of gpus or tpus. cerebras’ current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. in , we announced a multi-year, multi-million-dollar partnership with o clinic, underscoring our commitment to transforming ai applications across various fields. in , we launched cerebras inference, the fastest generative ai inference solution in the world, over 10 times faster than gpu-based hyperscale cloud inference services. about the rolewe are looking for a hands-on infrastructure engineer to join our team and support our high-performance, on-premise server and networking infrastructure. you will be responsible for maintaining, provisioning, and troubleshooting hardware and linux systems, working closely with network and system teams. this is an in-person role, ideal for someone who enjoys working across hardware, networking, and system layers.key responsibilities
- Physically install, rack, cable, and maintain blade servers and hardware components (CPUs, DIMMs, NICs, storage devices, etc
- )
- Connect servers to high-speed networks (100G/400G), verify optics/DACs, and check link status
- Configure BIOS, firmware, and out-of-band management (IPMI/iDRAC/iLO)
- Install and provision Linux OS; configure hostnames, IPs, routing, and NFS mount points
- Debug network issues at physical and OS level (VLAN, link issues, routing, etc
- )
- Use Linux tools (e
- g
- , ip, dmesg, netstat, ping) to isolate and fix issues
- Follow provisioning playbooks and maintain accurate records of assets and changes
- Use scripting (Bash, Python) to automate routine tasks and improve efficiency
- Collaborate with internal teams (network, systems, storage) and coordinate vendor RMAs
- Document procedures and contribute to team knowledge base
- Troubleshoot and replace failed server components with minimal downtime
- Qualifications 3–5 years of experience in data center, lab, or infrastructure engineering roles
- Proficient in Linux system administration and network configuration
- Strong hands-on knowledge of x86 server hardware and enterprise networking
- Familiar with BIOS configuration, firmware updates, and remote management tools
- Skilled in physical setup and troubleshooting of high-speed NICs and optical links
- Experience with VLANs, static routing, and diagnosing layer 1–3 issues
- Ability to write scripts for automation and diagnostics (Bash, Python preferred)
- Comfortable working on-site daily and lifting/moving server hardware
- Preferred Skills Experience with PXE, NFS, RAID controllers, and monitoring tools
- Familiarity with configuration management tools (e
- g
- , Ansible)
- Prior experience in a lab or R&D hardware/software environment
- This is a unique opportunity to work with cutting-edge infrastructure and grow into more senior technical roles
- If you enjoy bridging hardware and software with hands-on work, we’d love to hear from you
- Why Join Cerebras People who are serious about software make their own hardware
- At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry
- With dozens of model releases and rapid growth, we’ve reached an inflection point in our business
- Members of our team tell us there are five main reasons they joined Cerebras: Build a breakthrough AI platform beyond the constraints of the GPU
- Publish and open source their cutting-edge AI research
- Work on one of the fastest AI supercomputers in the world
- Enjoy job stability with startup vitality
- Our simple, non-corporate work culture that respects individual beliefs
- Read our blog: We celebrate different backgrounds, perspectives, and skills
- We believe inclusive teams build better products and companies
- We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them
- This website or its third-party tools process personal data
- For more details, click to review our CCPA disclosure notice
- ~