across our infrastructure stack, including network fabric, host networking, communication libraries, and scheduling infrastructure. AI/HPC... Network Engineer Responsibilities Design, develop, test and operate networking systems to support large scale AI training...
of AI/HPC hardware requirements and specifications (e.g., configuring hardware components, GPU, memory, network for AI/HPC...Meta is seeking an experienced software engineer to join our Accelerator Solutions & Technologies group, supporting the...
Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting.... Hardware Systems Engineer, NPI AI Responsibilities Lead the bring-up, validation, and deployment of cutting-edge hardware...
them into production? Then a role on one of our network engineering teams is for you! Software Engineer - Host networking.... Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and centralized...
-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and is on the.... Large-Scale GenAI/LLM training) from the trainer down to the inter-GPU and network communication layer. And we are seeking...