Keywords: Senior System Reliability Engineer, Location: Santa Clara, CA

Senior System Reliability Engineer

efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing... Reliability Engineering team, involved in NVIDIA's diverse system product range specifically Graphics and High-Performance...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Nov 2024

Senior Reliability Engineer

-level performance solutions for these meaningful changes in automotive industry. We are now looking for a Senior Reliability... system, and graphics and computing power is redefining the automobile. NVIDIA is at the forefront, providing high-end to mid...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 21 Nov 2024

Senior Site Reliability Engineer - DGX Cloud

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale... at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 12 Jan 2025

Senior Site Reliability Engineer - Cloud

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale... at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 12 Jan 2025

Senior Site Reliability Engineer, AI Infrastructure

What You Will Be Doing: Develop and maintain large-scale systems supporting critical use cases for AI Infrastructure, driving reliability.... Build tools and frameworks to improve observability, define actionable reliability metrics, and enable fast issue resolution...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 11 Jan 2025

Senior Site Reliability Engineer - Observability and Telemetry Platform

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale... at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Jan 2025

Senior Site Reliability Engineer - GPU Clusters

Engineer to lead the design, deployment, and management of our large-scale GPU clusters. These clusters will power AI workloads... - like GB200 - and cloud technologies to improve system performance. What we need to see: Minimum BS degree in Computer Science...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 13 Nov 2024

Principal Site Reliability Engineer (Cortex Cloud Security Posture Management)

into our systems’ performance and health. Your Impact As a Senior Staff SRE with the Cortex Cloud Security Posture Management team... of system issues and minimal impact on services Automation - Automate complex monitoring and alerting tasks by building tools...

Location: Santa Clara, CA
Posted Date: 15 Jan 2025
Salary: $147,000 - $237,500 per year

Sr Staff Site Reliability Engineer (Cortex Data Lake)

Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the CDL/SLS... as an engineer in Infrastructure, Operations, DevOps, or System Engineering 3+ years building high availability, scalable cloud...

Location: Santa Clara, CA
Posted Date: 14 Dec 2024
Salary: $126,000 - $203,500 per year

IC Reliability Engineer

and operations teams to create board and system reliability test plans for various products such as GPU, Server, Automotive, Gaming... world. Join us at the forefront of technological advancement. What you will be doing: Reliability (DfR) qualification...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Nov 2024
Salary: $124,000 - $195,500 per year

Senior Software Engineer - Data Center System Bringup

systems for GPU accelerated applications, such as Deep Learning. As a Senior Software Engineer - Data Center System Bringup... server systems like HGX, DGX and MGX. We are looking for Senior Firmware / System Software engineers who would closely work...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 02 Jan 2025

Senior Research Engineer, Robotics Systems

NVIDIA is searching for a senior or principal engineer who specializes in robotics systems in the Generalist Embodied... robot maintenance, diagnostics, and troubleshooting to ensure system reliability; Monitor teleoperators at the lab...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 07 Dec 2024

Senior System Level Product Engineer

post-silicon Senior System Level Product Engineer who is passionate and committed to making a difference in the world.... You will cover products in all business units that Nvidia provides solutions for. Prior experience in the lab with system level post...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Nov 2024

Senior System Software Engineer – DRIVE SIM Platform Software

is searching for a system software engineer to participate in creating and expanding AV simulation platforms. You will participate... AV simulation platforms including: Hardware and software system integration and bring-up Containerized software development...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 27 Nov 2024

Senior System Software Engineer - GPU and SoC

debugging is invaluable Experience working on system level reliability and resiliency features. Familiarity with system... on all aspects of SOC and system, and technology verticals. As someone who is hardworking and passionate about their craft...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 27 Nov 2024

Process Ultra-Pure Water Utilities Systems Engineer - New College Grad (Bachelor's)

infrastructure. The position is a Process-UPW-Utilities Systems Engineer supporting tool installs, new factory upgrades, and site... Engineer Interprets internal or external business issues and recommends best practices. Solves complex problems; takes...

Location: Santa Clara, CA
Posted Date: 18 Jan 2025
Salary: $80,000 - $110,000 per year

Systems Development Engineer, Amazon Elastic VMware Service (EVS)

Systems development engineer and a self-starter who is excited to build something new and work at cloud scale? If the answer.... We are looking for a Systems Development Engineer to build new capabilities to help customers run VMware-based workloads on AWS. The AWS Commercial...

Company: Amazon
Location: Santa Clara, CA
Posted Date: 04 Jan 2025
Salary: $116,300 per year

Senior Circuit Design Engineer - Power Modeling and Simulation

We are now looking for a motivated Senior Circuit Design Engineer in Power Modeling and Simulation to join our dynamic... level. Perform detailed block-level and system-level simulations to ensure good reliability, performance, and stability...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Jan 2025