Nvidia Corporation
Santa Clara, CA
NVIDIA is looking for a Deep Learning Architect to join our team working at the cutting edge of AI infrastructure. As agentic LLM workloads reshape the demands placed on modern datacenters, we need engineers who can model, simulate, and reason about complex system-level traffic at scale. If you have a passion for performance analysis, a strong quantitative foundation, and excitement about the future of AI systems, we'd love to talk. In this role, you will build and run simulations that capture the traffic dynamics of agentic AI workloads, mine the results for actionable insights, and help guide architectural decisions for next-generation datacenter and GPU systems. What you'll be doing: Develop and extend C++ and Python simulators that model system-level network and compute traffic for agentic LLM workloads in datacenter environments Characterize real-world LLM serving workloads and distill them into representative simulator inputs Run simulations at scale and apply...