Member of Technical Staff - Inference
xAI
Infrastructure | Palo Alto, CA
<div class="content-intro"><h3><strong><span style="font-family: arial, helvetica, sans-serif;">About xAI</span></strong></h3>
<p><span style="font-family: arial, helvetica, sans-serif;">xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. </span><span style="font-family: arial, helvetica, sans-serif;">Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. </span><span style="font-family: arial, helvetica, sans-serif;">We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. </span><span style="font-family: arial, helvetica, sans-serif;">All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.</span></p></div><h3>RESPONSIBILITIES:</h3>
<ul>
<li>Optimizing the latency and throughput of model inference.</li>
<li>Building reliable and performant production serving systems to serve billions of users.</li>
<li>Accelerating research on scaling test-time compute and rollout in reinforcement learning training.</li>
<li>Co-designing models and hardware for next-generation architectures.</li>
</ul>
<h3>BASIC QUALIFICATIONS:</h3>
<ul>
<li>Worked on system optimizations for model serving, such as batching, caching, load balancing, and parallelism.</li>
<li>Worked on low-level optimizations for inference, such as GPU kernels and code generation.</li>
<li>Worked on algorithmic optimizations for inference, such as quantization, distillation, speculative decoding, and low-precision numerics.</li>
<li>Worked on large-scale inference engines or reinforcement learning frameworks.</li>
<li>Worked on large-scale, high-concurrency production serving.</li>
<li>Worked on testing, benchmarking, and reliability of inference services.</li>
</ul>
<h3>COMPENSATION AND BENEFITS:</h3>
<p>$180,000 - $440,000 USD</p>
<p>Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, and various other discounts and perks.</p><div class="content-conclusion"><p><em>xAI is an equal opportunity employer. For details on data processing, view our </em><em><a href="https://x.ai/legal/recruitment-privacy-notice" target="_blank">Recruitment Privacy Notice</a>.</em></p></div>