nPloy лого

Performance Engineer

Лого на OpenAI

OpenAI

On-site

On-site

Постоянен трудов договор

7 - 15 years of experience

Full Time

San Francisco, United States

Описание

About the Team

We bring OpenAI's technology to the world through products like ChatGPT and the OpenAI API.

We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

About the Role

OpenAI is looking for an experienced Performance Engineer to help us scale the performance, reliability, and efficiency of our systems. In this role, you'll apply deep technical expertise to optimize infrastructure and application-level performance across mission-critical products like ChatGPT and our developer API. You’ll work cross-functionally with teams building core services, training models, and developing real-time user experiences to push our latency, throughput, and cost-efficiency to the next level.

We are looking for engineers who thrive in ambiguous environments, value deep systems understanding, and are motivated by delivering measurable impact. This is a highly technical, individual contributor role focused on root-cause analysis, profiling, instrumentation, and architecture-level performance improvements across our stack.

In this role, you will:

  • Analyze and optimize performance across application, middleware, runtime, and infrastructure layers—networking, storage, Python runtime, GPU utilization, and beyond.

  • Develop tooling and metrics that provide deep observability into system performance.

  • Collaborate closely with infra, platform, training, and product teams to identify key performance goals and drive systemic improvements.

  • Influence architecture and design decisions to prioritize latency, throughput, and efficiency at scale.

  • Lead investigations into high-impact performance regressions or scalability issues in production.

  • Drive performance testing strategies and help define SLAs/SLOs around latency and throughput for critical systems.

You might thrive in this role if you:

  • Have 7+ years of experience in software engineering with a strong track record in performance or reliability of high-scale distributed systems.

  • Are deeply comfortable with performance profiling tools and tracing systems.

  • Have experience optimizing performance across one or more layers of the stack (e.g., database, networking, storage, application runtime, GC tuning, Python/Golang internals, GPU utilization).

  • Have a strong understanding of OS internals, scheduling, memory management, and IO patterns.

  • Have contributed to observability, benchmarking, or performance-focused infrastructure at scale.

  • Have demonstrated success navigating ambiguity and aligning stakeholders around performance goals.

  • Value simplicity, rigor, and collaboration when solving complex systems problems.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Необходими умения

Design Patterns
Development Tools
Infrastructure
Metrics
Networking
Operating Systems
Performance Testing
Performance Tuning
Python
Reliability
Root Cause Analysis
Scheduling
Storage
Time Management
GoLang
Distributed Systems
Observability
SLA
Database Support
Analytics
optimization
financial instruments
Strong understanding of algorithms data structures and performance optimisation techniques
English
Обявата е публикувана днес

или

за да кандидатстваш.