What It Takes to Build AI Infrastructure at Baseten

Baseten is a developer-first AI infrastructure company that provides the underlying framework (the "rails") for machine learning teams to move models from prototype to high-scale production. Having recently raised a $150 million Series D from investors including Bond, IVP, and Spark Capital, the company is rapidly expanding its workforce. For professionals looking to enter the AI sector, Baseten offers a high-impact environment focused on one of the industry's hardest problems: cost-effective, low-latency model inference.

Operating primarily out of San Francisco and New York with a remote-friendly culture, Baseten powers mission-critical operations for major AI-native companies including Cursor, Notion, and Writer. Careers here are characterized by high technical rigor, a builder-heavy mentality, and the opportunity to work at the absolute frontier of GPU optimization and cloud orchestration.

The Mission of Baseten in the AI Ecosystem

To understand why a career at Baseten is highly sought after, one must look at the current state of the artificial intelligence market. While much of the early AI boom focused on training large language models (LLMs), the industry has shifted toward inference—the process of actually running those models to serve user requests.

Inference is notoriously expensive and technically complex. Companies face constant trade-offs between speed (latency), volume (throughput), and cost. Baseten specializes in navigating these trade-offs. By providing scalable GPU infrastructure and optimized software stacks, they allow engineering teams to deploy models instantly without managing the underlying hardware complexities. For a prospective employee, this means working on products that directly influence the unit economics of the world's most popular AI applications.
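
To make the trade-off concrete, the sketch below shows how throughput drives unit cost. The GPU price and the tokens-per-second figures are illustrative assumptions only, not Baseten pricing or benchmarks.

```python
def cost_per_million_tokens(gpu_hourly_usd: float, tokens_per_second: float) -> float:
    """USD cost to generate one million tokens on a single, fully utilized GPU."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return gpu_hourly_usd / tokens_per_hour * 1_000_000


# Hypothetical numbers: a $4.00/hour GPU serving at two different batch sizes.
# Larger batches raise total throughput (lower cost per token),
# but each individual request typically waits longer (higher latency).
scenarios = {
    "batch=1 (lowest latency)": 100.0,    # tokens/s, assumed
    "batch=32 (higher latency)": 2000.0,  # tokens/s, assumed
}

for name, tokens_per_second in scenarios.items():
    cost = cost_per_million_tokens(4.00, tokens_per_second)
    print(f"{name}: ${cost:.2f} per 1M tokens")
```

With these assumed figures, the batched configuration is about twenty times cheaper per token, which is why batching and scheduling decisions have such a direct effect on a customer's unit economics.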

Technical Engineering Roles at Baseten

Engineering is the heart of Baseten. The company hires across several specialized sub-disciplines, each requiring a distinct set of skills and a deep understanding of the AI development lifecycle.

Model Performance and GPU Kernel Engineering

This is perhaps the most technically demanding area within Baseten. As AI models become larger and more complex, standard software abstractions often fail to extract maximum performance from NVIDIA hardware.

Engineers in this department focus on:

GPU Kernel Development: Writing high-performance kernels for key operations such as attention mechanisms and mixture-of-experts (MoE) routing.
Optimization Techniques: Implementing advanced methods such as FP8 and FP4 quantization and sparsity to shrink the compute and memory footprint of LLMs, along with memory coalescing to make better use of GPU memory bandwidth.
Hardware-Software Co-design: Using tools like CUDA, Triton, and PTX assembly to optimize every microsecond of computation.

Candidates for these roles typically need a strong background in C++ and a profound understanding of GPU architecture, including memory hierarchies (L1/L2 cache, shared memory) and synchronization techniques.
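
The idea behind the quantization work mentioned above can be sketched in a few lines. This example uses symmetric int8 quantization rather than FP8 or FP4, purely because integers are easy to show in plain Python; the weights are made-up values, and production kernels do this on the GPU with per-channel or per-block scales.

```python
def quantize_int8(weights: list[float]) -> tuple[list[int], float]:
    """Symmetric per-tensor quantization: map floats onto integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized: list[int], scale: float) -> list[float]:
    """Recover approximate float values from the quantized integers."""
    return [q * scale for q in quantized]


# Hypothetical weights, chosen only for illustration.
weights = [0.5, -1.27, 0.03, 1.0]
quantized, scale = quantize_int8(weights)
restored = dequantize(quantized, scale)

# Rounding error is bounded by half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(quantized, scale, max_error)
```

Each weight now occupies one byte instead of two or four, at the price of a small, bounded rounding error. Lower-precision formats push the same trade further.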

Infrastructure and Cloud Platform Engineering

Baseten’s infrastructure team builds the backbone of the entire platform. If the kernels are the engine, the infrastructure is the vehicle. This team ensures that thousands of GPUs can be orchestrated seamlessly to handle billions of requests.

Key responsibilities include:

Kubernetes and Orchestration: Managing large-scale clusters that host diverse AI models.
Internal Tooling: Building CI/CD pipelines and developer environments that allow for rapid iteration.
Reliability and Scaling: Ensuring 99.9% uptime for customers like Notion and Cursor, who rely on Baseten for their core product features.
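
An uptime target translates directly into a downtime budget. The short calculation below is a minimal sketch of that arithmetic; the 30-day window and the 99.99% comparison point are assumptions added for illustration.

```python
def downtime_budget_minutes(availability_pct: float, period_days: float) -> float:
    """Minutes of downtime allowed in a period at a given availability target."""
    total_minutes = period_days * 24 * 60
    return total_minutes * (1 - availability_pct / 100)


# 99.9% comes from the text above; 99.99% is shown only for comparison.
for target in (99.9, 99.99):
    budget = downtime_budget_minutes(target, period_days=30)
    print(f"{target}% availability: {budget:.1f} minutes of downtime per 30 days")
```

At 99.9%, the budget is roughly 43 minutes per 30 days, which leaves little room for slow rollbacks or manual recovery.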

The tech stack here is heavily centered around Go, Python, Kubernetes, Docker, and Terraform. There is a strong emphasis on "developer tool culture," where engineers treat their internal systems with the same level of polish as the external product.

Model APIs and Core Product

This team focuses on the developer experience (DX). They build the abstractions and interfaces that allow a machine learning engineer to deploy a model with a single command.

Truss Development: Baseten maintains an open-source model packaging framework called Truss. Engineers work on making this tool the industry standard for model deployment.
System Design: Creating elegant APIs that hide the complexity of GPU scheduling and cold starts from the end user.
Full-Stack Challenges: Building the dashboards and management consoles that give users visibility into their model performance and spending.
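
To give a feel for the kind of abstraction involved, the sketch below mimics the load/predict class shape that model-packaging frameworks such as Truss are built around. It deliberately does not import Truss, the keyword-counting "model" is a hypothetical stand-in, and the exact interface should be checked against the Truss documentation.

```python
class Model:
    """Illustrative model wrapper with separate startup and per-request paths."""

    def __init__(self, **kwargs):
        # Configuration would normally arrive via kwargs; unused in this sketch.
        self._model = None

    def load(self):
        # Runs once at startup. A real deployment would load weights onto the
        # GPU here, which is the expensive step behind a "cold start".
        positive_words = {"fast", "reliable", "great"}
        self._model = lambda text: sum(
            word in positive_words for word in text.lower().split()
        )

    def predict(self, model_input: dict) -> dict:
        # Runs once per request, so this path should stay cheap.
        if self._model is None:
            raise RuntimeError("load() must be called before predict()")
        return {"positive_word_count": self._model(model_input["text"])}


model = Model()
model.load()
result = model.predict({"text": "Fast and reliable inference"})
print(result)
```

Separating the one-time load step from the per-request predict step is what lets a platform manage cold starts and scheduling without the model author having to think about them.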

The Forward Deployed Engineering Model

One unique aspect of Baseten’s career structure is the Forward Deployed Engineering (FDE) team. Unlike traditional sales engineering or support roles, FDEs at Baseten are high-level ML inference specialists who work directly inside the customer’s stack.

When a company like Gamma partners with Baseten, FDEs act as an extension of the customer’s engineering team. They help optimize specific model architectures, solve unique deployment bottlenecks, and ensure that the integration between the client's application and Baseten’s infrastructure is flawless.

For engineers who enjoy the intersection of deep technical problem-solving and direct customer impact, the FDE role offers a rare opportunity to see how the world’s most advanced AI products are built from the inside.

Non-Engineering Careers at Baseten

While Baseten is an engineering-first company, its rapid growth after a $150M Series D requires a robust Go-To-Market (GTM) and Operations engine.

Sales and Go-To-Market

Baseten utilizes a Product-Led Growth (PLG) strategy supplemented by strategic enterprise sales. Roles in this department include:

Account Executives (AI Native): Specializing in selling to the new wave of AI-first startups.
Sales Development Representatives (SDRs): Identifying high-potential leads in the rapidly evolving AI landscape.
Product Marketing: Translating complex technical features into value propositions for both developers and C-suite executives.

People, Finance, and Operations

As the company scales from 100 to 200+ employees, the internal systems must evolve.

Technical and Business Recruiting: Finding the "one-percenters" in the engineering world is a core strategic priority.
Revenue and Corporate Finance: Managing the complex economics of GPU procurement and cloud credit margins.
Operations Specialists: Implementing the policies and systems that maintain a high-performance culture across distributed teams.

Compensation and Benefits at Baseten

Baseten is known for offering highly competitive compensation packages that reflect the high bar for talent. Based on recent hiring data, the salary ranges for various roles are significantly above industry averages.

Salary Ranges (Estimated by Department)

Engineering (Entry to Senior): Salaries typically range from $150,000 to $275,000, with specialized roles like GPU Kernel Engineers or Engineering Managers reaching $300,000+.
Product Management & Design: Compensation ranges from $175,000 to $275,000, depending on seniority and specific focus area (e.g., Infrastructure vs. Core Product).
Sales and GTM: Account Executives often see base salaries between $180,000 and $230,000, with an on-target earnings (OTE) structure that includes a 50/50 split between base and variable compensation.
Operations and Recruiting: Roles range from $80,000 (Coordinator) to $210,000 (Senior Technical Recruiter).

Equity and Benefits

Beyond base salary, Baseten offers a comprehensive benefits package designed to attract and retain top-tier talent:

Equity: Meaningful stock option grants in a high-growth, Series D company.
Health Coverage: 100% coverage of medical, dental, and vision insurance for both the employee and their dependents.
Time Off: A generous unlimited PTO policy and a company-wide winter break from Christmas Eve to New Year’s Day.
Work Style: A hybrid model with offices in San Francisco and New York, but with a strong "async-first" communication culture and support for remote work in many roles.

What Baseten Looks for in Candidates

The hiring process at Baseten is designed to identify "builders"—people who are not just comfortable with code, but who are passionate about creating tools that other developers will love.

The "Developer-First" Mentality

Whether you are applying for a sales role or a kernel engineering role, you must understand the developer persona. Baseten's customers are engineers. They value speed, reliability, and transparency. Candidates who can demonstrate a history of contributing to open-source projects or building their own developer tools often stand out.

Technical Proficiency

For engineering roles, the expectations are high:

Language Mastery: High proficiency in Go or Python is standard. For performance roles, C++ and CUDA are non-negotiable.
Systems Thinking: An understanding of how data moves through a system—from the API call down to the GPU memory access patterns.
Problem Solving: The ability to debug complex, distributed systems under pressure.

Cultural Fit: Speed and Reliability

Baseten operates in a market where speed is the primary competitive advantage. The company values individuals who can ship fast without breaking core infrastructure. The culture is "builder-heavy," meaning there is very little room for pure management roles; even managers are often expected to stay close to the technical details.

How to Prepare for the Baseten Interview

If you are considering applying for a role at Baseten, there are several steps you can take to increase your chances of success.

Explore the Open Source Stack

Baseten is deeply involved in the open-source community. The best way to understand their technical philosophy is to explore Truss.

Analyze the Repo: Look at how Truss handles model packaging and serving.
Contribute: Submitting a pull request or improving documentation is a powerful signal of your interest and capability.
Deploy a Model: Use the Baseten platform to deploy an open-source model (like Llama 3 or Stable Diffusion). Understanding the user's journey will give you a significant advantage during the interview.

Understand the Business Model

Baseten is a PLG (Product-Led Growth) company. Familiarize yourself with how they attract individual developers and eventually convert them into enterprise accounts. Reading their technical blog and case studies (e.g., how they helped Gamma or Cursor) will provide context for your discussions with the hiring team.

Focus on Foundations

For technical interviews, don't just study LeetCode. Review the fundamentals of:

GPU architectures and how they differ from CPUs.
Kubernetes networking and storage.
The math behind LLM inference (quantization, KV caching, etc.).
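
As an example of the inference math worth knowing, the sketch below estimates the memory a KV cache needs. The model dimensions are assumptions chosen to resemble an 8B-parameter model with grouped-query attention; substitute the real configuration of whatever model you are studying.

```python
def kv_cache_bytes(
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    seq_len: int,
    batch_size: int,
    bytes_per_value: int,
) -> int:
    """Memory needed to cache keys and values for every token in every layer."""
    # Factor of 2: one key tensor and one value tensor per layer.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * batch_size * bytes_per_value


# Assumed model shape (hypothetical, for illustration only).
config = dict(n_layers=32, n_kv_heads=8, head_dim=128)

fp16_bytes = kv_cache_bytes(**config, seq_len=8192, batch_size=1, bytes_per_value=2)
fp8_bytes = kv_cache_bytes(**config, seq_len=8192, batch_size=1, bytes_per_value=1)

print(f"16-bit cache: {fp16_bytes / 2**30:.2f} GiB per 8192-token sequence")
print(f"8-bit cache:  {fp8_bytes / 2**30:.2f} GiB per 8192-token sequence")
```

Because the cache grows linearly with both sequence length and batch size, it often limits how many concurrent requests fit on one GPU. Halving the bytes per value doubles that capacity, which ties quantization and KV caching together.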

Why Work at Baseten Now?

Joining Baseten in its post-Series D phase offers a unique "sweet spot" in a career trajectory. The company has clear product-market fit and significant capital, reducing the early-stage startup risk. However, it is still small enough that an individual contributor can have a massive impact on the product's direction and the company's culture.

As AI continues to move from "hype" to "utility," the companies that control the infrastructure for inference will be the most valuable players in the ecosystem. Baseten is positioned at the very center of this transition.

Frequently Asked Questions

Does Baseten offer visa sponsorship?

Yes, Baseten has been known to provide visa sponsorship for high-impact technical roles, though this is often evaluated on a case-by-case basis depending on the candidate's location and role.

Is Baseten fully remote?

Baseten is remote-friendly but maintains a strong physical presence in San Francisco and New York. Many engineering and GTM roles offer hybrid or fully remote options, while some operational roles may require an on-site presence.

What is the tech stack at Baseten?

The primary languages are Go and Python. High-performance work involves C++ and CUDA. Infrastructure is managed using Kubernetes, Docker, Terraform, and Bazel.

What makes Baseten different from AWS or Google Cloud for AI?

Baseten is highly specialized for model inference. While general-purpose cloud providers offer raw GPUs, Baseten adds an optimized software layer (such as the Truss framework and custom kernels) that makes it faster and easier for developers to get models into production and run them efficiently.

How many people work at Baseten?

Baseten is currently in a high-growth phase, with an estimated headcount of roughly 100 to 200 employees and dozens of open roles across all departments.

Summary

Baseten is a premier destination for talent in the AI infrastructure space. With a focus on the technical challenges of model inference, a builder-centric culture, and highly competitive compensation (roughly $150k to $300k+ for engineering roles), it offers a compelling environment for engineers, sales professionals, and operators alike. By bridging the gap between machine learning research and scalable production, Baseten is building not just a product but the foundation of the next generation of AI-native software. Candidates who are passionate about developer tools, GPU optimization, and high-stakes systems engineering will find Baseten one of the most rewarding places to grow their careers in the current tech landscape.