Most AI startups never have this conversation. They're Linux-native, their GPUs run Ubuntu, and NFS is the obvious choice. Enterprise AI teams are different. They've got data scientists on Windows laptops, legacy ETL pipelines in .NET, Active Directory controlling access, and security teams that are particular about protocols.

For those teams, SMB isn't a nice-to-have. This post compares SMB and NFS honestly, explains where each shines, and makes the case for why enterprise AI infrastructure usually needs both.

The Protocols, Briefly

NFS (Network File System) — Sun, 1984. Designed for Unix. Stateless (v3) or stateful (v4). The default for Linux-world network filesystems.

SMB (Server Message Block) / CIFS — IBM via Microsoft, 1983. Designed for Windows. Stateful. Integrated with Active Directory for authentication.

Both are "network filesystems" but they evolved in different ecosystems, and the ecosystems leak into their design.

Where NFS Wins

Linux Native

Every Linux distribution has an NFS client in-kernel. mount -t nfs4 just works. No extra packages on most distros.

Higher Peak Throughput in Benchmarks

For streaming sequential reads on a well-tuned setup, NFSv4 often beats SMB by 10-30%. This matters for training workloads that saturate links.

Simpler Auth in Cloud-Native Environments

Outside of Kerberos, NFS auth is just UID/GID-based. For Kubernetes or cloud-native stacks where you're already managing identity through service accounts, NFS is less friction.

Better Concurrent Reader Performance

NFSv4.1 delegations let clients cache aggressively. Many concurrent readers of the same file (training!) scale well.

Where SMB Wins

Windows Native

net use Z: \\server\share is muscle memory for Windows admins. No client software to install.

Active Directory Integration

SMB ties directly into Kerberos + AD. Your existing group policies, permissions, and audit logs apply unchanged. This is huge for enterprises.

Encryption and Signing on by Default

SMB 3.1.1 has AES-128-GCM encryption and signing baked into the protocol. NFSv4 has sec=krb5p but it's not as commonly deployed.

Richer ACLs

Windows ACLs are more expressive than POSIX permissions. If your compliance team thinks in Windows ACLs, SMB keeps them happy.

SMB is universally supported on Windows, macOS, and Linux. NFS works on Linux and macOS but is clunky on Windows (requires the NFS Client feature).

Side-by-Side for ML Workloads

Concern	NFS	SMB
Peak throughput (streaming)	Excellent	Good
Small-file latency	Excellent (with v4.1)	Good
Linux client	Native kernel	cifs-utils
Windows client	Third-party / optional feature	Native
macOS client	Native	Native
AD integration	Via Kerberos, less common	First-class
In-transit encryption	Optional (sec=krb5p)	Default (SMB 3.x)
Kubernetes CSI	Widely supported	Widely supported
POSIX semantics	Full	Mostly
Locking	NFSv4 state locking	Opportunistic / lease-based

The Enterprise AI Reality

In a real enterprise:

Data scientists use a mix of Windows, Mac, and Linux laptops
Training clusters are Linux (that's where the drivers are)
Data engineering pipelines might be .NET, Python, or Java
Jupyter notebooks run on Linux VMs
Annotation tools often run on Windows
File servers traditionally used SMB, with AD-based permissions

Forcing everyone to one protocol means either:

Windows users get a worse experience (if you pick NFS)
Your training clusters get a worse experience (if you pick SMB)
Someone maintains two separate storage systems (both)

The "Both Protocols, Same Data" Pattern

The pattern that works: one logical storage layer, multiple protocols on top.

Canonical data in object storage (S3-compatible, managed or BYO)
NFS file system for Linux training clusters (fast, kernel-native)
SMB file system for Windows data scientists and legacy tooling
S3 API for anything that wants object semantics directly

All three point at the same bytes. A data scientist edits a sample in Excel via SMB; the training cluster picks up the change via NFS; a reporting tool reads it via S3 API. No copies, no sync jobs.

This is what Training Pipes provides. When you create a file system, you pick a protocol (NFS or SMB). You can create both for the same bucket. The underlying data is one canonical copy.

SMB 3.x Enterprise Features That Matter

SMB isn't just "NFS for Windows." Recent versions bring features enterprises actively want:

Per-share encryption configurable without IPSec
Message signing to prevent tampering
Persistent handles for transparent failover
Multi-channel for bonded-link throughput
Witness protocol for automatic failover notifications
Kerberos armoring (FAST) for strong auth

If your security team has a matrix of required protocol features, they're probably looking at this list.

When to Pick Just One

NFS only if: your environment is 100% Linux, you use UID/GID or project-based auth, and your security model is network-level (VPC isolation, WireGuard tunnels, etc.).

SMB only if: your environment is Windows-dominant, you live in Active Directory, and your storage needs are less about max throughput and more about broad client compatibility.

Both if: you're an enterprise AI team with a mixed fleet. Which, realistically, is most enterprise AI teams.

The Training Pipes Take

We support both because our customers asked for both. A managed bucket (or BYO connection) can expose an NFS mount target and an SMB mount target in the same region, pointing at the same data. Same cache, same durability story, two client-facing protocols.

Mount one bucket two ways →

SMB vs NFS for Enterprise AI Teams: Which Protocol Wins?