Java Virtual Threads: Project Loom Explained

TopicTrick Team

"For decades, concurrency was a choice between simplicity and throughput. Project Loom allows you to have both, effectively ending the era of mandatory reactive programming for high-scale backends."

In the thirty-year history of Java, few changes have been as disruptive and empowering as Project Loom. Since the 1990s, Java threads have been "Platform Threads"—heavyweight wrappers around Operating System (OS) threads. This 1:1 mapping created a "Memory Wall": to handle 1,000,000 concurrent requests with a default 1 MB stack per thread, a traditional server would need roughly 1 TB of RAM just for thread stacks. This limitation forced architects to adopt complex, hard-to-read "Reactive" code (like WebFlux or RxJava) to squeeze performance out of a limited number of threads.

With the arrival of Virtual Threads in Java 21, the landscape has changed forever. Virtual threads are cheap, lightweight threads managed by the JVM, not the OS. This masterclass explores the architectural shift from kernel-level 1:1 scheduling to JVM-level M:N scheduling and how you can leverage it to build systems that scale to millions of concurrent tasks.


1. The Architectural Failure of Platform Threads

A traditional Platform Thread is a massive resource mapping directly to an OS kernel thread.

The Cost of the Kernel

  • Memory Footprint: Each platform thread reserves about 1 MB for its stack by default. If you start 1,000 threads, you've dedicated 1 GB of RAM just to thread stacks—before your application has processed a single byte of business data.
  • Context-Switching Overhead: The OS is responsible for context switching. When the CPU swaps from Thread A to Thread B (the "Context Switch"), it must freeze the current stack, save the CPU registers, switch into kernel mode, and reload the state for the new thread. This frequently consumes thousands of CPU cycles, creating a high "Concurrency Tax."
  • The Scalability Wall: Because threads are expensive, we were forced to use Thread Pools. This introduced a new problem: if a thread in the pool blocks for a slow database call, that expensive resource is effectively "dead" until the I/O returns, reducing the total throughput of your server.
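The pool-scarcity problem above can be sketched in a few lines: a fixed pool of two platform threads is handed four tasks that each block for 100 ms (a stand-in for a slow database call). Because a blocked thread is unavailable until its "I/O" returns, the four tasks take two rounds (~200 ms) instead of one.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class PoolScarcityDemo {
    // Runs `tasks` blocking jobs on a fixed pool and returns elapsed milliseconds.
    static long runBlockingTasks(int poolSize, int tasks, long blockMillis) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        CountDownLatch done = new CountDownLatch(tasks);
        long start = System.nanoTime();
        for (int i = 0; i < tasks; i++) {
            pool.submit(() -> {
                try {
                    Thread.sleep(blockMillis); // simulated slow database call
                } catch (InterruptedException ignored) {
                }
                done.countDown();
            });
        }
        done.await();
        pool.shutdown();
        return (System.nanoTime() - start) / 1_000_000;
    }

    public static void main(String[] args) throws Exception {
        // 4 tasks, 2 threads: the second pair must wait for the first to unblock.
        System.out.println("elapsed ms: " + runBlockingTasks(2, 4, 100));
    }
}
```

With virtual threads (next section), all four tasks could block simultaneously without tying up OS threads.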

2. Virtual Threads: Under the Hood of the M:N Model

A Virtual Thread is not a native OS resource; it is an ordinary Java object (internally, the package-private class java.lang.VirtualThread).

Carrier Threads and the Continuation Magic

The JVM implements Virtual Threads using a technique called M:N Scheduling:

  • Carrier Threads: The JVM maintains a pool of platform threads (usually equal to the number of CPU cores) called Carrier Threads. The default scheduler is a Work-Stealing ForkJoinPool.
  • The Continuation: The core of Project Loom is the Continuation object (jdk.internal.vm.Continuation). It represents a "Unit of Execution" that can be suspended and resumed.
  • Mounting: When a virtual thread starts, the JVM "Mounts" it onto a carrier thread. The business logic executes normally until it hits a blocking operation (like socket.read() or Thread.sleep()).
  • Unmounting: Instead of making the OS thread wait, the JVM captures the virtual thread's stack state and moves it to the Heap. The carrier thread is now free to pick up a different virtual thread immediately.
  • Remounting: When the I/O operation completes, the JVM's internal I/O Poller (using native epoll on Linux or kqueue on macOS) signals that data is ready. The virtual thread is scheduled for "Remounting" onto any available carrier thread and resumes execution exactly where it left off.

The result: you write code that looks synchronous and blocking, yet its thread utilization and throughput match those of the most complex asynchronous, non-blocking code.


3. The "Pinning" Trap and Bytecode Diagnostics

While virtual threads are transformative, they introduce a new architectural edge case: Pinning. Pinning occurs when a virtual thread cannot be unmounted from its carrier thread even while waiting for I/O.

Why Pinning Happens

  • Synchronized Blocks: On JDK 21 through 23, a virtual thread that blocks inside a synchronized block or method is "Pinned" to its carrier, because those releases cannot safely move an object monitor between carrier threads. If your virtual thread waits for a database result inside a synchronized block, the carrier thread is effectively dead. (JDK 24's JEP 491 removes this limitation, but code targeting the JDK 21 LTS must still design around it.)
  • Native Methods: Calling JNI or native C-code often pins the thread.

The Architect's Fix

In an enterprise architecture running the JDK 21 LTS, replace synchronized with java.util.concurrent.locks.ReentrantLock. ReentrantLock is virtual-thread aware: it uses LockSupport.park(), which allows the JVM to unmount the waiting thread safely without blocking the underlying OS resource.
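A minimal sketch of the fix: the critical section is guarded by a ReentrantLock rather than synchronized, so a virtual thread that parks while waiting for the lock frees its carrier instead of pinning it.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.locks.ReentrantLock;

public class PinningFix {
    private static final ReentrantLock LOCK = new ReentrantLock();
    private static int counter = 0;

    // 1,000 virtual threads contend on the lock; waiting threads park (and
    // unmount) instead of pinning their carrier as synchronized would on JDK 21.
    static int incrementFromVirtualThreads(int threads) throws InterruptedException {
        counter = 0;
        try (ExecutorService executor = Executors.newVirtualThreadPerTaskExecutor()) {
            for (int i = 0; i < threads; i++) {
                executor.submit(() -> {
                    LOCK.lock(); // parks the virtual thread, frees the carrier
                    try {
                        counter++; // critical section
                    } finally {
                        LOCK.unlock();
                    }
                });
            }
        } // close() waits for all increments to finish
        return counter;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(incrementFromVirtualThreads(1_000));
    }
}
```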

Diagnostic Tip: Use Java Flight Recorder (JFR) to detect pinning. By enabling the jdk.VirtualThreadPinned event, you can see a stack trace of every line of code causing carrier thread exhaustion.


4. Goodbye ThreadPools: Scalability over Scarcity

The most common mistake senior developers make with Project Loom is trying to "Pool" virtual threads.

Pools are for Scarcity; Loom is Plentiful

  • Creation over Reuse: A virtual thread is as cheap as a StringBuilder object. Creating 100,000 virtual threads per second is perfectly valid and avoids the bookkeeping overhead of managing a pool.
  • The Task-Per-Thread Pattern: Stop asking "How many threads can I afford?" and start thinking "one thread per task." If you have 10,000 concurrent users, start 10,000 threads.
  • Semaphore-based Throttling: Instead of limiting threads, limit the Resource. If your database can only handle 50 concurrent connections, use a Semaphore with 50 permits inside your virtual threads. This decouples your "Concurrency" (the number of tasks you can handle) from your "Capacity" (the external resources you can call).
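A sketch of the semaphore pattern, using a simulated 5 ms database call: any number of virtual threads may start, but a Semaphore caps how many are inside the "database" section at once. The maxSeen counter verifies the ceiling is never exceeded.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;
import java.util.concurrent.atomic.AtomicInteger;

public class SemaphoreThrottle {
    // Runs `tasks` virtual threads through a `permits`-wide gate and returns
    // the highest number of tasks ever observed inside the gate at once.
    static int maxObservedInFlight(int tasks, int permits) {
        Semaphore dbPermits = new Semaphore(permits);
        AtomicInteger inFlight = new AtomicInteger();
        AtomicInteger maxSeen = new AtomicInteger();
        try (ExecutorService executor = Executors.newVirtualThreadPerTaskExecutor()) {
            for (int i = 0; i < tasks; i++) {
                executor.submit(() -> {
                    dbPermits.acquireUninterruptibly(); // wait for capacity, not for a thread
                    try {
                        int now = inFlight.incrementAndGet();
                        maxSeen.accumulateAndGet(now, Math::max);
                        Thread.sleep(5); // stand-in for the real DB call
                    } catch (InterruptedException ignored) {
                    } finally {
                        inFlight.decrementAndGet();
                        dbPermits.release();
                    }
                });
            }
        } // close() waits for all tasks
        return maxSeen.get();
    }

    public static void main(String[] args) {
        System.out.println("max concurrent DB calls: " + maxObservedInFlight(500, 50));
    }
}
```

Concurrency (500 tasks) and capacity (50 permits) are now tuned independently.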

5. Memory Forensics: Stacks, GC, and Scoped Values

Heap Allocation vs. Stack Allocation

Traditional threads have a fixed stack size allocated in native memory. Virtual thread stacks are dynamic and live on the Heap, managed by the Garbage Collector (ZGC or G1GC). Using virtual threads therefore increases the object-allocation rate, making modern GC tuning (specifically ZGC) essential for low-latency systems.

Scoped Values (The ThreadLocal Revolution)

Using ThreadLocal with virtual threads is a "Memory Bomb." If you have 1,000,000 virtual threads, each with its own ThreadLocal map, you will quickly hit a heap OutOfMemoryError. Java 21 introduces Scoped Values (a preview feature, finalized in JDK 25) as a lightweight, immutable alternative. Scoped values are inherited by child threads (via Structured Concurrency) and are cleared automatically when their scope exits, preventing the internal memory leaks that have plagued Java enterprise apps for decades.
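A minimal sketch of a ScopedValue replacing a request-scoped ThreadLocal (the REQUEST_ID name is illustrative). Note the JDK requirement: ScopedValue is a preview API in JDK 21–24 and final from JDK 25 (JEP 506), so this compiles without flags only on JDK 25+.

```java
public class ScopedValueDemo {
    static final ScopedValue<String> REQUEST_ID = ScopedValue.newInstance();

    static String handle() {
        // Readable anywhere within the dynamic scope; immutable; no remove() needed.
        return "handling " + REQUEST_ID.get();
    }

    // Binds REQUEST_ID only for the duration of run(); the binding vanishes
    // automatically when run() returns, so nothing can leak across requests.
    static String serveRequest(String requestId) {
        String[] out = new String[1];
        ScopedValue.where(REQUEST_ID, requestId).run(() -> out[0] = handle());
        return out[0];
    }

    public static void main(String[] args) {
        System.out.println(serveRequest("req-42"));
        System.out.println("still bound? " + REQUEST_ID.isBound()); // false: cleared automatically
    }
}
```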


6. Performance Benchmarks: Loom vs. Platform vs. Reactive

Throughput Benchmarks

In a representative "Echo Server" benchmark:

  • Platform Threads: 2,000 requests/sec (limited by RAM and OS context switches).
  • Reactive (Netty): 100,000 requests/sec (high complexity, high throughput).
  • Virtual Threads: 95,000 requests/sec (minimal complexity, near-identical throughput to Reactive).

The Verdict: Unless you are building a highly specialized low-latency engine where every microsecond of garbage-collection pause counts, Virtual Threads should be your default choice for enterprise web services.


7. Case Study: High-Scale FinTech Portfolio Sync

In this scenario, a portfolio sync engine needs to fetch data from 50 different microservices simultaneously for each user.

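A hedged sketch of what that Loom-style fan-out could look like; fetchFromService() is an invented stand-in for a blocking HTTP client call, with Thread.sleep() simulating network latency:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class PortfolioSync {
    // Hypothetical stand-in for a blocking HTTP call to one microservice.
    static String fetchFromService(int serviceId) {
        try {
            Thread.sleep(20); // simulated network I/O; the virtual thread unmounts here
        } catch (InterruptedException ignored) {
        }
        return "positions-from-service-" + serviceId;
    }

    // One virtual thread per microservice call: 50 blocking calls run concurrently.
    static List<String> syncPortfolio(int serviceCount) throws InterruptedException {
        try (ExecutorService executor = Executors.newVirtualThreadPerTaskExecutor()) {
            List<Callable<String>> calls = new ArrayList<>();
            for (int i = 0; i < serviceCount; i++) {
                int id = i;
                calls.add(() -> fetchFromService(id));
            }
            List<String> results = new ArrayList<>();
            for (Future<String> f : executor.invokeAll(calls)) { // blocks until all return
                try {
                    results.add(f.get());
                } catch (ExecutionException e) {
                    throw new RuntimeException(e);
                }
            }
            return results;
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(syncPortfolio(50).size() + " services synced");
    }
}
```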

In the old model, this would require a complex CompletableFuture.allOf() chain or a Reactive zip operation. With Loom, the code remains simple, sequential, and easy to reason about.


Summary: Designing for the Horizon

Project Loom is the most significant leap in Java's competitiveness since the introduction of Generics. By removing the operating system as a bottleneck, Java has regained its status as the premier environment for high-scale backend engineering.

  1. Retire Reactive Architectures: For 99% of microservices, there is no longer a business case for the complexity of non-blocking callback chains.
  2. Audit Your Locks: Replace synchronized with ReentrantLock to avoid pinning.
  3. Think in Tasks: Stop pooling threads. Start threads for every request and let the JVM handle the heavy lifting.

You have now moved from "managing threads" to "architecting global high-throughput systems."

The Ethical Edge: Sustainability and Cloud Economics

Beyond pure performance, Virtual Threads represent a significant shift in Sustainable Engineering. In a traditional platform-thread architecture, a server might spend 70% of its time waiting for I/O while still drawing significant power to maintain large memory stacks and perform kernel-level scheduling. By maximizing the density of tasks on a single CPU core, Project Loom allows companies to drastically reduce their server footprint.

In 2026, where "Cloud Cost Optimization" and "Green Computing" are top priorities, virtual threads are a competitive advantage. You can handle the same workload with a fraction of the EC2 instances, reducing both your AWS bill and your system's carbon footprint. Project Loom is not just a technical update; it is an economic and environmental imperative.


Part of the Java Enterprise Mastery — engineering the revolution.