Spring, Coroutines, Virtual Threads

I’m working on a legacy REST backend written in Kotlin using the Servlet stack / Tomcat / thread-per-request model.

In experimenting with going fully async/non-blocking, I’ve looked at Virtual Threads and Coroutines.

Virtual Threads have the advantage of not requiring any changes in the existing code.

Coroutines do require changes to go fully async, namely adding suspend in a bunch of places, but also require moving to R2DBC and various other “fixes”. Not a huge deal but touches a lot of places.

Further, with Spring’s ThreadLocal-based machinery and Java agent instrumentation in play, I worry about how to do context propagation in coroutines.

I am fully aware of structured concurrency, still lacking in Java until later this year. There’s also pinning, which from what I’ve gathered most Java libraries have addressed (like the Postgres driver), but I’m not sure if coroutines have fully addressed it.

Which leads me to my questions / dilemmas.

  1. In a legacy Spring Boot setup (not WebFlux, Flow, Rx, or Ktor), does it make sense to use Coroutines?
  2. I’ve heard of running Coroutines on top of Virtual threads, but wouldn’t that double memory allocations? Wouldn’t that make context propagation that much harder?
  3. Is it ever a good idea to use runBlocking in a Spring service?

Any insights are appreciated. Unclear on what folks are doing in similar setups to mine.

2 Likes

I never had a chance to compare a service running coroutines and VTs side by side, so take my opinion with a grain of salt. My general impression is that coroutines don’t provide that much value over VTs. VTs are much more integrated into the runtime itself, while coroutines are a kind of hack implemented in the bytecode. If we need multiplatform, need to support older JVMs, like the structured concurrency of coroutines, or like their API in general (flows, for example, are great), we can go with coroutines. But if we only need lightweight concurrency, VTs are probably better in the long run.

Coroutines propagate context using… well, CoroutineContext. If we need to integrate with code that uses thread locals, coroutines provide tools to do this.
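
For example, a minimal sketch (requestId here is a made-up thread local, standing in for whatever Spring or a Java agent reads):

import kotlinx.coroutines.Dispatchers
import kotlinx.coroutines.asContextElement
import kotlinx.coroutines.delay
import kotlinx.coroutines.runBlocking
import kotlinx.coroutines.withContext

// Hypothetical thread local, standing in for whatever Spring or an agent reads.
val requestId = ThreadLocal<String?>()

fun main() = runBlocking {
    // asContextElement captures a value and re-installs it on whichever thread
    // the coroutine is dispatched to or resumes on after suspension.
    withContext(Dispatchers.Default + requestId.asContextElement(value = "req-42")) {
        delay(100)
        println(requestId.get())  // still "req-42", even on a pool thread
    }
}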

Coroutines couldn’t really solve the problem of thread blocking, so any blocking code is simply scheduled onto a larger thread pool meant for blocking. And we have to switch manually, as coroutines can’t detect blocking. So coroutines don’t have the problem of pinning, because… they never even got that far :wink:

We should generally avoid it. runBlocking is meant for bridging with code that is not coroutine-aware. Spring can run coroutines, so we shouldn’t need runBlocking there.
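
For reference, a sketch of what “Spring can run coroutines” means in practice, assuming a Spring version whose web layer can adapt suspend handler methods (WebFlux, or a recent Spring MVC); the controller and service names are made up:

import kotlinx.coroutines.delay
import org.springframework.stereotype.Service
import org.springframework.web.bind.annotation.GetMapping
import org.springframework.web.bind.annotation.RestController

// Hypothetical service with a suspending call (e.g. wrapping a non-blocking client).
@Service
class GreetingService {
    suspend fun fetchGreeting(): String {
        delay(10)  // stand-in for a real suspending call
        return "hello"
    }
}

@RestController
class GreetingController(private val service: GreetingService) {

    // Spring invokes the suspend function itself; no runBlocking needed.
    @GetMapping("/greeting")
    suspend fun greeting(): String = service.fetchGreeting()
}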

1 Like

Thanks for the replies. I wasn’t aware that coroutines reschedule blocking; are there resources to look into that?

I have seen coroutines having advantages over Loom. I’ll have to go back and find some of those.

I have a hard time advocating for coroutines in this particular app, because virtual threads accomplish a lot of the same (suspension, continuations), but do so without any code modification.

Explicit parallelization (CompletableFuture.supplyAsync) and the forthcoming try-with-resources (TWR) structured concurrency are definitely not as pleasing as the rest of Kotlin.

Guess that could be made a bit better with some Spring magic or a compiler plugin, but not sure the juice is worth the squeeze.
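
For context, the explicit-parallelization shape I’d end up with looks roughly like this (just a sketch; the loaders are stand-ins for real blocking calls):

import java.util.concurrent.CompletableFuture
import java.util.concurrent.Executors

// Stand-ins for blocking calls (e.g. JDBC); names are illustrative only.
fun loadUser(): String { Thread.sleep(100); return "alice" }
fun loadOrderCount(): Int { Thread.sleep(100); return 3 }

fun main() {
    Executors.newVirtualThreadPerTaskExecutor().use { executor ->
        // Explicit fan-out with CompletableFuture on virtual threads.
        val user = CompletableFuture.supplyAsync({ loadUser() }, executor)
        val orders = CompletableFuture.supplyAsync({ loadOrderCount() }, executor)
        println("${user.join()} has ${orders.join()} orders")
    }
}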

They don’t reschedule by themselves - we have to do it.

It is funny, but I looked quickly through the coroutines documentation and I didn’t find anything about handling blocking code. But I can assure you it is generally discouraged to run blocking code in coroutines. This is not a hard requirement. It only means that if we use a small thread pool for the best CPU utilization (the default) and a coroutine gets into blocking code, the thread will be blocked and can’t run any coroutines until it unblocks. Or, if we write a GUI application using coroutines and we block the main thread, it still causes the app to stop responding. But if we create our own thread pools for running coroutines, if we only block for short periods, or if performance isn’t critical for our application, technically we could block.

Coroutines provide a shared, bigger thread pool to offload blocking code to: Dispatchers.IO.
But we have to do it manually: withContext(Dispatchers.IO) { readFileContents() }.
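
Spelled out as a whole function, the pattern is simply this (a sketch; the file read stands in for any blocking call):

import kotlinx.coroutines.Dispatchers
import kotlinx.coroutines.withContext
import java.io.File

// Expose a blocking read as a suspend function by hopping to Dispatchers.IO.
// The call still blocks a thread, but one from the IO pool, not the default dispatcher.
suspend fun readFileContents(path: String): String =
    withContext(Dispatchers.IO) {
        File(path).readText()  // blocking I/O
    }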

1 Like

Thanks again. Last question, is anyone running coroutines on virtual threads, or is that a bad idea?

I don’t have a definitive answer on this. Technically speaking, running coroutines on top of virtual threads is as simple as creating a dispatcher with Executors.newVirtualThreadPerTaskExecutor().asCoroutineDispatcher() and scheduling with it:

import java.util.concurrent.Executors
import kotlinx.coroutines.asCoroutineDispatcher
import kotlinx.coroutines.async
import kotlinx.coroutines.delay
import kotlinx.coroutines.withContext

val vtDispatcher = Executors.newVirtualThreadPerTaskExecutor().asCoroutineDispatcher()

suspend fun main() = withContext(vtDispatcher) {
    println("#1: ${Thread.currentThread()}")

    val deferred1 = async {
        println("#2: ${Thread.currentThread()}")
        delay(500)
        println("#3: ${Thread.currentThread()}")
        "hello"
    }
    val deferred2 = async {
        println("#3: ${Thread.currentThread()}")
        delay(1000)
        println("#4: ${Thread.currentThread()}")
        "world"
    }

    println(deferred1.await() + deferred2.await())
    println("#5: ${Thread.currentThread()}")
}

Result:

#1: VirtualThread[#20]/runnable@ForkJoinPool-1-worker-1
#2: VirtualThread[#25]/runnable@ForkJoinPool-1-worker-3
#3: VirtualThread[#26]/runnable@ForkJoinPool-1-worker-4
#3: VirtualThread[#28]/runnable@ForkJoinPool-1-worker-3
#4: VirtualThread[#30]/runnable@ForkJoinPool-1-worker-4
helloworld
#5: VirtualThread[#31]/runnable@ForkJoinPool-1-worker-1

However, it feels like both frameworks duplicate the same functionality; they do similar things in different ways. They don’t cooperate and aren’t aware of each other. If we suspend using coroutines, from the VTs’ perspective we just schedule multiple VTs. If we suspend using VTs, coroutines perceive this as thread blocking (but we have a potentially unlimited number of threads, so this is not a problem).

I see potential benefits of this, e.g. using coroutine APIs and tools while not having to worry about blocking code. But I don’t know; I never tried this pattern myself.

1 Like

Speaking as someone who absolutely loves coroutines… yeah, it sounds like in your case using VTs is the way to go.

2 Likes

Thanks. The direction I am leaning is:

  1. If I had to start a new application, I would start with Coroutines (most likely)
  2. If I have a legacy codebase built around the thread-per-request model, with limited need for concurrency control (other than the occasional launch and async), then Virtual Threads + CompletableFuture are the way to go

Watching Roman’s video “Coroutines and Loom behind the scenes” seems to support the above conclusions. He often refers to Virtual Threads as best suited for the “Virtual Thread per Request” model.

Right, they don’t cooperate right now, AFAIK. Roman mentioned in his talk (before he left JetBrains) that maybe Loom + coroutines could work together better at some point in the future. Not sure if that is something on the Kotlin roadmap or not.

I’m wondering about the same. We can easily imagine my whole fork-join example above being automatically translated by the coroutines machinery to VTs: when we do async, coroutines internally start a new thread, delay() becomes Thread.sleep, and await is a join or awaiting a future. Job done.

However, coroutines provide a much more advanced and lower-level API than Loom. Continuations are part of the official API and we can do many crazy things with them, not necessarily related to concurrency: we can create state machines, monads, generators, etc. I don’t think these are directly translatable to VTs. Also, coroutines are scheduled cooperatively, so they provide certain guarantees which VTs again can’t provide.
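
A tiny example of that non-concurrency use: the stdlib’s sequence builder is a generator built on the same suspension machinery.

fun fibonacci(): Sequence<Long> = sequence {
    // The block is a restricted coroutine: each yield suspends it
    // until the consumer asks for the next element.
    var a = 0L
    var b = 1L
    while (true) {
        yield(a)
        val next = a + b
        a = b
        b = next
    }
}

fun main() {
    println(fibonacci().take(10).toList())  // [0, 1, 1, 2, 3, 5, 8, 13, 21, 34]
}

No dispatcher or thread is involved here; the “suspension” is just the compiler-generated state machine.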

But of course, coroutines could use the native functionality of VTs wherever possible and still use their own implementation otherwise (but this would probably mean we still have suspend functions, even if we don’t need them most of the time). Or they could limit the functionality of coroutines when compiling the code with Loom support.

1 Like

A coroutine example I’m fond of is this, which will switch a UI component to an error state for 3 seconds and then switch it back.

mainScope.launch {
    view.setErrorState(true)
    delay(3000)
    view.setErrorState(false)
}

This always runs on one thread (the main one), doesn’t do anything that most people think of when they think “concurrent programming”, and is safe since the scope will be cancelled when the UI goes away. This is my counterargument whenever I hear someone say, “Coroutines are lightweight threads.”

Thanks for the example. I read from your response that you mean “Coroutines are more than lightweight threads”… i.e., like Roman mentioned, fine-grained concurrency. Or, more broadly, tools for doing more with concurrency than vanilla async/await.

I think the best definition of coroutines is “code that can suspend without blocking the thread it’s running on”. So I guess VTs are code that can block a thread, but you don’t care.

For me, “coroutines” means we have the ability to explicitly jump to another code location and stack. This comes from the name: a subroutine is when we call another part of the code and it becomes part of our execution flow; it is added to our stack, it becomes our child. A coroutine is when we call into another existing execution flow, our sibling, another stack.

Such a jump can be used for suspending (we jump out somewhere, and after some time someone jumps back to us), but it can be used for many other cases, again: state machines, generators, etc. Kotlin coroutines provide this capability with continuations. VTs don’t provide such a capability: we can only request to suspend or resume another VT, but we can’t request to jump to another VT.
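
A small sketch of that with the low-level API (the callback-style client is made up): suspendCoroutine hands us the continuation as a first-class object that we can stash and resume later, from anywhere.

import kotlin.coroutines.resume
import kotlin.coroutines.suspendCoroutine

// Hypothetical callback-style client, for illustration only.
interface CallbackClient {
    fun fetch(onDone: (String) -> Unit)
}

// The captured continuation is a "jump" back into this suspended stack;
// whoever calls resume decides when and from which thread we continue.
suspend fun fetchSuspending(client: CallbackClient): String =
    suspendCoroutine { cont ->
        client.fetch { result -> cont.resume(result) }
    }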

But I guess this is a digression from the main topic.

You might be interested in reading/contributing to this:

In my to-read list currently, but haven’t had the time yet.

I don’t fully understand the interplay between VTs in Loom and coroutines, so the following may be incorrect.

When a coroutine is started, it will get a VT as a carrier thread.

When a coroutine suspends, it is unmounted from the VT. Does that VT then get garbage collected?

When a coroutine resumes, I presume a new VT is created and the coroutine is mounted to this new VT.

What happens when the VT yields? For example, when it makes a JDBC call.

I presume that the coroutine is just hanging out in state on the VT, so the whole thing (coroutine + VT) is unmounted from the VT’s carrier thread. When the VT is resumed, the whole thing gets mounted on a platform thread and continues.

Is this right?

Are there currently any downsides to this approach? Obviously, there are a lot more allocations, as you create both a coroutine and a VT. You also create a new VT every time a coroutine resumes (I think).

Not sure of the other tradeoffs, risks.

And to my earlier question, is anyone doing this in production?

Thanks!

I believe you got it right and this is what I described in previous posts. I don’t see major downsides of this approach, only:

  • Added complexity - sometimes we suspend using VTs, sometimes using coroutines; both mechanisms are independent of each other, so developers, tools, etc. need to be aware of both mechanisms.
  • Potentially added overhead, e.g. we still use suspend functions, continuations, etc. even if we only ever suspend using VTs.

My understanding so far:

  • Coroutines are more lightweight. If you have many small CPU-bound tasks (e.g. actors, complex streaming APIs, reactive pipelines), coroutines will be better.
  • Virtual threads automatically convert blocking IO into async IO. If your workflow is very IO-bound (e.g. just calling a database driver), Virtual threads will be better.
  • In theory, modern frameworks should only use async IO, so Virtual threads’ auto-conversion shouldn’t help much, but in practice many widely used libraries still use blocking IO.

To your last point, a ton of apps use JDBC / JPA which is blocking.
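
Which is exactly where virtual threads help without code changes; a rough sketch of thread-per-request with blocking JDBC (the URL and query are placeholders, and a driver is assumed on the classpath):

import java.sql.DriverManager

fun main() {
    val url = "jdbc:postgresql://localhost:5432/app"  // placeholder URL

    // One virtual thread per "request"; the body is ordinary blocking code.
    val worker = Thread.ofVirtual().start {
        // While the query waits on the socket, the JDK parks the virtual
        // thread and frees its carrier; no code changes needed.
        DriverManager.getConnection(url).use { conn ->
            conn.createStatement().executeQuery("SELECT 1").use { rs ->
                rs.next()
                println(rs.getInt(1))
            }
        }
    }
    worker.join()
}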

1 Like