To me this looks extremely confusing. I believe the underlying behaviour should not be influenced by the dispatchers used somewhere up the call stack, especially when you call suspending functions from other libraries that you know nothing about.
One real case where this led me to big issues (bad performance, the program halting) is calling Ktor HTTP client code with Dispatchers.IO. I guess it's because the client creates a lot of coroutines internally and gets stuck when each of those coroutines runs on its own thread.
It would be interesting to hear your opinion on whether this is OK or not. Perhaps I am using coroutines the wrong way. It would also be great to hear @elizarov's opinion.
This is intentional behavior. A function can decide to either use the dispatcher of the caller or specify its own if needed. If it performs I/O, it should switch to Dispatchers.IO; if it performs concurrent CPU-heavy computation, it can switch to Dispatchers.Default; it can also decide to limit its parallelism or concurrency, and so on. And if it doesn't have such special needs, it can use the dispatcher of the caller, which avoids thread switches and lets the caller partially control the execution.
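To illustrate, here is a minimal sketch (the function names are made up for the example): a function with a special need switches dispatchers itself with withContext, while a function without such needs simply inherits whatever dispatcher the caller runs it on.

```kotlin
import kotlinx.coroutines.Dispatchers
import kotlinx.coroutines.withContext
import java.io.File

// Has a special need: it performs blocking I/O, so it switches to
// Dispatchers.IO itself, regardless of the caller's dispatcher.
suspend fun readConfig(path: String): String = withContext(Dispatchers.IO) {
    File(path).readText() // blocking call, safe on the IO pool
}

// Has no special needs of its own: it stays on the caller's dispatcher
// and lets readConfig handle its own switching internally.
suspend fun loadTrimmedConfig(path: String): String = readConfig(path).trim()
```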
Why does this concern you? Are you concerned that whenever you start a thread, its execution is influenced by the OS scheduler and by the number of CPU cores? With coroutines, threads become carriers of our coroutines, much like CPU cores are carriers of threads in classic code. We usually don't care too much which CPU core picked up our thread, and it is a similar story with the coroutine-to-thread association.
At the end of the day, in both cases your function did exactly what it was asked to do: it launched three concurrent coroutines, waited for them to finish, then returned. It doesn't matter much whether it executed them in parallel or sequentially, or in which order. If it needs such control, it should ask for it explicitly.
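In code, the situation might look like this (a sketch, not the original poster's code): the function's observable contract is "launch three coroutines and wait for all of them", whichever dispatcher the caller supplies.

```kotlin
import kotlinx.coroutines.*

// Launches three concurrent coroutines and returns only once all of
// them have finished; coroutineScope waits for its children.
suspend fun doThreeThings() = coroutineScope {
    repeat(3) { i ->
        launch { println("task $i on ${Thread.currentThread().name}") }
    }
}

fun main() = runBlocking {
    doThreeThings()                    // children run on the main thread
    withContext(Dispatchers.Default) { // children run on the Default pool
        doThreeThings()
    }
}
```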
Unfortunately, thread management is not transparent and can seriously affect execution, especially in high-performance code. One case where I ran into this was calling the Ktor client to read from two URLs in parallel with Dispatchers.IO, which simply led to it hanging. Without Dispatchers.IO it works normally, but it cannot be parallelized.
While I haven't had enough experience with Ktor specifically (and hence I'm not aware of implementation details that could be so heavily influenced by the execution context), I would point out that what you're concerned with is not about coroutines or coroutine dispatchers in general. Rather, it's about specific dispatchers such as Dispatchers.IO, which has a pool of threads and (if my memory serves me right) employs work-stealing. This is done so that blocking I/O operations (as is customary in Java) can happen concurrently (each on a different thread), transparently for the user (i.e., you don't have to explicitly specify which threads do what). The fact that each suspension point most often resumes on a different thread is necessary. Imagine a situation where coroutine A on thread 1 suspends, and coroutine B is dispatched to the now-idle thread 1. Before coroutine B completes, coroutine A resumes. Where can it resume? Not on thread 1, because it may be doing blocking work for coroutine B; that's why it will run on thread 2 (or any other idle thread). Basically, the IO dispatcher is optimized for maximum parallelism of blocking operations, which means having the largest feasible thread pool and distributing work evenly across it.
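A quick way to see this for yourself (a toy experiment, not Ktor-specific): run a coroutine on Dispatchers.IO and print the thread name around a suspension point; the resumption frequently lands on a different pool thread.

```kotlin
import kotlinx.coroutines.*

fun main() = runBlocking {
    withContext(Dispatchers.IO) {
        repeat(3) {
            println("before delay: ${Thread.currentThread().name}")
            delay(10) // suspension point; the original thread may be reused meanwhile
            println("after delay:  ${Thread.currentThread().name}") // often a different thread
        }
    }
}
```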
If you want a single thread, or a thread pool you provide, to run your coroutines, you can either implement your own dispatcher tailored to your specific situation or just create a Java Executor and a coroutine dispatcher from it (using Executor.asCoroutineDispatcher()).
In particular, from what I understand, if you want a number of tasks to each have a dedicated thread, making sure resumptions don't get intertwined, what you're looking for seems to be a single-threaded dispatcher for each of those tasks. This gives each thread a queue that the same task resumes on after suspension points; the dispatchers being unique to each task makes sure that tasks can't steal threads from each other, and the dispatchers being single-threaded ensures tasks don't move across threads (thus eliminating the possible concurrency issues you may be facing with Ktor).
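Something like the following sketch (pool sizes and names are illustrative, not a recommendation): one dispatcher built from an executor you own, plus a dedicated single-threaded dispatcher per task so each task's resumptions stay on its own thread.

```kotlin
import kotlinx.coroutines.*
import java.util.concurrent.Executors

fun main() = runBlocking {
    // A dispatcher backed by an executor you control (close it when done).
    val myPool = Executors.newFixedThreadPool(2).asCoroutineDispatcher()
    launch(myPool) { println("shared pool: ${Thread.currentThread().name}") }.join()
    myPool.close()

    // One dedicated single-threaded dispatcher per task: every resumption
    // of a task lands back on that task's own thread.
    (1..3).map { id ->
        val single = Executors.newSingleThreadExecutor().asCoroutineDispatcher()
        launch(single) {
            println("task $id before: ${Thread.currentThread().name}")
            delay(10)
            println("task $id after:  ${Thread.currentThread().name}") // same thread
        }.apply { invokeOnCompletion { single.close() } }
    }.joinAll()
}
```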
There might be a better solution that I'm not aware of, so it will be interesting to see what this conversation yields.
I just dumped my thoughts in a rather messy way. Feel free to ask about anything that wasn't clear.
Your opening question honestly reads like "why did this function run on the main thread when I executed it on the main thread, but run on different threads when I executed it in a thread pool?" If you have multiple threads and multiple pieces of asynchronous code, the code will execute on multiple threads. Think of launch as being like using CompletableFuture.supplyAsync.
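For instance, a rough side-by-side (the dispatcher and executor choices here are arbitrary): in both styles you hand work to a pool, and the pool decides which thread runs it.

```kotlin
import kotlinx.coroutines.*
import java.util.concurrent.CompletableFuture

fun main() = runBlocking {
    // Classic futures: the common fork-join pool picks the thread.
    CompletableFuture.supplyAsync {
        println("future on ${Thread.currentThread().name}")
    }.join()

    // Coroutines: the dispatcher picks the thread.
    launch(Dispatchers.Default) {
        println("coroutine on ${Thread.currentThread().name}")
    }.join()
}
```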
The way I see it, it's not only working as intended: it's meant to be like this to improve thread management. Whether it actually improves things, each developer can decide for themselves, but let me point out some useful tenets:
It is normal and expected that this feels weird to someone used to managing their own threads. This is, after all, a different paradigm.
Traditional thread management is evidently not a great way for humans to express themselves. Many programs are not threaded enough for their own good performance; those that are will most of the time use threads too heavily and in an unprincipled manner (e.g. devs randomly spinning up a thread to fix an issue with some blocking call because that's the fastest immediate fix; at least that's my experience, YMMV). Programs that need good multithreaded performance typically have to be written with that explicit goal in mind.
The dispatcher concept abstracts away the thread semantics. Traditional thread management doesn't help you with that; you'd have to realize you want it, then build it yourself.
It allows for a clear, explicit, and documented semantic contract between the dispatcher and its users, an API of sorts. When you say Dispatchers.IO, the reader knows that you want a pool, that you don't care which of its threads the coroutines use, and very likely that you're going to do blocking work or something with similar semantics. You can also write your own dispatcher with your preferred semantics (there's a sketch of this after the list).
It abstracts away the implementation of the dispatcher. With an API contract in place, you can improve the implementation of the dispatcher without breaking your clients (well, in theory at least).
It allows easier reuse of the dispatcher implementation.
Dispatchers also encourage sharing thread pools, which is a common use case. YMMV, but in my experience any medium-sized project eventually grows one thread pool per programmer who needs one, wasting resources. Large teams with shared thread pools tend to struggle to manage them; with traditional thread management, it's pretty difficult to share threads past your immediate team boundary.
This has the clear virtue of separating what processing has to be done from where it has to be done. Traditional thread management mixes the two together inextricably.
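As an example of writing your own contract (the name databaseDispatcher is made up, and limitedParallelism assumes a recent kotlinx.coroutines version): you can create a view of Dispatchers.IO that promises "at most two of these coroutines run at once", and the name itself documents the intent.

```kotlin
import kotlinx.coroutines.*

// A self-documenting contract: callers see that database work goes to a
// blocking-friendly pool and that at most two queries run concurrently.
val databaseDispatcher = Dispatchers.IO.limitedParallelism(2)

fun main() = runBlocking {
    repeat(4) { i ->
        launch(databaseDispatcher) {
            println("query $i on ${Thread.currentThread().name}")
        }
    }
}
```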
As I see it, these are the main reasons to build it this way. Whether this is superior in practice to traditional thread management, I guess time will tell. But the bar hasn’t been very high.
One can think the names of the dispatchers in Kotlin are not very clear (I do), but that would be a separate criticism. The debugging tools are also not there yet, but they’re improving.
Now, I don't have a lot of experience with dispatchers yet, but I have a good amount with traditional thread management, and I've suffered the same difficulties you have: dispatchers feel counterintuitive to me, I'm uncomfortable with the explicit bits being somewhere else, etc. But I feel like this about any unfamiliar paradigm that tries to improve on something I've been doing for a long time. Time will tell whether this is better.
Finally, about performance: most use cases do not need the very fine-grained thread management required for extreme performance. And as always, when you do need extreme performance, you will not be able to dispense with caring about the low-level details; having abstractions doesn't change that. I see it the same way as garbage collection: an app with tight memory constraints can't afford to ignore detailed memory considerations even under GC, but it's no worse off than with manual management (just different), while for all other apps things tend to be much easier.