`Sequence.sorted*` methods

mfulton26 · August 15, 2016, 5:57pm

I was surprised to find the following methods in the standard library in the kotlin.sequences package:

sorted
sortedBy
sortedByDescending
sortedDescending
sortedWith

These methods convert a sequence to a mutable list, sort the list, and then return a sequence based on the iterator of that sorted list. This will cause performance/memory issues when used on large sequences and will never complete on infinite ones. Can these be deprecated or even removed, or am I missing something?
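A minimal sketch of the behavior described above, not the actual standard library source. The name sortedSketch is hypothetical and used only for illustration; the point is that every element has to be buffered in a list before the first sorted element can be returned.

```kotlin
// Hypothetical re-implementation for illustration only; the real
// kotlin.sequences code may differ in detail.
fun <T : Comparable<T>> Sequence<T>.sortedSketch(): Sequence<T> = object : Sequence<T> {
    override fun iterator(): Iterator<T> {
        // The whole source is copied into memory here.
        val buffer = this@sortedSketch.toMutableList()
        buffer.sort()
        // The returned sequence simply walks the sorted buffer.
        return buffer.iterator()
    }
}

fun main() {
    println(sequenceOf(3, 1, 2).sortedSketch().toList()) // [1, 2, 3]
}
```

Memory use grows with the number of elements in the source, so on an infinite source the copy step never finishes.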

It seems to me that they do not belong there and that anyone who wants to sort a sequence should have to explicitly convert it to a list and then sort that. Are there other methods on Sequence that won’t work on large/infinite instances?

ilya.gorbunov · August 15, 2016, 6:08pm

It should be noted that all these operations are lazy, i.e. the sequence isn’t materialized and sorted until an iterator is requested from it.
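The laziness can be observed with a small experiment. This is only a sketch: the helper name observeLazySorted and the consumed list are made up for the demonstration, while onEach, sorted and first are regular kotlin.sequences functions.

```kotlin
// Records which source elements were pulled before and after the
// sorted sequence is actually iterated.
fun observeLazySorted(): Triple<List<Int>, Int, List<Int>> {
    val consumed = mutableListOf<Int>()
    val sorted = sequenceOf(3, 1, 2)
        .onEach { consumed += it } // note every element pulled from the source
        .sorted()                  // nothing is pulled yet

    val beforeIteration = consumed.toList()
    val smallest = sorted.first()  // requesting an iterator materializes and sorts
    val afterIteration = consumed.toList()
    return Triple(beforeIteration, smallest, afterIteration)
}

fun main() {
    val (before, smallest, after) = observeLazySorted()
    println(before)   // []
    println(smallest) // 1
    println(after)    // [3, 1, 2]
}
```

Even though only the first element is requested, the whole source is consumed at that point.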

Plenty of them: toList, groupBy, all/any, partition, forEach, etc. All terminating methods that involve sequence iteration will not complete normally when invoked on an infinite sequence.

mfulton26 · August 15, 2016, 6:15pm

Right, I guess I meant to ask whether there are other methods that take a Sequence and return a Sequence but internally convert to a list. toList, groupBy, all/any, partition, etc. naturally exhaust the sequence, as their names imply, and forEach may run forever if the sequence is infinite. Again, this is intuitive.

I find it odd, though, that sorted takes a Sequence and returns a Sequence, as sorting cannot be done lazily. In order to sort a sequence you must convert it to a list, but some may overlook this. Can Sequence.sorted be changed to return a List instead? This would be safer in my opinion, as the signature would make the potential performance/memory impact of calling such a method clear, and then it really just becomes a shortcut for sequence.toList().sorted().
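For comparison, here is a short sketch of the two spellings. The helper name sortedAsList is hypothetical; it only wraps the explicit toList().sorted() form mentioned above so that the List return type is visible in the signature.

```kotlin
// Hypothetical helper: the return type makes the materialization explicit.
fun <T : Comparable<T>> sortedAsList(source: Sequence<T>): List<T> =
    source.toList().sorted()

fun main() {
    val words = sequenceOf("pear", "apple", "fig")

    // The existing API: the result is still typed as a Sequence.
    val viaSequence: Sequence<String> = words.sorted()

    // The explicit form: the result is a List.
    val viaList: List<String> = sortedAsList(words)

    println(viaList)                         // [apple, fig, pear]
    println(viaSequence.toList() == viaList) // true
}
```

Both produce the same elements in the same order; the difference is only in what the type tells the caller.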

ilya.gorbunov · August 15, 2016, 6:25pm

Another function with similar behavior is Sequence.minus(Sequence). When an iterator is obtained from the resulting sequence, the second sequence is materialized into a set, and that set is then used to lazily filter the first sequence.
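A small sketch that makes this observable. The helper name observeMinus and the touched list are invented for the demonstration; the code does not depend on which collection the library uses internally, only on the second sequence being fully read when iteration starts.

```kotlin
// Returns what was read from the second sequence before iteration,
// the first three results, and what was read after iteration.
fun observeMinus(): Triple<List<Int>, List<Int>, List<Int>> {
    val touched = mutableListOf<Int>()
    val excluded = sequenceOf(2, 4).onEach { touched += it }

    // The first sequence is infinite; it is only filtered lazily.
    val result = generateSequence(1) { it + 1 } - excluded

    val beforeIteration = touched.toList()
    val firstThree = result.take(3).toList()
    val afterIteration = touched.toList()
    return Triple(beforeIteration, firstThree, afterIteration)
}

fun main() {
    val (before, firstThree, after) = observeMinus()
    println(before)     // []
    println(firstThree) // [1, 3, 5]
    println(after)      // [2, 4]
}
```

The infinite first sequence is fine here, but an infinite second sequence would never finish materializing.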

Some time before release we had Sequence.sortToList or something like this. We removed it, as it was quite cumbersome and that naming didn’t scale well to cover all of the sorted overloads.

Changing the return type is not an option now, after the release.

bailieb · August 17, 2016, 6:48am

Another sequence operation that can involve significant overhead is distinct (or distinctBy), which involves a Set internally.

Such operations can be described as stateful, as they retain state regarding earlier elements that is used when processing a given element (and therefore trigger consumption of the source of the sequence and potentially involve internal overhead). Stateless operations can process elements without regard to the number or value of any earlier elements (and therefore do not trigger consumption of the source of the sequence and do not involve significant internal overhead).
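To illustrate the distinction, here is a rough sketch of a stateful operation next to a stateless one. distinctSketch is a hypothetical name and is not the library implementation; it assumes the Set-based approach mentioned above.

```kotlin
// Hypothetical stateful operation: remembers every element seen so far.
fun <T> Sequence<T>.distinctSketch(): Sequence<T> = sequence {
    val seen = HashSet<T>() // state that grows with the number of distinct elements
    for (item in this@distinctSketch) {
        if (seen.add(item)) yield(item) // emit only the first occurrence
    }
}

fun main() {
    val naturals = generateSequence(0) { it + 1 } // infinite source

    // Stateless: map looks at one element at a time and keeps nothing.
    println(naturals.map { it * 2 }.take(3).toList()) // [0, 2, 4]

    // Stateful: the set of seen values is kept for the whole iteration.
    println(naturals.map { it % 3 }.distinctSketch().take(3).toList()) // [0, 1, 2]
}
```

In this sketch the stateful operation still produces elements lazily, so its cost shows up as retained state rather than as up-front consumption. Asking for a fourth distinct value in the second pipeline would never return, because the source only ever yields three distinct values.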

In the official documentation for the Java Stream API, all intermediate operations are categorised as stateful or stateless. This categorisation is important for the developer because it indicates whether the operation is cheap or potentially expensive (especially if the stream is being processed in parallel). This documentation also defines intermediate vs terminal operations and stateful vs stateless intermediate operations, and explains the significance of each.

It is perhaps worth noting that the JDK 8 design includes stateful intermediate operations that return streams (similar to the design decision in Kotlin to include stateful intermediate operations that return a sequence).

Could something similar be added to the documentation for the kotlin.sequences package?

For example, definitions of intermediate vs terminal operations and stateful vs stateless intermediate operations (as well as indicating in the method level documentation what type each operation is).

ilya.gorbunov · August 19, 2016, 2:19pm

Good idea, I think we could categorize each sequence operation as stateful, stateless, or terminal.

ilya.gorbunov · May 22, 2017, 4:42pm

We’ve improved docs for sequence operations in this regard in 1.1.2, see for example this section and linked pages: kotlin.sequences - Kotlin Programming Language

timvanoijen · June 28, 2025, 1:09pm

To address these needs, I have written a small lib that contains some efficient operations for sequences that are explicitly asserted to be sorted:

https://github.com/timvanoijen/sorted-sequence

Example:

val seq1 = sequenceOf("a1", "b2").assertSortedBy { it.first() }
val seq2 = sequenceOf("b3", "c4").assertSortedBy { it.first() }

// With default pairing
val joined = seq1.fullOuterJoinByKey(seq2)
// Results in: [("a1" to null), ("b2" to "b3"), (null to "c4")]

// With custom merge function
val merged = seq1.fullOuterJoinByKey(seq2) { key, v1, v2 -> "${v1 ?: ""}${v2 ?: ""}" }
// Results in: ["a1", "b2b3", "c4"]
