What is a coroutine? How are they related to concurrency?
Coroutines and concurrency are largely orthogonal. Coroutines are a general control structure whereby flow control is cooperatively passed between two different routines without returning.
The 'yield' statement in Python is a good example. It creates a coroutine. When the 'yield ' is encountered the current state of the function is saved and control is returned to the calling function. The calling function can then transfer execution back to the yielding function and its state will be restored to the point where the 'yield' was encountered and execution will continue.
Coroutines are a general control structure whereby flow control is cooperatively passed between two different routines without returning.
<-- This is concurrency. The word you are looking for is parallelism. - Adam Arold
orthogonal = Not similar to each other
? - tonix
yield
was a language construct to implement generators
, which are essentially a subset of coroutines
. If we talk about "pure" coroutines
, then the control is not returned to the calling function, but to another coroutine. What can you say about that? - tonix
orthogonal
means "independent of each other". - Rick
I find most of the answers too technical even though it is a technical question. I had a hard time trying to understand the coroutine process. I kind of get it but then I don't get it at the same time.
I found this answer here very helpful:
https://dev.to/thibmaek/explain-coroutines-like-im-five-2d9
To quote from Idan Arye:
To build on your story, I'd put it something like this:
You start watching the cartoon, but it's the intro. Instead of watching the intro you switch to the game and enter the online lobby - but it needs 3 players and only you and your sister are in it. Instead of waiting for another player to join you switch to your homework, and answer the first question. The second question has a link to a YouTube video you need to watch. You open it - and it starts loading. Instead of waiting for it to load, you switch back to the cartoon. The intro is over, so you can watch. Now there are commercials - but meanwhile a third player has joined so you switch to the game And so on...
The idea is that you don't just switch the tasks really fast to make it look like you are doing everything at once. You utilize the time you are waiting for something to happen(IO) to do other things that do require your direct attention.
Definitely check the link, there are much more that I cannot quote everything.
From
Programming in Lua
[1], "Coroutines
" section:
A coroutine is similar to a thread (in the sense of multithreading): it is a line of execution, with its own stack, its own local variables, and its own instruction pointer; but it shares global variables and mostly anything else with other coroutines. The main difference between threads and coroutines is that, conceptually (or literally, in a multiprocessor machine), a program with threads runs several threads in parallel. Coroutines, on the other hand, are collaborative: at any given time, a program with coroutines is running only one of its coroutines, and this running coroutine suspends its execution only when it explicitly requests to be suspended.
So the point is: Coroutines are "collaborative". Even in multi-core system, there is only one coroutine running at any given time (but multiple threads can run in parallel). There is non-preemptive between coroutines, the running coroutine must relinquish the execution explicitly.
For "concurrency
", you can refer Rob Pike's
slide
[2]:
Concurrency is the composition of independently executing computations.
So during coroutine A's execution, it passes control to coroutine B. Then after some time, the coroutine B passes control back to coroutine A. Since there is dependency between coroutines, and they must run in tandem, so the two coroutines are not concurrency.
[1] http://feistyduck.myshopify.com/products/programming-in-luaconcurrency
. Why do you think the conclusion doesn't seem to match the definition provided? - Nan Xiao
I find an explanation from this link [1] is pretty straight forward. None of those answers try to explain concurrency vs parallelism except the last bullet point in this answer [2].
cited from "programming Erlang", by Joe Armstrong, the legendary:
a concurrent program can run potentially faster on a parallel computer.
a concurrent program is a program written in a concurrent programming language. We write concurrent programs for reasons of performance, scalability, or fault tolerance.
a concurrent programming language is a language that has explicit language constructs for writing concurrent programs. These constructs are an integral part of programming language and behave the same way on all operating systems.
a parallel computer is a computer that has several processing units (CPUs or cores) that can run at the same time.
So concurrency is not the same as parallelism. You can still write concurrent programs on a single-core computer. The time-sharing scheduler will make you feel your program is running concurrently.
The concurrent program has the potential to run in parallel in a parallel computer but not guaranteed. OS may only give you one core to run your program.
Therefore, concurrency is a software model from a concurrent program that doesn't mean your program can run in parallel physically.
The word “coroutine” is composed of two words: “co” (cooperative) and “routines” (functions).
a. does it achieve concurrency or parallelism?
To be simple, let's discuss it on a single-core computer.
Concurrency is achieved by time-shares from OS. A thread executes its code in its assigned time frames on the CPU core. It can be preempted by OS. It may also yield control to OS.
A coroutine, on the other hand, yields control to another coroutine within the thread, not to OS. So all coroutines within a thread still exploit the time frame for that thread without yielding the CPU core to other threads managed by OS.
Therefore, you can think of coroutine achieves time-shares by the user not by OS (or quasi-parallelism). Coroutines run on the same core assigned to the thread that runs those coroutines.
Does Coroutine achieve parallelism? If it's CPU-bound code, no. Like time-shares, it makes you feel they run in parallel but their executions are interleaved not overlapped. If it's IO-bound, yes, it achieves parallel by hardware (IO devices) not by your code.
b. the difference with function call?
As the pic shows, it doesn't need to call return
to switch control. It can yield without return
. A coroutine saves and shares state on the current function frame (stack). So it's much more lightweight than function since you don't have to save registers and local variables to stack and rewind call stack when call ret
.
Coroutine is similar to subroutine/threads. The difference is once a caller invoked a subroutine/threads, it will never return back to the caller function. But a coroutine can return back to the caller after executing a few piece of code allowing the caller to execute some of its own code and get back to the coroutine point where it stopped execution and continue from there. ie. A coroutine has more than one entry and exit points
Basically, there are two types of Coroutines:
Kotlin implements stackless coroutines — it’s mean that the coroutines don’t have own stack, so they don’t map on native thread.
These are the functions to start the coroutine:
launch{}
async{}
You can learn more from here :
https://www.kotlindevelopment.com/deep-dive-coroutines/
https://blog.mindorks.com/what-are-coroutines-in-kotlin-bf4fecd476e9
If you are still confused, here is a very simple way of understanding a co-routine
. First off, what is a routine
? In a lay man's term, a routine is something that we do again and again (for example, your morning routine). Similarly. in programming languages, a routine
is a piece of code that we use again and again, e.g., a function
. Now, if you look at the general characteristic of a function or routine
(note: I am cautiously using these two terms interchangeably), it takes some inputs and hogs the CPU threads for as long as the function needs to output the result. Meaning, functions or routines
are blocking calls in you code. However, a co-routine
is a special kind of routine that can co-exist (the "co" part of the word co-routine comes from this) with other routines at the same time and we can make this happen in programming languages with the help of asynchronous programming. In Asynchronous programming, when one co-routine is waiting for something to happen (e.g., disk io), the other co-routine will start working and when this co-routine is in a waiting state the other co-routine will be active ultimately reducing the waiting time of our code.
If you understand the above, let's see how you can create a co-routine function in Python. You can define a co-routine function as following -
async def my_coroutine_function():
return 123
And you can the call the above co-routine by adding await
in front of the co-routine-
my_result = await my_coroutine_function()
To conclude,
When you're watching a TV show and as soon as the Ad comes, you take your phone and text a friend - what you have just done is Asynchronous Programming. While your TV show (a co-routine) was on a waiting state, you went ahead and made your other co-routine (texting your friend) active.
A coroutine is a single-threaded solution to achieve concurrency.
A-Start ------------------------------------------ A-End
| B-Start -----------------------------------------|--- B-End
| | C-Start ------------------- C-End | |
| | | | | |
V V V V V V
1 thread->|<-A-|<--B---|<-C-|-A-|-C-|--A--|-B-|--C-->|---A---->|--B-->|
In comparison to a multithreading solution:
thread A->|<--A| |--A-->|
thread B------>|<--B| |--B-->|
thread C ---------->|<---C| |C--->|
On a different note,
in python gevent
library is a coroutine
based networking library which gives you threadlike features like async network requests, without the overhead of creating and destroying threads. The coroutine
library used is greenlet
.
From Python Coroutine [1]:
Execution of Python coroutines can be suspended and resumed at many points (see coroutine). Inside the body of a coroutine function, await and async identifiers become reserved keywords; await expressions, async for and async with can only be used in coroutine function bodies.
From Coroutines (C++20) [2]
A coroutine is a function that can suspend execution to be resumed later. Coroutines are stackless: they suspend execution by returning to the caller. This allows for sequential code that executes asynchronously (e.g. to handle non-blocking I/O without explicit callbacks), and also supports algorithms on lazy-computed infinite sequences and other uses.
Compare with other's answer:
In my opinion, the resumed later part is a core difference, just like @Twinkle's.
Although many fields of the document are still work in progress, however, this part is similar to most answer, except @Nan Xiao 's
Coroutines, on the other hand, are collaborative: at any given time, a program with coroutines is running only one of its coroutines, and this running coroutine suspends its execution only when it explicitly requests to be suspended.
Since it's quoted from Program in Lua, maybe it's language related(not familiar with Lua currently), not all document mentioned the only one part.
The relation with concurrent:
There is an "Execution" part of the
Coroutines (C++20)
[3].Too long to quote here.
Besides the detail, there are several states.
When a coroutine begins execution
When a coroutine reaches a suspension point
When a coroutine reaches the co_return statement
If the coroutine ends with an uncaught exception
When the coroutine state is destroyed either because it terminated via co_return or uncaught exception, or because it was destroyed via its handle
as the comment from @Adam Arold under @user217714's answer. It's concurrency.
But it's different from multithreading.
from std::thread
[4]
Threads allow multiple functions to execute concurrently. Threads begin execution immediately upon construction of the associated thread object (pending any OS scheduling delays), starting at the top-level function provided as a constructor argument. The return value of the top-level function is ignored and if it terminates by throwing an exception, std::terminate is called. The top-level function may communicate its return value or an exception to the caller via std::promise or by modifying shared variables (which may require synchronization, see std::mutex and std::atomic)
Since it's concurrency, it works like multithreading especially when waiting is unavoidable(from the OS perspective), that's also why it's confusing.
[1] https://docs.python.org/3/reference/compound_stmts.html#coroutine-function-definitionThere is a long street with banks, offices, shops, restaurants, schools along the way. You and your neighbors and friends and family's and perfect strangers, all of you living at that street have a job.
Now you start from home and take a Taxi. Midway, you have to stop at the bank to draw some cash. You hop off the taxi, because you don’t want to keep it waiting. Taxi is now free to give a ride to some one else with a job.
Once you are finished with your work in bank, you come out, wave at another Taxi that was waiting nearby and move along.
You all are Coroutines. Taxis are Threads.
I will expand on @user21714 's answer. Coroutines are independent paths of execution that can not run simultaneously. They depend upon a controller - for example a python
controller library - to handle switching between these paths. But for this to work the coroutines themselves need to invoke yield
or similar structures that allow their execution to be paused.
Threads instead are running on independent compute resources and in parallel with each other. Since they are on different resources there is no need for invoking yield to allow the other paths of execution to proceed.
You can see this effect by starting a multihreaded program - e.g. a jvm
application - in which all eight of your core i7
hyperthread cores are utilized: you might see 797% utilization in Activity Monitor
or Top
. Instead when running a typical python
program - even one with coroutines
or python threading
- the utilization will max out at 100%. I.e. one machine hyperthread.
From Wikipedia [1]:
[1] https://en.wikipedia.org/w/index.php?title=Coroutine&oldid=1222161955#ThreadsCoroutines are very similar to threads. However, coroutines are cooperatively multitasked, whereas threads are typically preemptively multitasked. Coroutines provide concurrency, because they allow tasks to be performed out of order or in a changeable order, without changing the overall outcome, but they do not provide parallelism, because they do not execute multiple tasks simultaneously. The advantages of coroutines over threads are that they may be used in a hard-realtime context (switching between coroutines need not involve any system calls or any blocking calls whatsoever), there is no need for synchronization primitives such as mutexes, semaphores, etc. in order to guard critical sections, and there is no need for support from the operating system.
It is possible to implement coroutines using preemptively-scheduled threads, in a way that will be transparent to the calling code, but some of the advantages (particularly the suitability for hard-realtime operation and relative cheapness of switching between them) will be lost.
Kotlin Coroutine
[Synchronous vs Asynchronous] [1]
[Concurrency vs Parallelism] [2]
Usually we heart something like - coroutines are light weight threads, they allow us to write asynchronous, non-blocking code in a synchronous manner
As for Kotlin Coroutines:
Coroutine is a synthetic sugar/additional layer which allows you to run a task in a non-blocking way and without callbacks. Coroutine consists of some components:
Lets review some example
class MyClass {
val network = Network()
val fileSystem = FileSystem()
suspend fun downloadFile(): File {
//suspendCoroutine is key point
return suspendCoroutine { continuation ->
network.download(callback: Network.Callback {
override fun onSuccess(file: File) {
continuation.resume(file)
}
})
}
}
suspend fun saveFile(file: File) {
//suspendCoroutine is key point
return suspendCoroutine { continuation ->
fileSystem.save(callback: FileSystem.Callback {
override fun onSuccess() {
continuation.resume()
}
})
}
}
GlobalScope.launch {
val downloadResult = downloadFile() //1. suspend function
show(downloadResult) //2. UI
saveFile(downloadResult) //3. suspend function
}
Continuation
It creates Continuation
class which is state machine
with invokeSuspend()
function inside. invokeSuspend()
is called(like callback) at the end of any suspend
function
class Continuation {
int label;
//block of local variabels
File file;
void invokeSuspend(Object result) {
switch (label) {
case 0: {
label = 1;
downloadFile(this); //1. suspend function
return;
}
case 1: {
file = (File) result; //work with result
show(file); //2. UI
saveFile(file, this); //3.suspend function
return;
}
}
}
}
class MyClass {
fun downloadFile(continuation: Continuation): File {
//logic
continuation.invokeSuspend(file)
}
fun saveFile(file: File, continuation: Continuation) {
//logic
continuation.invokeSuspend()
}
}
suspended
continuation: Continuation
)Continuation.resume() -> Continuation.invokeSuspend()
The main point that behavior of coroutine
completely depends on library realisation
yield
generators in Python are an example of coroutines.async
declares are fully fledged coroutine. - Konrad Rudolphswitching between coroutines need not involve any system calls or any blocking calls whatsoever
. Can you pls clear it for me? Cause, goroutine creates threads. Is it same for lua? - reyad