-
-
Notifications
You must be signed in to change notification settings - Fork 30.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bpo-32309: Implement asyncio.ThreadPool #18410
bpo-32309: Implement asyncio.ThreadPool #18410
Conversation
self.loop.run_until_complete(pool_cm()) | ||
|
||
|
||
class ThreadPoolTests(unittest.TestCase): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could the tests benefit from IsolatedAsyncioTestCase
? I guess it would remove run_until_complete
calls along with using async def main
. Doc : https://docs.python.org/3/library/unittest.html#unittest.IsolatedAsyncioTestCase
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Based on my local experimentation so far, it seems be to working as intended and does provide a bit of a readability benefit in terms to reducing white noise. Just waiting on a +1 from @1st1 since this would be the first CPython regression test to use IsolatedAsyncioTestCase
(outside of unittest
of course).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Update: I'd like to consider using IsolatedAsyncioTestCase
in a separate PR, after the initial version is implemented since we're a bit short on time with the upcoming feature deadline for 3.9 beta.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll be interested to see the non-wasteful native implementation when you'll get to it :-)
I'll definitely let you know when there's a working native implementation then! That's probably the part that I'm most looking forward to, but I do think it's important to have an initial stable version to work with. :) In the meantime, this has the benefit of being more convenient to use than |
Use abc.ABC and abc.abstractmethod to prevent direct instantiation of AbstractPool.
First, thanks a lot for this PR! I was surprised when I found If I may make a suggestion, I would say that the use of "IO-bound" throughout the documentation here is not particularly appropriate; long-running, CPU-bound routines will also block the event loop and so are also well-suited to running in a separate thread. Perhaps "blocking" is a more suitable general term? [Disclaimer: not familiar with any Python style, this is my opinion] More broadly, I think from a style POV, the documentation should avoid prescribing a use case for this feature, and instead describe its general functionality with reference to common use cases for it. E.g. the docs for ThreadPool say
Even changed to "blocking," this still seems too prescriptive IMO. Maybe there are other reasons users would want to await threads? I don't know. |
I soon as I commented that I remembered the global interpreter lock. I have no idea how that interacts with async code or executors. Please ignore my comment where it is inaccurate! |
No problem. :-) The distinction between "high-level" and "low-level" can be a bit subjective, but in our case the goal is for users to be able to accomplish most things in asyncio with a "high-level" API that doesn't require direct interaction with the event loop. It adds some unnecessary boilerplate and complexity that can be easily misused (especially with regards to the cleanup process and sharing objects between event loops). The "low-level" API will remain in place for asyncio-based libraries, but eventually users should not have to use it much at all. It may take some time though for the ecosystem to adopt the model, and some parts are still a work in progress. If you're specifically curious about which parts of asyncio are defined as high-level vs low-level, we have a couple of dedicated sections in the documentation: high-level index
Due to the GIL, a thread pool would not be especially suitable for CPU-bound operations; it would effectively block the event loop. Instead, in order to avoid blocking the event loop, one would have to use a process pool in order to circumvent the GIL, since each process has its own separate GIL (similar to how you could pass an instance of In the bpo issue, I mentioned that we might want to eventually consider an |
Of course, the high- / low-level distinction seems a good idea to me, no qualms about that. I suppose it's still possible an external library routine could do some number crunching with the GIL unlocked. All I was saying was that I think we should slightly re-word the docs as to not prescribe the thread pool to just IO-bound things. I can suggest some changes so you can see what I mean? |
The main reason why the current documentation for It also nicely paves the way for asyncio.ProcessPool in the future: for IO-bound operations, use That being said, I would be glad to consider any changes to the wording in the docs to make it a bit less prescriptive, as long as it's still clear what the primary use cases are. To some degree, I think the examples can help communicate that, but I want to at least mention it in the docs. It's a fine balance between making the use cases obvious enough while not limiting its usefulness in other areas; I can see that. |
I completely agree with your views on the use case. I think all I mean is that, technically the thread pool's typical use for only IO-bound work is a CPython implementation detail. It's a point of consistency: just like the 'threading' module discusses threads in general, and mentions the GIL as an implementation detail, I think these docs should too. E.g. the ThreadPool docs could change from An asynchronous threadpool that provides methods to concurrently
run IO-bound functions, without blocking the event loop. to something like An asynchronous threadpool that provides methods to concurrently
run functions in separate threads, preventing the event loop from being
blocked.
.. impl-detail::
Due to the Global Interpreter Lock, only one thread can execute Python
code at a time. This makes ThreadPool mainly suitable for I/O-bound
functions or third-party library functions which release the GIL. If you see what I mean? [I will leave the formatting decisions to you :)] I will also leave some other thoughts now, let me know if they are not welcome. |
Yeah, I quite like the overall idea of the suggestion. In general, it's a good goal for the documentation to be less specific to CPython as much as reasonably possible, without sacrificing readability and understanding for the majority of the audience. I'll have to go over the specific wording of it, but I like the separation of a more general Python language definition and a CPython implementation specific part that mentions the main use case in IO-bound operations. I'll likely include some links to other parts of the documentation when mentioning the GIL, so that readers can have somewhere to go for more information without going off-topic. Thanks for the detailed feedback and suggestions. @1st1 As the current primary author and maintainer of the asyncio docs, would something along the lines of what @tmewett suggested would be appropriate, specifically the separation between language definition and CPython details? Although I'm authoring this specific section and +1 on the idea, I'd like to ensure it fits well with the overall theme of the existing documentation. |
Co-authored-by: Tom M <[email protected]>
No, I meant, I wasn't in the discussion and I tried to skim it but didn't end up with a clear picture of where things ended up.
Yeah. The closest equivalent is Probably the most unusual design choice is that we don't have any explicit "thread pool" concept. Classically, thread pools are solving two separate problems at once: (1) reusing threads to amortize startup costs (i.e. basically acting like a cache for thread objects), (2) putting a policy on the total number of threads, to avoid overwhelming the system. We break these responsibilities apart, so there's a "thread cache" that amortizes startup + This is nice because:
There's more detail here: https://trio.readthedocs.io/en/stable/reference-core.html#threads-if-you-must (Some limitations of the current implementation: (a) our thread cache is currently trivial and doesn't actually re-use threads; it turns out that re-using threads is actually a pretty minor optimization, but we'll implement this eventually. See python-trio/trio#6. (b) |
Oh my, I actually like Trio design a lot. Huge thanks, Nathaniel, for sharing this. @asvetlov @cjerdonek @elprans I'm interested to hear your opinion. I think that a simple ThreadPool implementation is infinitely better than what we have now with What I like a lot about Trio's design is Should we go ahead with this PR and consider implementing a similar to Trio API in 3.10? |
Co-authored-by: Yury Selivanov <[email protected]>
For the eventual full native implementation of That's still significantly different from the global autoscaling w/ threads in Trio, but I think that autoscaling in general is something we may want to consider. In the bpo issue, I believe we were considering that the native implementation of |
Thread spawning is very cheap. Pulling a pre-existing thread out of a cache is even cheaper, but the difference is like 100 µs versus 30 µs or something. It's still probably worth having a cache, but it's only important for code that's pushing lots and lots of cheap operations into threads (e.g. heavy disk I/O where most data is in the cache, but occasionally something isn't, so you have to use a thread just-in-case). And in that regime, your cache is always warm, so pre-spawning doesn't really make a difference. |
I love trio's design. It is very common that there is one or two blocking operation. I am worrying about current asyncio.ThreadPool design is misleading for some users. def called_very_often():
with ThreadPool(...) as pool:
pool.run(blocking_work, args) Creating and shutting down thread pool is more expensive than a thread. |
I thought this too - having a threads only available inside the context of a managed pool doesn't seem consistent with Python's other APIs. Indeed, when i started learning asyncio I saw that you could await individual subprocesses, and naturally started looking for how I could await threads. So I think a more direct replacement of run_in_executor, for one function inside (effectively) one-off thread, would be a great eventual feature. But also I see no reason it should delay this one |
Yeah. @aeros @cjerdonek @methane Maybe we should add async def run_in_thread(func, /, *args, **kwargs):
loop = asyncio.get_running_loop()
return await loop.run_in_executor(None, *args, **kwargs) And for now we can get away without implementing threads auto scaling etc. As soon as this is merged we can start working on auto scaling for 3.10? I have a feeling that auto-scalable and simple |
Note that Trio's version was originally called If you want to read the bikeshedding before we settled on |
In order to implement auto-scaling though eventually (or any significant differences from The current implementation in this PR does not provide that, but it does give us a baseline to work with testing purposes as we build one that's fully native to asyncio. My primary goal has been to have this completed by 3.10. Regarding Inada's point of it potentially being misunderstood and used in a way that causes the thread pool to be repeatedly created, I think that's certainly worth addressing. IMO, it would be best clarified with a note in the documentation, perhaps with a link to Also, regarding the proposed implementation above for async def run_in_thread(func, /, *args, **kwargs):
loop = asyncio.get_running_loop()
func_call = functools.partial(func, *args, **kwargs)
return await loop.run_in_executor(None, func_call) |
asyncio almost has too many APIs now, so I'd prefer adding the fewest new APIs as possible.
The implementation of auto scaling will be drastically different from the current code. Right now I'm inclined to include |
Yeah, I can certainly understand that. If we conclude that the current implementation of
I would certainly be interested in working on
I'll admit that it would be somewhat disappointing with it being one of my most involved contributions so far, but I would definitely understand and not take it personally. Regardless of the eventual outcome of this PR, this is something that I very much want to continue working on. :-) |
Yeah, that's my usual problem of pushing something too close to the deadline. In this case, the Kyle, can you open an alternative PR adding
I understand. Sorry. OTOH, implementing auto scaling & efficient thread management is more challenging and ultimately will be way more rewarding. |
Sure, I'll work on that and close this PR. But I do have a question regarding the location: At a glance, this seems like the function would be best placed in |
It's fine; or you can add new |
Lastly, we also need to look at all active deprecation warnings and act on them before 3.9. Do you have time for that or should I work on that? |
I'm going to be otherwise occupied tomorrow w/ college work, but I should have some time on Monday to help with that (assuming Monday isn't too late). Perhaps we could open a separate issue to group all of the deprecations that need to be transitioned into removals for 3.9. |
Implements `asyncio.to_thread`, a coroutine for asynchronously running IO-bound functions in a separate thread without blocking the event loop. See the discussion starting from [here](#18410 (comment)) in GH-18410 for context. Automerge-Triggered-By: @aeros
Implements `asyncio.to_thread`, a coroutine for asynchronously running IO-bound functions in a separate thread without blocking the event loop. See the discussion starting from [here](https://github.com/python/cpython/pull/18410GH-issuecomment-628930973) in pythonGH-18410 for context. Automerge-Triggered-By: @aeros (cherry picked from commit cc2bbc2) Co-authored-by: Kyle Stanley <[email protected]>
Implements `asyncio.to_thread`, a coroutine for asynchronously running IO-bound functions in a separate thread without blocking the event loop. See the discussion starting from [here](https://github.com/python/cpython/pull/18410GH-issuecomment-628930973) in GH-18410 for context. Automerge-Triggered-By: @aeros (cherry picked from commit cc2bbc2) Co-authored-by: Kyle Stanley <[email protected]>
Implements `asyncio.to_thread`, a coroutine for asynchronously running IO-bound functions in a separate thread without blocking the event loop. See the discussion starting from [here](python#18410 (comment)) in pythonGH-18410 for context. Automerge-Triggered-By: @aeros
Implements
asyncio.ThreadPool
, a high-level asynchronous context manager for concurrently running IO-bound functions without blocking the event loop. The initial implementation relies significantly uponloop.run_in_executor()
, but the long-term goal is to eventually use a fully native asyncio threadpool (which doesn't rely onconcurrent.futures.ThreadPoolExecutor
). See the bpo issue for more details.https://bugs.python.org/issue32309
Note: Disregard the name of the branch. It has all of the intended changes, but it seems that I accidentally added the changes on top of branch from my recently merged PR (#18057). This has no consequence though other than a misleading branch name, so I'll leave it as is.