
async API #569

Merged · 11 commits into ARM-software:master · Jul 28, 2022
Conversation

@douglas-raillard-arm (Contributor) commented on Aug 19, 2021

Add an async API to devlib to take advantage of concurrency more easily, at both a coarse and a fine grain.

Note: this currently requires Python >= 3.7

Based on #568
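For illustration, a minimal sketch of how the async variants could read at the call site. Hedged: the .asyn accessor on converted Target methods is an assumption based on this PR's description, not a definitive API reference.

import asyncio

async def setup(target):
    # Both commands are dispatched concurrently instead of serializing
    # the SSH round-trips one after the other.
    await asyncio.gather(
        target.execute.asyn('echo hello'),
        target.execute.asyn('uname -a'),
    )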

@douglas-raillard-arm (Contributor, Author) commented on Aug 19, 2021

Preliminary results on:

        with target.cpufreq.use_governor('performance'):
            pass

1.4x faster on an SSH target (Juno r0):

  • on this branch: 3.7s (avg over 10 runs)
  • on master: 5.3s (avg over 10 runs)

2.2x faster on a local target:

  • on this branch: ~9s
  • on master: ~20s

@douglas-raillard-arm (Contributor, Author) commented

I checked all instances of threading.Lock in devlib and it should be safe, as none is on the path used when running a background command or any other coroutine.

If a function taking a threading.Lock were called by two coroutines that end up running concurrently, the second coroutine would block waiting for the lock, thereby blocking the main thread and deadlocking.
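A minimal standalone sketch of that failure mode (illustrative only, not devlib code):

import asyncio
import threading

lock = threading.Lock()

async def critical():
    # threading.Lock.acquire() blocks the whole OS thread, not just this
    # task. If another task on the same event loop already holds the lock,
    # the loop can never resume that task to release it: deadlock.
    lock.acquire()
    try:
        await asyncio.sleep(0.1)  # suspension point while holding the lock
    finally:
        lock.release()

async def main():
    await asyncio.gather(critical(), critical())

# asyncio.run(main())  # hangs forever for the reason described above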

@douglas-raillard-arm (Contributor, Author) commented

I also tried various combinations of thread pools, including running a blocking execute() in a different thread (so with separate SSH connections), and I did not get any real timing variation.

What I did get is some errors, probably because I was maintaining too many connections to the SSH server, so the current approach with one connection and many concurrent SSH channels seems to be the best. I could not saturate the server with open channels (maybe there is no limit). If that were to happen, we could use an asyncio.Semaphore to limit the number of bg commands in flight.
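A sketch of that fallback. Hedged: bounded_bg and MAX_BG_COMMANDS are made-up names, and the .asyn accessor is an assumption; this is not the PR's actual implementation.

import asyncio

MAX_BG_COMMANDS = 10  # assumption for illustration; not a devlib constant

sem = asyncio.Semaphore(MAX_BG_COMMANDS)

async def bounded_bg(target, cmd):
    # Hold a semaphore slot for the lifetime of the command so that at
    # most MAX_BG_COMMANDS SSH channels are in flight at any time.
    async with sem:
        return await target.execute.asyn(cmd)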

@douglas-raillard-arm (Contributor, Author) commented on Aug 31, 2021

There are a few things to clean up (docstrings etc.), so if you are happy with the API as it stands I will proceed with:

  • adding docstrings
  • renaming asyn.parallel to asyn.concurrently
  • removing the .concurrent attribute, as it is not that useful in practice I think (it allows running the function concurrently, only blocking to get the result when it is awaited; the "normal" coroutine behavior is to not do anything until the return value is awaited). asyn.concurrently is usually better, as it makes obvious what runs concurrently and avoids any kind of "task leak" (see the sketch below).

EDIT: cleanup done. Beyond possible high-level doc and converting more bits, the change is somewhat ready.
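A hedged sketch of how asyn.concurrently reads at the call site, assuming it awaits a collection of coroutines and returns their results, akin to asyncio.gather (the sysfs paths and the write_value.asyn accessor are assumptions):

from devlib.utils import asyn

async def configure(target):
    # The concurrency boundary is explicit at the call site, and nothing
    # can leak: both writes have completed once this await returns.
    await asyn.concurrently((
        target.write_value.asyn('/sys/foo', 1),  # hypothetical paths
        target.write_value.asyn('/sys/bar', 2),
    ))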

@douglas-raillard-arm (Contributor, Author) commented

While working on a bug fix for another problem, it came to my attention that SSH servers accept a fixed number of "sessions" per connection. These "sessions" seem to map to paramiko's channels. This means that for a given connection, we might be limited to e.g. 10 concurrent channels (the OpenSSH default). This is configurable with MaxSessions on the OpenSSH side and results in an exception on the paramiko side: Could not open an SSH channel: ChannelException(2, 'Connect failed')

I'll probably need to handle that in this PR to limit the number of opened channels. There does not seem to be a standard way of getting the maximum number of channels, but we can probably handle the exception as a hint, or attempt to read the OpenSSH config and parse it to find the MaxSessions value (a bit hacky though).
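One way the exception-as-a-hint idea could look, sketched over a raw paramiko Transport (probe_session_limit is a made-up name, not what the PR implements):

import paramiko

def probe_session_limit(transport, cap=64):
    # Open channels until the server refuses one, treating the
    # ChannelException as the hint that MaxSessions was reached.
    channels = []
    try:
        for _ in range(cap):
            channels.append(transport.open_session())
    except paramiko.ChannelException:
        pass  # 'Connect failed': the server hit its session limit
    for chan in channels:
        chan.close()
    return len(channels)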

@douglas-raillard-arm (Contributor, Author) commented

@setrofim @marcbonnici I've updated the PR with automatic detection of the number of allowed background commands, along with some automatic limiting in the async API to use at most half of the available channels.

Next steps:

  • Decide what version of Python we want to support
  • "Instrument" read_value() and write_value() so that we can detect when we are trying to read/write the same files concurrently. This should catch most issues where we try to do some setup concurrently, which is where the bulk of the speed-up will come from.

@douglas-raillard-arm (Contributor, Author) commented on Nov 11, 2021

Updated the PR with checks to make sure that no file accesses conflict when executing concurrent asyncio tasks. An artificial example:

from devlib.utils.asyn import *

m = ConcurrentResourceManager()

# Two views of the same host file: r1 is a write access, r2 a read access.
r1 = FileResource('host', 'file1', 'w')
r2 = FileResource('host', 'file1', 'r')

async def f1():
    # m.track_resource(r1)
    m.track_resource(r2)
    print('from f1')


async def f2():
    # f2 writes file1 while f1 reads it concurrently, which the checks
    # are meant to flag as a conflict.
    m.track_resource(r1)
    m.track_resource(r2)
    print('from f2')

async def f3():
    # m.track_resource(r1)
    print('from f3')

run(
    m.concurrently(
        (
            f1(),
            m.concurrently(
                (
                    f2(),
                    f3(),
                )
            )
        )
    )
)

In the PR, ConcurrentResourceManager is target.resource_manager, which is thread-local.

Note: this depends on Python >= 3.7 for asyncio.current_task().

@douglas-raillard-arm (Contributor, Author) commented

FWIW, we have been using this PR in Lisa for a month now (the Lisa repo vendorizes devlib as a git subtree) and so far I haven't observed or heard of any issue related to it.

Commits:

  • Implement __aenter__ and __aexit__ on nullcontext so it can be used as an asynchronous context manager.
  • The connection returned by Target.get_connection() does not have its .busybox attribute initialized. This is expected for the first connection, but connections created for new threads should have busybox set.
  • Remove all the tls_property from the state, as they will be recreated automatically.
  • Remove dependencies that are ruled out due to the current Python minimal version requirement.
  • Require Python >= 3.7 in order to have access to a fully fledged asyncio module.
@douglas-raillard-arm (Contributor, Author) commented on Jul 26, 2022

@marcbonnici PR updated with:

  • Fix to connect(max_async)
  • Fix to the cpufreq.use_governor() racy file write
  • Fix to the asyn.asyncf() decorator for async generators: async generators are now consumed completely when crossing a blocking boundary, rather than failing to be awaited. We could maybe do something a bit smarter to preserve laziness, provided that we can asyncio.run() anywhere, so I'll have a look later.
  • Added the asyn.memoized_method() decorator, which works for both async and non-async code. It does not memoize non-hashable data, so it avoids the issues described in: Surprising memoized behavior for mutable containers #341 (see the usage sketch below)
  • Converted the final bits of the cpufreq module to async

EDIT: forgot to add the last bullet entry
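A usage sketch, assuming the decorator lives in devlib.utils.asyn as the description suggests (the class and method names are made up):

from devlib.utils.asyn import memoized_method

class Example:
    @memoized_method
    async def read_version(self):
        # Expensive target access; computed once, then cached per instance.
        ...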

Commits:

  • Home for async-related utilities.
  • Add async variants of Target methods.
  • Allow the user to set a maximum number of concurrent connections used to dispatch non-blocking commands when using the async API.
  • Make use of the new async API to speed up other parts of devlib.
@douglas-raillard-arm (Contributor, Author) commented

PR re-updated with a lazy async generator blocking shim. This preserves the laziness of the async gen, made possible by the nest_asyncio package.
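A sketch of the shim idea (illustrative, not the PR's actual code; it assumes the generator holds no loop-bound resources across yields):

import asyncio
import nest_asyncio

nest_asyncio.apply()  # makes asyncio.run() re-entrant

def blocking_iter(agen):
    # Pull items out of an async generator from synchronous code one at a
    # time, preserving its laziness instead of draining it up front.
    while True:
        try:
            yield asyncio.run(agen.__anext__())
        except StopAsyncIteration:
            return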

use_governor() was trying to set both the per-cpu and the global tunables concurrently for each governor, which led to a write conflict.

Split the work into the per-governor global tunables and the per-cpu tunables, and do all of that concurrently. Each task is therefore responsible for a distinct set of files and all is well.

Also remove @memoized on async functions. It will be reintroduced in a later commit when there is a safe alternative for async functions.
Add a memoize_method decorator that works for async methods. It will not leak memory, since the memoization cache is held in the instance __dict__, and it does not rely on hacks to hash unhashable data.
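A simplified sketch of that mechanism, reduced to zero-argument async methods (the real decorator handles more cases):

import functools

def memoize_method(f):
    # The cached value lives in the instance __dict__, so it is garbage
    # collected together with the instance and nothing needs to be hashed.
    attr = '__memoized_' + f.__name__
    @functools.wraps(f)
    async def wrapper(self):
        try:
            return self.__dict__[attr]
        except KeyError:
            self.__dict__[attr] = result = await f(self)
            return result
    return wrapper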
@marcbonnici merged commit fefdf29 into ARM-software:master on Jul 28, 2022.