Async #13

felipeZ · 2018-11-13T15:37:38Z

Create an asynchronous interface to call a lieMD simulation; see #11.

API

There are 3 new functions that support the async call:

run_async_ligand_solvent_md
run_async_ligand_protein_md
query_liemd_results

How does it work?

Both run_async_ligand_solvent_md and run_async_ligand_protein_md take the same input as their blocking counterparts, run_ligand_solvent_md and run_ligand_protein_md, respectively. These async functions submit a job using cerise and return immediately.

These functions return a dictionary specified here. The dict returned by these functions can be used together with the query_liemd_results function to retrieve the output. When this last function is invoked, it checks the status of the job using the job_name, username and other metadata provided by either run_async_ligand_solvent_md or run_async_ligand_protein_md.
If the job is still running, an empty dictionary is returned together with the status. If the job has finished successfully, a results dictionary is returned. In case of failure, the status and an empty dictionary are returned.
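A minimal sketch of the submit-then-poll pattern described above. It does not call lie_md: `make_fake_query` is a hypothetical stand-in for query_liemd_results, and the status strings ("running", "completed"), the `(status, results)` return shape, and the `output` key are assumptions made for illustration only.

```python
import time
from typing import Callable, Dict, Tuple

QueryFn = Callable[[Dict], Tuple[str, Dict]]


def wait_for_results(query: QueryFn, job_info: Dict,
                     interval: float = 0.0,
                     max_attempts: int = 10) -> Tuple[str, Dict]:
    """Poll `query` until the job is no longer running or attempts run out."""
    for _ in range(max_attempts):
        status, results = query(job_info)
        if status != "running":
            # Either finished successfully (results filled in)
            # or failed (results empty).
            return status, results
        time.sleep(interval)
    # Still running after max_attempts polls: report that to the caller.
    return "running", {}


def make_fake_query(polls_until_done: int) -> QueryFn:
    """Hypothetical stand-in for query_liemd_results (assumption, not the real API)."""
    calls = {"n": 0}

    def fake_query(job_info: Dict) -> Tuple[str, Dict]:
        calls["n"] += 1
        if calls["n"] < polls_until_done:
            return "running", {}
        # Placeholder payload; the real results dictionary is defined by the schema.
        return "completed", {"task_id": job_info["task_id"], "output": "placeholder"}

    return fake_query
```

The caller keeps the dictionary returned at submission time and passes it unchanged to every poll; nothing else needs to be remembered between calls.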

See the example

LourensVeen · 2018-11-13T16:03:16Z

Hey, great to see this being implemented!

Looks good on the whole, but I have a few remarks about the schema:

The async response should never return 'completed', because it won't return any results. I'm not sure if it makes sense to forbid this in the schema. Also, is the schema for the result not defined centrally? This reads to me like this is a locally defined result type, but maybe I'm misreading, or it's not a problem.
Could the service return a single identifier only (job_name or task_id, not sure which is more appropriate), for passing back to query_liemd_results? I think it would be neater if things like the container name and port are internal to lie_md, because all sorts of crazy stuff could happen if these become corrupted on the client side (including security issues).

felipeZ · 2018-11-13T17:10:29Z

@LourensVeen you are right that the async function should never return completed. Also, the schemas use a local type, but I have also defined the future type as in common_resources. I am just not sure whether we need to define this future type globally.

Also, job_name = "job_" + task_id, so in principle we need only the task_id. I can also make job_name = task_id.
The problem with having just job_name is that the lie_md module would need to keep track of which containers/usernames are in use, but I think it is the client who should be in charge of that.

LourensVeen · 2018-11-14T09:28:26Z

The reason I was thinking it would be defined globally is that it carries a particular meaning, namely "this is an asynchronous service, you should call the specified callback again to get the result". The workflow engine needs to be able to distinguish between synchronous and asynchronous calls, and the idea is that it can do so by the return type. If it's a Future, it's asynchronous, otherwise it's synchronous and whatever comes out is the result.

If the workflow engine can distinguish this by itself, then the user doesn't have to think about it. They just specify the step, and the engine takes care of the rest. It's much more user-friendly.
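As a rough illustration of this idea, the sketch below shows how an engine could branch on the shape of a response. The key names (`status`, `task_id`), the status strings, and the `poll` callable are assumptions for the example, not the actual MD-Studio workflow engine or Future schema.

```python
from typing import Any, Callable, Dict

# Assumed minimal set of keys that marks a response as a Future.
FUTURE_KEYS = {"status", "task_id"}
FINAL_STATES = ("completed", "failed")


def is_future(response: Any) -> bool:
    """Treat a dict carrying all FUTURE_KEYS as an asynchronous Future."""
    return isinstance(response, dict) and FUTURE_KEYS.issubset(response)


def run_step(call_step: Callable[[], Any],
             poll: Callable[[str], Dict],
             max_polls: int = 100) -> Any:
    """Run one workflow step, hiding the sync/async difference from the user."""
    response = call_step()
    if not is_future(response):
        # Synchronous service: whatever comes out is the result.
        return response
    # Asynchronous service: call back until the Future reaches a final state.
    for _ in range(max_polls):
        if response["status"] in FINAL_STATES:
            return response
        response = poll(response["task_id"])
    raise TimeoutError("step did not finish within max_polls polls")
```

With this split the user only declares the step; whether the service behind it blocks or returns a Future is decided by the engine at run time.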

For the same reason, I don't think that the user should have to worry about docker container ids and port names. The user is busy thinking about proteins and ligands and binding sites! They shouldn't have to know that there's a Cerise running in a Docker that connects to a cluster. All they should have to know is that they put a protein-with-ligand-md step in their workflow which will produce the energies as its output. Everything else is overhead that we should try to minimise.

Also, what happens if the user specifies a container name and port number that's in use by another user? Will they be able to steal the other user's core hours?

marcvdijk · 2018-11-14T10:51:21Z

@felipeZ @LourensVeen Great to see this materialize so quickly.

As Lourens mentioned, I would try to keep the Future response as generic as possible and indeed have it be part of common_resources. It makes developing the workflow manager easier and more consistent and in the end it makes the life of the user easier.
It is very likely that more services are going to want to implement async endpoints and it would be consistent if these all make use of the same Future schema as much as possible.

That makes me wonder how many of the current parameters in the schema are lie_md specific and whether they could be derived from the task_id. A task_id is generic to the Future object regardless of the async endpoint implementation, but a job_type or a port is not. Why not associate the lie_md specific parameters with the task_id and store them in the lie_md or Cerise database?
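One way to picture this suggestion is a server-side registry keyed by task_id. The sketch below uses an in-memory dict as a stand-in for the lie_md or Cerise database; the class name, method names, and the example fields are hypothetical.

```python
import uuid
from typing import Any, Dict


class JobRegistry:
    """In-memory stand-in for a server-side store of service-specific job data."""

    def __init__(self) -> None:
        self._jobs: Dict[str, Dict[str, Any]] = {}

    def register(self, **internal: Any) -> str:
        """Store internal parameters and hand back only an opaque task_id."""
        task_id = uuid.uuid4().hex
        self._jobs[task_id] = dict(internal)
        return task_id

    def lookup(self, task_id: str) -> Dict[str, Any]:
        """Return a copy of the stored parameters; unknown ids raise KeyError."""
        return dict(self._jobs[task_id])
```

Because the client only ever holds the task_id, it cannot alter a container name or port, and an id that was never issued is rejected at lookup.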

felipeZ · 2018-11-14T11:01:50Z

I agree with both of you. I would return a Future object with only the task_id.

marcvdijk · 2018-11-14T11:06:20Z

And probably the URI of the endpoint needed to check progress and retrieve results.

felipeZ · 2018-11-14T15:07:20Z

Following the recommendation of both @LourensVeen and @marcvdijk, the functions run_async_ligand_solvent_md and run_async_ligand_protein_md now return a future object. This object contains the status, task_id and query_url. This last string is the URL of the endpoint to query for the status and results of the future.

The function query_liemd_results also returns a future object. When the status is completed, it also attaches the results to the future object, as specified in the schema.
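To make the shape concrete, here is a small sketch of such an object as a plain dictionary. The authoritative definition is the future schema in common_resources; the `results` key name and the `make_future` helper are assumptions for illustration.

```python
from typing import Any, Dict, Optional


def make_future(task_id: str, status: str, query_url: str,
                results: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Build a future-like dict; results are attached only when completed."""
    future: Dict[str, Any] = {
        "status": status,
        "task_id": task_id,
        "query_url": query_url,
    }
    if status == "completed":
        if results is None:
            raise ValueError("a completed future must carry results")
        # Key name "results" is an assumption; check the schema for the real one.
        future["results"] = results
    return future
```

A running or failed future built this way carries no results entry, which matches the earlier remark that an async submission should never report completed.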

We only need to store the task_id because we can always retrieve the whole service/job pair from the mongodb and the cerise interface.

The synchronous functions liemd_ligand and liemd_protein also return a [future object](https://github.com/MD-Studio/common_resources/blob/master/common_resources/schemas/resources/future.v1.json), but the results are available and the status is completed or failed.

felipeZ added 7 commits November 13, 2018 13:30

created async ligand simulation #11

dee21f4

fixed schemas

567a41e

called ligand_liemd simulation asynchronous #11

c75ab33

Query for the results after some time #11

5872328

added async call to protein liemd simulation #11

b639331

changed query function name #11

57e0468

removed unused print

5679684

felipeZ requested a review from marcvdijk November 13, 2018 15:37

felipeZ added 2 commits November 13, 2018 16:47

used /tmp/lie_md folder for travis

4a1d69f

fixed typo in test

ecd205f

felipeZ mentioned this pull request Nov 13, 2018

Fix multi-user running #10

Open

felipeZ added 5 commits November 14, 2018 13:08

updated schemas to used futures #11

5fc693c

removed redundant schemas #11

023d911

query service info from db #11

3de1b13

used the same functionality for async and blocking calls #11

349146e

Run in the test both async and blocking version #11

4226afb

marcvdijk merged commit 8350a5c into master Nov 29, 2018

marcvdijk deleted the async branch November 29, 2018 12:39
