
DB Creation and Updating #114

Open
jthompson-arcus opened this issue Oct 16, 2024 · 4 comments
Labels
question Further information is requested

Comments

@jthompson-arcus
Collaborator

clinsight/R/run_app.R

Lines 66 to 74 in f32e09d

```r
if (!file.exists(user_db)) {
  warning("No user database found. New database will be created")
  db_create(get_review_data(data), db_path = user_db)
} else {
  # Skip if not needed for faster testing:
  if (isTRUE(get_golem_config("app_prod"))) {
    db_update(get_review_data(data), db_path = user_db)
  }
}
```

Why is the database creation and updating not part of the preprocessing? Since we already require the study data and metadata to be created beforehand, it seems to make more sense to update the database when the study data is created, not every time the application is run.
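To make the suggestion concrete, a hypothetical preprocessing helper (a sketch only; `prepare_user_db` and the `study_data`/`user_db` argument names are my assumptions, reusing the `db_create()`/`db_update()`/`get_review_data()` calls from `run_app.R`) could handle the database step once, at data-creation time:

```r
# Hypothetical preprocessing step (sketch): create or update the user
# database when the study data is (re)generated, rather than at every
# app launch. Mirrors the logic currently in run_app.R.
prepare_user_db <- function(study_data, user_db) {
  review_data <- get_review_data(study_data)
  if (!file.exists(user_db)) {
    message("No user database found; creating a new one at ", user_db)
    db_create(review_data, db_path = user_db)
  } else {
    db_update(review_data, db_path = user_db)
  }
  invisible(user_db)
}
```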

@jthompson-arcus jthompson-arcus added the question Further information is requested label Oct 16, 2024
@LDSamson
Collaborator

You can always update the database manually, of course, which is probably smart to do. Do you propose removing these checks, and if so, why? Don't you think they add a layer of robustness by verifying that the database is in sync with the study data when the application starts?

@jthompson-arcus
Collaborator Author

I do think we should have the check in place, and db_update() does return early thanks to its simple sync-time check.
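For readers unfamiliar with the mechanism: a sync-time early return of the shape described above might look roughly like this (a hypothetical sketch, not clinsight's actual `db_update()` implementation; the `timestamp` column and `stored_sync_time` argument are assumptions):

```r
# Hypothetical sketch of an early-return sync check: compare the sync
# time stored in the database with the newest timestamp in the review
# data, and skip the (expensive) update when nothing has changed.
db_update_sketch <- function(review_data, stored_sync_time) {
  new_sync_time <- max(review_data$timestamp)
  if (identical(new_sync_time, stored_sync_time)) {
    return(invisible(FALSE))  # already in sync; nothing to do
  }
  # ... perform the actual database update here ...
  invisible(TRUE)
}
```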

I think my question around it is really two-fold:

  1. As a process flow, running this in the preprocessing step makes more sense than running it at app startup. But maybe that is just a documentation problem, since this can function mostly as a check.
  2. The update only runs when app_prod = TRUE, and it does so silently. That configuration has to be set intentionally, because the default is FALSE and it is easy to forget.

To a lesser extent I'm worried about initialization time, but this process here isn't really the culprit there.
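For reference on point 2, the relevant toggle lives in the golem config file (this is a sketch of the standard `inst/golem-config.yml` layout from the golem template, not clinsight's exact file):

```yaml
# inst/golem-config.yml (sketch): `app_prod` is FALSE in the default
# profile, so db_update() is silently skipped unless a production
# profile is explicitly activated (e.g. via R_CONFIG_ACTIVE).
default:
  app_prod: no
production:
  app_prod: yes
```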

@jthompson-arcus
Collaborator Author

@LDSamson another thought: study_data is only used here and in get_appdata() in app_server.R. Revisiting our process could help with load times in the application. Right now the app_data object gets created for each user at runtime; as far as I can tell, we should see performance gains if the application relies on app_data instead of study_data.
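One hypothetical way to get that gain (a sketch under my own assumptions; `make_cached_appdata` is an invented helper, and only `get_appdata()` comes from the app itself) is to memoize app_data once per process instead of rebuilding it per session:

```r
# Sketch: build app_data once per process (e.g. in run_app() or global
# scope) instead of inside the per-session server function, so every
# user session shares the same precomputed object.
make_cached_appdata <- function(study_data, meta) {
  app_data <- NULL
  function() {
    if (is.null(app_data)) {
      # Computed once, on first access, then reused by later sessions.
      app_data <<- get_appdata(study_data, meta)
    }
    app_data
  }
}
```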

@LDSamson
Collaborator

It might indeed be better to rely on app_data. Historically, the app relied on multiple objects being available (app_data, app_tables, app_vars, metadata), which was too complicated and led to the current situation. I don't think the performance gain will be big, but relying only on app_data can help.

We can also simplify the in-memory metadata, since I think we only use metadata$items_expanded and some metadata-derived app_vars in the application.

Since the app is slow to start up, we should aim for better performance and identify the biggest bottlenecks. The biggest gains can probably be achieved by improving the plotly functions: designing the figures with native plotly functions instead of building them in ggplot2 and then converting them with plotly::ggplotly().
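To illustrate the two routes (a generic sketch using built-in `mtcars` data, not clinsight's actual figure code):

```r
library(ggplot2)
library(plotly)

# Conversion route: build a ggplot, then translate it to plotly.
# The ggplotly() translation step is typically the slow part.
p_converted <- ggplotly(
  ggplot(mtcars, aes(x = wt, y = mpg)) + geom_point()
)

# Native route: construct the equivalent scatter plot directly with
# plot_ly(), skipping the ggplot build and translation entirely.
p_native <- plot_ly(
  mtcars, x = ~wt, y = ~mpg,
  type = "scatter", mode = "markers"
)
```

Both produce a plotly htmlwidget; the native route just avoids paying for the ggplot object construction and its conversion on every render.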
