Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Relion/5.0 tomgoraphy Reconstruction modality runs in an endless loop #1149

Open
schmitbp opened this issue Jun 20, 2024 · 2 comments
Open

Comments

@schmitbp
Copy link

Hello,
I've been attempting to use RELION's tomography suite for sub tomogram averaging. However I've consistently run into difficulties using the Reconstruction modality. I've been following Relion/5.0's subtomogram averaging tutorial and have been able to run all prior steps (seemingly successfully). However, when attempting to run the reconstruction step (after alignment using AreTomo), the program runs in an endless loop (for more than 18 hours) and has to be killed in order to free up the CPUs. I've checked and it looks as if the reconstruction worked (i.e., I can navigate to the Relion/ReconstructTomograms folder and can find the half1.mrc/half2.mrc reconstructions output from this part of the pipeline. However, if i check the CPU load, it's as if the program never stopped running and the job never moves to the completed tab. I will add that there is nothing written out in run.err, and the run.out section just shows:

  • Reconstructing 1 tomograms:
    • sample378-tiltseries
  • Reconstructing ...
    55.55/55.55 min ............................................................~~(,_,">

My Input .star file (output from the alignment part of the pipeline) looks like this:

Created by the starfile Python package (version 0.4.12) at 14:54:33 on 19/06/2024

data_global

loop_
_rlnTomoName #1
_rlnVoltage #2
_rlnSphericalAberration #3
_rlnAmplitudeContrast #4
_rlnMicrographOriginalPixelSize #5
_rlnTomoHand #6
_rlnMtfFileName #7
_rlnTomoTiltSeriesPixelSize #8
_rlnTomoTiltSeriesStarFile #9
sample378-tiltseries 300.000000 2.700000 0.100000 3.362000 -1.000000 mtf_k3_standard_300kV_FL2.star 6.724000 AlignTiltSeries/job043/tilt_series/sample378-tiltseries.star

We are running RELION on our lab's server with 64x CPUs and 6x Nvidia A40 GPUs. The version we're running is Relion/5.0-beta-gpu. I have loaded the following Modules:

  1. chpc/1.0 (S) 6) miniconda3/relion5
  2. gcc/8.5.0 7) relion/5.0-beta-gpu
  3. intel-oneapi-mpi/2021.1.1 8) ctffind/4.1.14
  4. cuda/12.2.0 (g) 9) aretomo2/1.1.2
  5. intel-oneapi-mkl/2022.0.2
    If relevant, I did check Generate Tomograms for Denoising. Has anyone else run into this problem?
    Thank you and happy to provide more information if needed.
@rdrighetto
Copy link

Is it consistently happening for you, or is it random? I run into this issue randomly on our cluster, so I assume just a glitch in network communication, but not sure.

@schmitbp
Copy link
Author

It's a consistent problem, anytime I start a reconstruction it starts the endless loop

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants