reorganize is_fastest_rate function for NSE_NET #1274

zhichen3 · 2023-07-12T17:20:40Z

Based on pr#1269. I reorganized is_fastest_rate function to make it more readable and renamed it to fill_merge_indices, since it is more descriptive and accurate of what the function does.

It also gives a slight improvement in the time it takes for this function.

Here's the new profile for running Detonation on perlmutter (no mpi or anything). For whatever reason get_partition_function doesn't take that much time anymore.

Flat profile:

Each sample counts as 0.01 seconds.
  %   cumulative   self              self     total           
 time   seconds   seconds    calls   s/call   s/call  name    
 16.12     12.61    12.61 754714265     0.00     0.00  std::_Function_handler<amrex::GpuTuple<int, int> (), amrex::ReduceData<int, int>::ReduceData<amrex::ReduceOpMax, amrex::ReduceOpMax>(amrex::ReduceOps<amrex::ReduceOpMax, amrex::ReduceOpMax>&)::{lambda()#1}>::_M_manager(std::_Any_data&, std::_Any_data const&, std::_Manager_operation)
 15.43     24.68    12.07 20251079     0.00     0.00  void fcn<burn_t>(amrex::Array1D<double, 1, 2>&, amrex::Array1D<double, 1, 2>&, burn_t const&, int&) [clone .constprop.0]
  9.55     32.15     7.47 375679202     0.00     0.00  std::_Function_handler<amrex::GpuTuple<amrex::ValLocPair<double, amrex::IntVect> > (), amrex::ReduceData<amrex::ValLocPair<double, amrex::IntVect> >::ReduceData<amrex::ReduceOpMin>(amrex::ReduceOps<amrex::ReduceOpMin>&)::{lambda()#1}>::_M_manager(std::_Any_data&, std::_Any_data const&, std::_Manager_operation)
  9.08     39.25     7.10      112     0.06     0.70  Castro::react_state(amrex::MultiFab&, amrex::MultiFab&, double, double, int)
  6.66     44.46     5.21 56871959     0.00     0.00  void apply_electrons<burn_t>(burn_t&)
  4.85     48.25     3.79  5949077     0.00     0.00  Castro::okToContinue()
  4.64     51.88     3.63  5832732     0.00     0.00  in_nse(burn_t&, bool)
  3.80     54.85     2.97  5751323     0.00     0.00  void nse_hybrid_solver<burn_t>(burn_t&, double)
  3.72     57.76     2.91 14532070     0.00     0.00  void actual_eos<eos_input_t, burn_t>(eos_input_t, burn_t&)
  3.59     60.57     2.81 952020582     0.00     0.00  get_partition_function(int, tf_t const&, double&, double&)
  2.94     62.87     2.30  5762508     0.00     0.00  void hybrj<2, burn_t>(hybrj_t<2>&, burn_t const&)
  2.68     64.97     2.10  9074378     0.00     0.00  sneut5(double, double, double, double, double&, double&, double&, double&, double&)
  2.31     66.78     1.81  8736723     0.00     0.00  void evaluate_rates<0, rate_t>(burn_t const&, rate_t&)
  1.89     68.26     1.48  8736723     0.00     0.00  void fill_reaclib_rates<0, rate_t>(tf_t const&, rate_t&)
  1.12     69.13     0.88 10467305     0.00     0.00  void chabrier1998<1>(plasma_state_t const&, scrn::screen_factors_t const&, double&, double&)
  0.97     69.89     0.76  9074378     0.00     0.00  rhs_nuc(burn_t const&, amrex::Array1D<double, 1, 23>&, amrex::Array1D<double, 1, 22> const&, amrex::Array1D<double, 1, 93> const&)
  0.87     70.57     0.68 47787585     0.00     0.00  fill_merge_indices(amrex::Array1D<int, 1, 2>&, double&, int, amrex::Array1D<double, 1, 22> const&, burn_t const&, amrex::Array1D<double, 1, 93> const&, amrex::Array1D<int, 1, 22> const&, double const&)
  0.65     71.08     0.51  8736723     0.00     0.00  void rate_s32_to_p_p31_derived<0>(tf_t const&, double&, double&)
  0.64     71.58     0.50  8736723     0.00     0.00  void rate_mg24_to_he4_ne20_derived<0>(tf_t const&, double&, double&)

zhichen3 and others added 30 commits May 30, 2023 17:07

temp save

278114e

initial commit

f33268e

Merge branch 'development' into simplified_sdc_ase

9beea45

fix typo

dd5b960

fix another composition

80d8908

simplify

2a94b31

change to rho instead of SRHO

88ddf97

more simplification

a34b37b

add rhoX when use nse_state

14ab6c2

Merge branch 'development' into simplified_sdc_ase

8e00d8c

Merge branch 'development' into simplified_sdc_ase

a6c3785

Merge branch 'development' into simplified_sdc_ase

bc44fff

update burn_to_eos

571667d

update and simplify ase network

a61becd

update

d2c4bfd

Merge branch 'development' into nse_neutron

f6e452e

Merge branch 'development' into update_nse_files

fea0d26

Merge branch 'development' into simplified_sdc_ase

93e92ca

Merge branch 'update_nse_files' into nse_neutron

d0326b8

Merge branch 'simplified_sdc_ase' into nse_neutron

4475c05

Merge branch 'ase_remove_neutron' into nse_neutron

077d0fa

update

48196c1

include additional rates

7cd960c

Merge branch 'development' into ase_remove_neutron

c795cc0

Merge branch 'ase_remove_neutron' into nse_neutron

64825cc

update namespace

8d0d4a3

update nse_check

72b8804

Merge branch 'development' into nse_neutron

55bc3fa

add stdouts if nse solver failed

2ab2f6a

change single_group metric for non-neutron network

7be2a8b

zhichen3 and others added 28 commits July 10, 2023 21:41

Merge branch 'optimize_nse' into more_optimize_nse

131f08f

clean up is_fastest_rate -> fill_merge_indices

0de4c23

missing colon

d450424

fix typo

ead2bbd

fix segfault issue

215086b

update

07a9f95

revert

6a27aae

fix index

78b5c48

change order of index check

bf4792b

revert for checking

0adef02

test

fb90d83

test

766dd9c

add stdcouts to debug

835c8c1

add more stdcouts

d179120

add more couts

f6ef70d

more couts

4e01622

mroe debug

f8f769c

fix merge_indices issue

fe89ad9

cleanup

c3e6220

more fix

7a83f7b

fix

3e89568

fix initial index

38091ff

use Y_group to construct b_f and b_r

a50ca7c

revert

121b8cc

remove blank lines

04d6604

Merge branch 'development' into more_optimize_nse

ccd70ad

remove old comment

7330fa2

Merge branch 'development' into more_optimize_nse

8ee44da

zingale approved these changes Jul 12, 2023

View reviewed changes

zingale merged commit e5c0f75 into AMReX-Astro:development Jul 12, 2023
20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reorganize is_fastest_rate function for NSE_NET #1274

reorganize is_fastest_rate function for NSE_NET #1274

zhichen3 commented Jul 12, 2023

reorganize is_fastest_rate function for NSE_NET #1274

reorganize is_fastest_rate function for NSE_NET #1274

Conversation

zhichen3 commented Jul 12, 2023