Enable usched and initret and create ConverseExit #143

ritvikrao · 2025-10-24T21:26:20Z

To support NAMD, we need the ability for reconverse users to control the scheduler and do extra setup (aside from Cmi_startfn) on their own, if they choose. These changes re-work the thread launches to work more like old Converse, where rank 0 uses the current thread instead of launching a new thread. All cleanup occurs in ConverseExit, including comm backend cleanup. This also allows the scheduler's stop flag to be reset, allowing for repeated scheduler calls.

…erse into sched-modes

JiakunYan · 2025-10-29T02:26:46Z

@lvkale I’d like to loop you in to discuss this PR. Ritvik noticed that the current Charm++ code calls ConverseExit in two different ways:

The ck exit handler always calls ConverseExit on all PEs.
NAMD sets both usched and initret to 1, and its main thread calls ConverseExit at the very end.

Because of these two usage patterns, ConverseExit has to be implemented as a real exit function, as done in this PR. In this implementation, all worker threads with rank ≥ 1 enter an infinite loop at the end of ConverseExit, waiting to be terminated, while the rank 0 thread eventually calls exit, allowing the OS to kill all threads.

I’m not sure this is the most elegant way to handle program termination. What are your thoughts?

lvkale · 2025-10-30T15:41:39Z

This is a major-ish change. Would be good to get some old timers to review. Will you please tag Sam and Eric Bohm (and maybe Evan)?

ritvikrao · 2025-10-30T16:44:55Z

For @ericjbohm and others, I was hoping you would try these changes on NAMD and see if this works for you. I also would appreciate feedback on the design of the exit procedure (which I have to make compatible with existing Charm++).

ritvikrao added 10 commits October 23, 2025 12:52

usched and initret implementation

1321e19

reset stop flag

aa5f368

create ConverseExit

49bd709

Merge branch 'sched-modes' of https://github.com/charmplusplus/reconv…

0877376

…erse into sched-modes

fix nodereduction cmake

d060a16

initialization of csv on rank 0 only

72c9632

reduction id

4c08361

fix in header

8924c81

fix exit thread cleanup

fdd06c8

remove print

704c1fb

ritvikrao marked this pull request as ready for review October 27, 2025 13:53

ritvikrao requested a review from JiakunYan October 27, 2025 13:53

ritvikrao added 3 commits October 27, 2025 15:42

Merge remote-tracking branch 'origin' into sched-modes

7e71a1a

remove unneeded atomics

20f248c

remove another

4271fa5

ritvikrao requested review from ericjbohm and evan-charmworks October 30, 2025 15:43

ritvikrao added 2 commits October 30, 2025 15:59

Merge remote-tracking branch 'origin' into sched-modes

5d3b76c

Merge remote-tracking branch 'origin' into sched-modes

a14b51e

ritvikrao requested review from lvkale and stwhite91 November 2, 2025 15:16

Merge remote-tracking branch 'origin' into sched-modes

c466ff5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable usched and initret and create ConverseExit #143

Enable usched and initret and create ConverseExit #143

Uh oh!

ritvikrao commented Oct 24, 2025

Uh oh!

JiakunYan commented Oct 29, 2025

Uh oh!

lvkale commented Oct 30, 2025

Uh oh!

ritvikrao commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Enable usched and initret and create ConverseExit #143

Are you sure you want to change the base?

Enable usched and initret and create ConverseExit #143

Uh oh!

Conversation

ritvikrao commented Oct 24, 2025

Uh oh!

JiakunYan commented Oct 29, 2025

Uh oh!

lvkale commented Oct 30, 2025

Uh oh!

ritvikrao commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants