Conversation
Note that some USE_ACC, especially in controlling the multigpu code, are necessary. Otherwise, a non-GPU compile will behave quite badly. So, those remain.
This is a significant update that attempted to fix the multiGPU parallel operations. It did not succeed, but the async functionality may still be worthwhile to use later on. Other changes over the last month as I evolved the integral code for Jellium/etc are contained here too.
Contributor
|
It would be nice to fix the multiGPU operation with the latest nvidia compilers, but I also suspect we just have a compiler bug on our hands. Versions 24.9 and 24.11 do not work; 20.7 is fine. |
Member
Author
|
Hmm @paulzim46 do you think it's worth trying to sufficiently isolate it to provide the compiler team with a lead if it is a bug? While working on the infrastructure, nvhpc has also been the most difficult dependency to manage. It seems like their licensing makes it difficult for a package manager to redistribute. I've noticed that gcc also has openacc support which may be easier to manage from an infrastructure angle, but I don't know whether it's as reliable and performant in general, or how many subtle differences there are between them. |
Closed
also removed non-reference output files from lih_VK1 example
…cmake CMake configuration for installation and include directories
helped with installing on perlmutter
…d backend version to 0.1.15
Update example in README because we deleted the one previously referred to
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Paul plans to work on this branch for some time and use it as the dependency for his replacement of FancyElectrons. It is being kept separate because Paul's changes likely will not continue to be compatible with FancyElectrons. Once the replacement is complete, the plan is to merge this pr and pin FancyElectrons to a commit before this pr was merged. Ultimately, Paul plans to migrate all FancyElectrons usage in the group to his replacement.
I'm creating this pr well before it's ready to merge as a space for notes and to monitor progress.
For Alex and Nate: do the examples currently in the repository look representative of what would be helpful for validating the accuracy of the code?
Vaibhav's testing
Closes #16
Closes #17