Skip to content

Merging New Features#240

Merged
bputzeys merged 18 commits intoreleasefrom
main
Jun 18, 2025
Merged

Merging New Features#240
bputzeys merged 18 commits intoreleasefrom
main

Conversation

@maxiallard
Copy link
Contributor

This pull request introduces several updates to improve functionality, usability, and maintainability across multiple modules in the codebase. The most significant changes include adding support for attention outputs in embedding methods, improving installation instructions, and refining configuration and processing logic.

Enhancements to Embedding Methods:

  • Added an output_attentions parameter to embedding methods in Geneformer and scGPT models, enabling the return of attention maps for analysis. This includes updates to the get_embs function in geneformer_utils.py and _encode function in scgpt/model_dir/model.py to handle attention outputs. [1] [2] [3] [4]

  • Updated the TransformerEncoder import in scGPT to use a custom implementation for attention weight support.

Installation and Documentation Improvements:

  • Improved installation instructions in README.md, including a detailed guide for resolving issues with mamba-ssm installation using .whl files.

  • Updated the evo_2/README.md file to use HTTPS for cloning the vortex repository instead of SSH.

Configuration and Defaults:

  • Changed the default embedding mode in scGPTConfig from "cls" to "cell" for better usability. [1] [2]

  • Removed the precision field from transcriptformer_config.yaml to simplify configuration.

Code Cleanup and Refinements:

  • Refactored error and warning messages in Geneformer to improve readability and maintain consistency.

  • Replaced padding="max_length" with padding="longest" in the process_data method of the helix_mrna model for more efficient tokenization.

Miscellaneous:

  • Added new hash values for embedding-related files in constants/hash_values.py.

These changes collectively enhance the functionality, usability, and maintainability of the codebase, particularly in embedding workflows and installation processes.

@bputzeys bputzeys merged commit ad78f9a into release Jun 18, 2025
14 of 15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants