Make initial tensor dimensions general by khatchad · Pull Request #156 · ponder-lab/ML

khatchad · 2025-07-11T23:11:48Z

Add all dtypes to library summary.

codecov · 2025-07-11T23:13:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 59.74%. Comparing base (0197356) to head (6185d5a).

Additional details and impacted files

@@             Coverage Diff              @@
##             master     #156      +/-   ##
============================================
- Coverage     59.77%   59.74%   -0.03%     
+ Complexity      521      520       -1     
============================================
  Files           109      109              
  Lines          7480     7480              
  Branches        828      828              
============================================
- Hits           4471     4469       -2     
- Misses         2830     2832       +2     
  Partials        179      179



- Modified `PythonConstructorTargetSelector` to extract methods from both `IPythonClass` and `BypassSyntheticClass` (XML summaries), including the "do" method for the latter.
- Updated `PythonInstanceMethodTrampolineTargetSelector` to recognize summarized classes as callables.
- Updated `PythonSuper` with similar logic to support `super()` calls for summarized classes.
- Ensures `self` correctly refers to the instance object in summarized methods, addressing wala#127.


This commit resolves wala#127 by updating how synthetic methods defined in XML summaries are integrated into the Python front-end's object model.

- Modifies `PythonAnalysisEngine` to automatically translate synthetic XML methods into "Function Classes", allowing them to leverage the existing trampoline generation infrastructure.
- Restricts the application of the `IPythonClass` interface to only those synthetic classes that actually contain methods, preventing the erroneous interception of non-instance factory functions (e.g., `tf.reshape`) by the Python constructor selector.
- Updates `PythonInstanceMethodTrampolineTargetSelector` to support flexible lookups of the `__call__` method for synthetic types, ensuring implicit invocations (like `c()`) route correctly to the underlying `do` method.

This commit improves the Pass 2 logic in `PythonAnalysisEngine` to accurately distinguish between auto-generated dummy constructors and intentional factory functions defined in XML summaries (e.g., `pd.read_excel`).

- Updates the removal heuristic to only delete `do` methods that allocate the exact type of their declaring class AND belong to a class possessing other instance methods.
- Ensures that pure factory functions modeled as classes without instance methods remain untouched, preserving their original XML-defined semantics.
- Fixes `TestPandasModel.testPandas1` and ensures continued support for TensorFlow factory functions while maintaining the fix for wala#127.
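The removal heuristic described above can be sketched roughly as follows. This is a minimal illustration, not the code in `PythonAnalysisEngine`: `SummarizedClass`, `DoMethodHeuristic`, `shouldRemoveDo`, and the example names `Widget`, `make_frame`, and `Frame` are all hypothetical stand-ins, and the real logic operates on WALA class and method objects rather than strings.

```java
import java.util.List;

// Hypothetical stand-in for a class summarized in XML; not a WALA or Ariadne type.
final class SummarizedClass {
  final String name;
  final String typeAllocatedByDo;
  final List<String> instanceMethods;

  SummarizedClass(String name, String typeAllocatedByDo, List<String> instanceMethods) {
    this.name = name;
    this.typeAllocatedByDo = typeAllocatedByDo;
    this.instanceMethods = instanceMethods;
  }
}

final class DoMethodHeuristic {
  // A `do` method is treated as an auto-generated dummy constructor only when it
  // allocates exactly its declaring class AND that class has other instance methods.
  static boolean shouldRemoveDo(SummarizedClass c) {
    boolean allocatesOwnType = c.name.equals(c.typeAllocatedByDo);
    boolean hasInstanceMethods = !c.instanceMethods.isEmpty();
    return allocatesOwnType && hasInstanceMethods;
  }

  public static void main(String[] args) {
    // A class with instance methods whose `do` allocates its own type: removable.
    SummarizedClass withMethods = new SummarizedClass("Widget", "Widget", List.of("resize"));
    // A pure factory (no instance methods, allocates another type): left untouched.
    SummarizedClass pureFactory = new SummarizedClass("make_frame", "Frame", List.of());
    System.out.println(shouldRemoveDo(withMethods)); // prints "true"
    System.out.println(shouldRemoveDo(pureFactory)); // prints "false"
  }
}
```

Requiring both conditions is what keeps factory-style summaries intact: a class with no instance methods is never touched, even if its `do` happens to allocate its own type.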


…o remove the `read_dataset` helper. This change inlines tensor allocations directly within the `do` methods. It avoids the `read_dataset` helper pattern, which obfuscates object creation from 1-CFA context sensitivity. This prevents the aliasing of resulting tensor objects and ensures that tensor arguments are correctly tracked, fixing the failing test `testDataset19`.

… trampolines.

- Updated `PythonInstanceMethodTrampolineTargetSelector` to properly recognize and resolve `do` methods (via a new constant `DO_METHOD_NAME`) for summarized classes when they are invoked as callables.
- Added type safety to `getCall()` in `PythonMethodTrampolineTargetSelector` to gracefully return `null` instead of throwing `ClassCastException` on non-Python invoke instructions, avoiding analysis crashes.
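The `getCall()` change above amounts to a type guard before the cast. A minimal sketch of that pattern, assuming nothing about the real signature: `InvokeInstruction`, `PythonInvoke`, `OtherInvoke`, and `GetCallSketch` are hypothetical stand-ins for the instruction types that WALA and the Python front-end actually use.

```java
// Hypothetical stand-ins for instruction types; the real analysis uses WALA's own classes.
interface InvokeInstruction {}

final class PythonInvoke implements InvokeInstruction {}

final class OtherInvoke implements InvokeInstruction {}

final class GetCallSketch {
  // Returns the instruction as a PythonInvoke, or null when it is some other kind of
  // invoke, instead of casting unconditionally and risking a ClassCastException.
  static PythonInvoke getCall(InvokeInstruction inst) {
    if (inst instanceof PythonInvoke) {
      return (PythonInvoke) inst;
    }
    return null;
  }

  public static void main(String[] args) {
    System.out.println(getCall(new OtherInvoke()) == null); // prints "true"
  }
}
```

Callers then have to handle the `null` case explicitly, which trades a crash for a skipped instruction.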

Added an explanatory comment to the `from_tensor_slices` method detailing why dataset field initializations are duplicated across operations. This redundancy is required to bypass WALA's 1-CFA context sensitivity limits, ensuring unique allocation sites and preventing object aliasing in chained dataset operations.

Removes the centralized `read_dataset` helper methods in `Dataset` and `ImageDataGenerator` and inlines their logic directly into the `do` methods of various dataset operations. This circumvents 1-CFA context sensitivity limitations, ensuring unique allocation sites for dataset fields and preventing object aliasing in chained operations.
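One way to see why a shared helper can cause aliasing under 1-CFA is a toy model in which an abstract heap object is named by its allocation site plus the single most recent call site of the allocating method. The sketch below is only that toy model, not WALA's pointer analysis; `OneCfaSketch`, the site labels, and the method names are hypothetical.

```java
import java.util.HashSet;
import java.util.Set;

final class OneCfaSketch {
  // Toy naming scheme: allocation site plus the one call site that 1-CFA remembers.
  static String abstractObject(String allocSite, String lastCallSite) {
    return allocSite + "@" + lastCallSite;
  }

  // Helper pattern: client -> do -> read_dataset, which allocates. The allocating
  // method's context is the call site inside `do`, identical for every client call.
  static Set<String> withHelper(String... clientCallSites) {
    Set<String> objects = new HashSet<>();
    for (String ignored : clientCallSites) {
      objects.add(abstractObject("alloc_in_helper", "do:call_read_dataset"));
    }
    return objects;
  }

  // Inlined pattern: client -> do, which allocates directly. The allocating method's
  // context is now the client call site, so each client call gets its own object.
  static Set<String> inlined(String... clientCallSites) {
    Set<String> objects = new HashSet<>();
    for (String site : clientCallSites) {
      objects.add(abstractObject("alloc_in_do", site));
    }
    return objects;
  }

  public static void main(String[] args) {
    System.out.println(withHelper("client:1", "client:2").size()); // prints "1"
    System.out.println(inlined("client:1", "client:2").size());    // prints "2"
  }
}
```

In this model, two chained calls to the same dataset operation collapse into one abstract object when the allocation sits in a helper, and stay distinct when it is inlined into `do`.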


…l tensor specs in the code. The initial tensor dimensions were not accurate, which caused the test to fail. The expected tensor spec should be (256, 28, 28, 1) with dtype float32, which matches the actual tensor spec used in the `train_step` function in the `tensorflow_gan_tutorial.py` file.


…ponding tensor generators to support pass-through behavior for __call__ and call methods. Also, refine tensor source identification in PythonTensorAnalysisEngine by verifying sources with TensorGeneratorFactory.


khatchad linked an issue Jul 11, 2025 that may be closed by this pull request

Track initial tensor shapes and dtypes through literals wala/ML#267

Open

khatchad self-assigned this Jul 11, 2025

khatchad closed this Jul 22, 2025

khatchad deleted the 267-initial-tensor-dimensions-arent-always-accurate branch July 22, 2025 16:11

khatchad restored the 267-initial-tensor-dimensions-arent-always-accurate branch July 22, 2025 16:12

khatchad reopened this Jul 22, 2025

khatchad added 23 commits February 5, 2026 15:46

Update OneHot.getShapes to throw IllegalStateException instead of IllegalArgumentException when indices points-to set is empty.

61e30d2

Remove unused LOGGER and its import from RaggedRange.java.

6135d13

Update Poisson.getShapes to throw IllegalStateException instead of IllegalArgumentException when 'lam' argument is missing.

c56815a

Prefix getLamParameterName() call with this. in Poisson.java.

1600159

Add comments to tf.constant identification logic in PythonTensorAnalysisEngine.getDataflowSources.

3a35c9b

Add Javadoc to PythonTensorAnalysisEngine.getDataflowSources.

be6d3c9

Clarify Javadoc for dataflow parameter in PythonTensorAnalysisEngine.getDataflowSources.

b8b5588

Add Javadoc to RaggedFromNestedValueRowIds.getMax.

62ce2ee

Add internal comments to RaggedFromNestedValueRowIds.getMax.

2d1b349

Prefix instance method calls with 'this' in SparseAdd.java.

114cacf

Introduce SPARSE_TENSOR_TYPE constant in TensorFlowTypes and use it in SparseAdd.java.

1529f18

Update TensorCall.getDefaultShapes to throw UnsupportedOperationException.

2b24645

Fix compilation error in SparseAdd.java by importing TensorFlowTypes.

9fe32db

Use static import for SPARSE_TENSOR_TYPE in SparseAdd.java.

60632ea

Import java.util.Map and use it in TensorGenerator.java.

957911b

Initialize shapeCache and dtypeCache at field declaration in TensorGenerator.java.

a1b2972

Add Javadoc to TensorGenerator.getTensorTypes.

43c4a7f

Add Javadoc to TensorGenerator.clearCaches.

424cead

Update clearCaches Javadoc to link to shapeCache and dtypeCache fields in TensorGenerator.java.

629b6b2

Cleanup.

b1a2cd5

Put back the default shapes and dtypes when the arguments are not specified.

0c92292

Don't prefix calls to static methods with the class name if they are in the same class or an inherited class.

3ad48f5

Remove faulty caching mechanism from TensorGenerator.java.

1a0e40d

khatchad added 30 commits March 11, 2026 15:24

Unsure what these files are about.

8fd6e9a

Use import statement instead of FQN.

2ba7b60

Unsure why could be considered a callable.

f8e7c0d

Add isolated reproduction test case for issue wala#127 regarding implicit `__call__` on synthetic objects.

208cc2e

Update TestIssue127 to use assertFalse with FIXME until issue wala#127 is resolved.

686014f

Add comprehensive precision tests for synthetic method trampolines in `TestIssue127.java`. Adds tests for wala#127.

2b1bb53

Test updates.

5e0e644

Progress on wala#127.

56a25c9

Fix: Update tf.sparse.add model in tensorflow.xml. The model previously used a read_data helper that did not preserve shape and dtype information from input tensors.

5139209

Restore TODO comment position.

9ffaaf4

Remove obsolete FIXME comments related to ML issue #127.

c92c1c7

The underlying trampoline resolution issues that necessitated these workarounds have been resolved.

Add assertions.

ed7adaf

Unsure how a helper method would increase precision of 1-CFA analysis.

a4f16dc

Add UNKNOWN dtype.

95e287d

In case we need it.

Revert "Add assertions." This reverts commit ed7adaf.

e11ac9b

Add new types.

236be01

Ignore submodules.

a7a4eeb

Ignore the submodules at the root.

889efd5

That's where they currently are.

Restore comments.

4ffcacb

Add multiple model test.

a48ac2c

Correctly infer shapes for Keras Model weights by implementing exhaustive layer discovery and fixing trampoline injection for multiple allocations in summarized methods.

64e2f7b

Make initial tensor dimensions general #156
khatchad wants to merge 818 commits into master from 267-initial-tensor-dimensions-arent-always-accurate
