Extending off #65
What happens within a subgraph if one operation loads the full T0 but another operation only requires a strip of T0? Would this count as intra-subgraph reuse, or would we need to double count the T0 strip?
Does the situation change if T0 was fully retained from the previous subgraph? And what decides the tensor renaming when fetching from slow memory?
Thanks for all the clarification!