Looking at the SSD streaming feature, I was wondering whether you've experimented with streaming weights from multiple disks. Most Macs have several independent Thunderbolt 4/5 buses, so we could attach multiple external enclosures with NVMe drives. If the weights were copied onto each drive, including the internal one, we could stream a different portion of the weights from each drive (a bit like RAID 1). This could be a "cheap" way to increase SSD bandwidth for random reads, and maybe make these cache misses less expensive?
I am not really familiar with ds4's codebase, so this could be irrelevant; just some food for thought.
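
To make the idea concrete, here is a rough sketch of what I mean, not anything taken from ds4. It assumes the same weights file has been copied byte-for-byte to each drive, and the function name `read_striped`, the `replica_paths` argument, and the 1 MiB chunk size are all made up for illustration. Each request is split into chunks that are assigned round-robin to the copies and read in parallel:

```python
import os
from concurrent.futures import ThreadPoolExecutor


def read_striped(replica_paths, offset, length, chunk_size=1 << 20):
    """Read `length` bytes at `offset`, spreading chunks across identical copies.

    Hypothetical helper: every path in `replica_paths` is assumed to hold
    the same file, one copy per physical drive.
    """
    fds = [os.open(p, os.O_RDONLY) for p in replica_paths]
    try:
        # Split the request into chunks and assign them round-robin to drives.
        jobs = []
        pos, end, i = offset, offset + length, 0
        while pos < end:
            n = min(chunk_size, end - pos)
            jobs.append((fds[i % len(fds)], pos, n))
            pos += n
            i += 1

        def read_chunk(job):
            fd, pos, n = job
            # pread may return fewer bytes than asked; loop until done or EOF.
            parts = []
            while n > 0:
                data = os.pread(fd, n, pos)
                if not data:
                    break
                parts.append(data)
                pos += len(data)
                n -= len(data)
            return b"".join(parts)

        # One worker per drive; map() returns results in submission order.
        with ThreadPoolExecutor(max_workers=len(fds)) as pool:
            return b"".join(pool.map(read_chunk, jobs))
    finally:
        for fd in fds:
            os.close(fd)
```

Something like `read_striped(["/Volumes/ext1/weights.bin", "/Volumes/ext2/weights.bin"], offset, length)` would then pull alternating chunks from each enclosure (the paths are placeholders). Whether this actually helps would have to be measured with the OS page cache out of the picture, since cached reads would hide any difference between drives.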