Question: Support for Alternative Embeddings in `fid_metric` (e.g., CLIP for CMMD)
Hi, thanks for this great package!
I had a few questions regarding the flexibility of the fid_metric implementation:
- Encoder:
  By default, I assume `fid_metric` uses Inception-v3 as the feature extractor. Is that correct?
- Support for Alternative Encoders:
  Is it currently possible to use a different encoder (e.g., CLIP, DINO, etc.) with `fid_metric`?
  I’m aware of `fid_dino`, but wondering if there is broader support for swapping in custom embeddings.
- CMMD Metric with CLIP:
  Specifically, I'm interested in computing the CMMD metric as described in [this paper](https://arxiv.org/pdf/2401.09603), which uses CLIP embeddings.
  I noticed you support:
  `fid_metric(..., distance_type='mmd', kernel_type='rbf')`
  - Does this support CLIP embeddings?
  - If not directly, is there a recommended way to plug in CLIP embeddings (or a custom feature extractor) into the FID/CMMD pipeline?
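
To make the question concrete, here is a rough sketch of the quantity I'd like to get, assuming I already have image embeddings (e.g., from CLIP) as 2-D NumPy arrays. `rbf_mmd2` is my own hypothetical helper and not part of this package, and the bandwidth `sigma=10.0` is my reading of the paper's default, so please treat both as assumptions. If I read the paper correctly, CMMD also rescales the result by a constant, which I leave out here. The random arrays are only stand-ins for real embeddings.

```python
import numpy as np


def rbf_mmd2(x, y, sigma=10.0):
    """Unbiased estimate of squared MMD between two embedding sets (RBF kernel).

    x: array of shape (m, d), y: array of shape (n, d), with m, n >= 2.
    sigma is the kernel bandwidth (assumed value, not taken from this package).
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    m, n = len(x), len(y)
    gamma = 1.0 / (2.0 * sigma ** 2)

    def kernel(a, b):
        # Pairwise squared distances via ||a||^2 + ||b||^2 - 2 a.b
        sq = (a ** 2).sum(1)[:, None] + (b ** 2).sum(1)[None, :] - 2.0 * a @ b.T
        # Clamp tiny negative values caused by floating-point round-off
        return np.exp(-gamma * np.maximum(sq, 0.0))

    k_xx, k_yy, k_xy = kernel(x, x), kernel(y, y), kernel(x, y)
    # Exclude the diagonal (k(a, a) = 1) from the within-set averages
    xx = (k_xx.sum() - np.trace(k_xx)) / (m * (m - 1))
    yy = (k_yy.sum() - np.trace(k_yy)) / (n * (n - 1))
    return xx + yy - 2.0 * k_xy.mean()


# Stand-in data: random vectors in place of real CLIP embeddings
rng = np.random.default_rng(0)
real = rng.normal(size=(64, 16))
fake = rng.normal(loc=0.5, size=(64, 16))
print(rbf_mmd2(real, fake))
```

Ideally I could pass arrays like `real` and `fake` (or a custom feature extractor that produces them) to the package instead of maintaining a separate helper like this.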
Thanks in advance!