This is to discuss some potential artefacts in the benchmark Cls. This applies to the clustering auto-correlations, especially at high redshift.
-
At large ell (~ell>1000), the benchmark Cls don't seem to agree with the Limber approximation:

This is Cl_gg 9-9; in blue: CCL Limber; orange: independent non-Limber/Limber implementation by @rreischke.
In this case, the region of ell > 1000 contributes ~25% to the SNR, so can't just be ignored.
-
ell = 2 looks weird:

Again Cl_gg, 9-9. The ell = 2 mode contributes about 40% to the SNR.
This is to discuss some potential artefacts in the benchmark Cls. This applies to the clustering auto-correlations, especially at high redshift.
At large ell (~ell>1000), the benchmark Cls don't seem to agree with the Limber approximation:

This is Cl_gg 9-9; in blue: CCL Limber; orange: independent non-Limber/Limber implementation by @rreischke.
In this case, the region of ell > 1000 contributes ~25% to the SNR, so can't just be ignored.
ell = 2 looks weird:

Again Cl_gg, 9-9. The ell = 2 mode contributes about 40% to the SNR.