The 100K MERFISH benchmark (PR #216) showed that pairwise_corr_vs_python fails at 100K (0.987, below the 0.99 threshold) despite all other quality gates passing. The metric computes correlation ...