dessn.investigations.gaussianity package¶
In most supernova cosmology analyses, the analysis is performed not using the observed light curves, but using SALT2 summary statistics. Whilst this choice does provide strong computational benefits, the underpinning assumption of validity for using the summary statistics - namely that they are Gaussian - is often assumed.
In order to provide a more rigorous justification for utilising this assumption, I investigate how well the summary statistics reflect the full posterior surface of the SALT2 fits under a variety of conditions and supernova parametrisation.
Introduction¶
To begin with, we pick the optimal case for a supernova: a high signal-to-noise
signal observed regularly throughout the entire supernova lifespan for
a supernova with an exactly determined redshift. We generate
a value for supernova parameters \(x_0,\, x_1,\, t_0,\, c,\, z\) and realise
a SALT2 light curve using sncosmo
. The light curve realised is shown below.
We then fit the light curves to get summary statistics using sncosmo
, and compare
this to a separate MCMC fit which records the entire posterior surface. In this optimal case
we hope to see a high degree of agreement between the summary statistics and full posterior
surface.
This simple example provides the framework for a more sophisticated analysis. In it, we vary the supernova parameters and observation scenario (time pre-peak, and post-peak, frequency of observations) and determine the difference between the full posterior surface and summary statistics via a given metric, and determine in what regions of parameter space might the gaussianity assumption not adequately hold.
Our primary parameter of interest is not any of the fit parameters, but is instead the distance modulus \(\mu\), as it is biases in the distance modulus which will bias our cosmology. Using fiducial values for \(\alpha\) and \(\beta\) from [Re7a0e5d21dfc-1], we have the distance modulus as
where \(m_B^*\) is calculated using sncosmo
. Using this relationship, and moving
the nuisance parameter \(M\) to the other side, we can convert our multi-dimensional
posterior surface into a single dimension, and then check for any biases in the mean and
variance of the \(\mu+M\) distribution. Doing this for the simple example above gives
us the distributions shown below, which have \(\Delta (\mu+M) = 0.003\)
and \(\Delta (\sigma_{\mu+M}) = 0.0007\), a negligible difference for use in cosmology.
To rephrase the above for more clarity, what we are doing is comparing the distribution of \(\mu +M\) from a full posterior surface against the distribution obtained from using summary statistics (assuming gaussianity).
Generalisation¶
By finding the difference in mean and standard deviation of the \(\mu+M\) distributions from assuming a Gaussian approximation and examining the full posterior surface, we can investigate the effect of various observational conditions. Parameters which may be of interest are the peak signal to noise of the light curves, the number of observations, and when observations for a supernova begin (ie how early do we catch the supernova before peak) - in addition to stretch, colour and redshift.
To simplify the problem, we assume consistent observations every five days.
First, we generate supernova with stretch and colour set to zero, with observations starting well before the peak and spanning the lifetime of the supernova. We allow the redshift and sky flux to vary (ie redshift and signal-to-noise changes). The shift in the mean value for \(\mu+M\) is shown in the figure below.
With reference to the above figure, we can see that the summary statistics provide
accurate statistics when the mean and covariance are determined from the emcee
distribution,
implying low skewness of the posterior surface. However, the difference between the
minuit
fits and both emcee
fits gives rise to a difference which is not negligible.
DES Specific Survey Conditions¶
Shallow Field¶
To investigate the DES specific impact of Salt 2 summary statistics gaussianity,
we implement DES-like simulations. Specifically in this section, we emulate shallow
field observations for DES in the griz bands for typical seeing conditions. We
fit for a range of supernova over a redshift distribution from 0.1 to 0.9, and investigate
the derived parameter \(\mu + M\) with regards to the shift in mean, change in
variance, and pull. We also add in nestle
as a forth fitting methodology to help
discriminate between minuit
and emcee
.
From these results, it can be seen that above a redshift range of approximately
\(z=0.5\), using iminuit
via the fit_lc
method produces biased summary
statistics. Below this redshift, and for all other fitting methodologies, expressing the
posterior via gaussian approximation loses negligible information about the posterior surface
and introduces no detectable bias.
We find identical results when generating non-canonical supernova (with \(x_1,\ c\) not set to zero). Furthermore, we take the realised light curve of maximal bias and present it here.
Deep Field¶
We also investigate the presence of any bias when observing in the deep field. From the Generalisation section, we expect to discover less bias due to the increased signal-to-noise over the shallow fields. This is the case, as shown below, such that the bias in final output is lessened, however it is still present in redshifts above \(z=0.5\).
Cosmology Impact¶
To test the impact on cosmology fits this bias might have, I implemented a basic cosmology simulation.
We start with the basics, and assume that supernova are drawn from some underlying
distribution of absolute magnitude \(M_i \sim \mathcal{N}(M,\sigma_{\rm int})\). We
set our simulation cosmology to astropy.cosmology.WMAP9
, and (as done before), set
our Philip’s relation variables \(\alpha\) and \(\beta\) to \(0.14\) and
\(3.15\) respectively, (following fiducial values from [Re7a0e5d21dfc-1]), and fix the values there.
Supernova colours are drawn such that \(c_i \sim \mathcal{N}(0, 0.1)\) and colours are drawn
as \(x_{1i} \sim \mathcal{N}(0, 1)\). The redshift distribution is calculated from the
comoving volume of the DES deep-field area of 30 square degrees using the sncosmo
function zdist
.
We assume perfectly realised zero points, which at the time of analysis were given as \([32.46, 32.28, 32.55, 33.12]\) for the g, r, i and z bands in the shallow field, and \([34.24, 34.85, 34.94, 35.42]\). Observational cadence is set to 5 days, and weather conditions and moon cycle are simulated in light curve generation.
Given this, we can compute the weight as a function of the two cosmological parameters free: \(\Omega_m\) and \(w\). We generate 500 supernova having set \(\sigma_{\rm int} = 0.1\)
To test the impact on cosmology fits this bias might have, I use a very simplistic
cosmology fitter and do not correct for selection bias. Thus we expect the final
cosmology to be biased, however both input data (one using skewed minuit
output
and the other using the output summary stats from an emcee
chain) have the same
selection effects and so should still give the same output posterior surface.
Using selection effects of one point in one band being above a signal to noise of 4, we plot the supernova distance modulus minus the cosmology distance modulus to show the bias in redshift, for both the shallow and deep field.
The output surfaces for the supernova data put through a simplistic cosmology analysis are shown below, and we can see a difference in both the recovered value for \(\Omega_m\) and \(w\).
Method | Shallow | Deep | ||
\(\Omega_m\) | \(w\) | \(\Omega_m\) | \(w\) | |
emcee | \(0.326^{+0.025}_{-0.029}\) | \(-1.13\pm 0.12\) | \(0.323^{+0.041}_{-0.050}\) | \(-1.11\pm 0.17\) |
minuit | \(0.369^{+0.019}_{-0.022}\) | \(-1.30^{+0.12}_{-0.13}\) | \(0.416^{+0.023}_{-0.030}\) | \(-1.51\pm 0.20\) |
The offset is more clearly shown in the likelihood surfaces:
Summary¶
From investigating the biases imparted onto \(\mu +M\) from the assumption of posterior
gaussianity, we find the assumption of gaussianity to hold for emcee
or nestle
based
fits across all redshifts, and the assumption to only hold for \(z<0.5\) objects when
using iminuit
. From this, we recommend utilising the emcee
or nestle
fitting
methods. When using either of those methods, we find no evidence of detectable bias
in the output distance modulus for supernova events.
[Re7a0e5d21dfc-1] | (1, 2) Betoule, M., Kessler, R., Guy, J. et al. (2014), “Improved cosmological constraints from a joint analysis of the SDSS-II and SNLS supernova samples”, http://adsabs.harvard.edu/abs/2014A%26A...568A..22B |