# Clustering of Gamma-Ray bursts through kernel principal component analysis [CL]

We consider the problem related to clustering of gamma-ray bursts (from “BATSE” catalogue) through kernel principal component analysis in which our proposed kernel outperforms results of other competent kernels in terms of clustering accuracy and we obtain three physically interpretable groups of gamma-ray bursts. The effectivity of the suggested kernel in combination with kernel principal component analysis in revealing natural clusters in noisy and nonlinear data while reducing the dimension of the data is also explored in two simulated data sets.

S. Modak, A. Chattopadhyay and T. Chattopadhyay
Fri, 17 Mar 17
43/50

Comments: 30 pages, 10 figures

# The M33 Synoptic Stellar Survey. II. Mira Variables [SSA]

We present the discovery of 1847 Mira candidates in the Local Group galaxy M33 using a novel semi-parametric periodogram technique coupled with a Random Forest classifier. The algorithms were applied to ~2.4×10^5 I-band light curves previously obtained by the M33 Synoptic Stellar Survey. We derive preliminary Period-Luminosity relations at optical, near- & mid-infrared wavelengths and compare them to the corresponding relations in the Large Magellanic Cloud.

W. Yuan, S. He, L. Macri, et. al.
Mon, 6 Mar 17
20/47

Comments: Accepted for publication in the Astronomical Journal

# An intermediate-mass black hole in the centre of the globular cluster 47 Tucanae [GA]

Intermediate mass black holes play a critical role in understanding the evolutionary connection between stellar mass and super-massive black holes. However, to date the existence of these species of black holes remains ambiguous and their formation process is therefore unknown. It has been long suspected that black holes with masses $10^{2}-10^{4}M_{\odot}$ should form and reside in dense stellar systems. Therefore, dedicated observational campaigns have targeted globular cluster for many decades searching for signatures of these elusive objects. All candidates found in these targeted searches appear radio dim and do not have the X-ray to radio flux ratio predicted by the fundamental plane for accreting black holes. Based on the lack of an electromagnetic counterpart upper limits of $2060 M_{\odot}$ and $470 M_{\odot}$ have been placed on the mass of a putative black hole in 47 Tucanae (NGC 104) from radio and X-ray observations respectively. Here we show there is evidence for a central black hole in 47 Tuc with a mass of M$_{\bullet}\sim2200 M_{\odot}$$_{-800}^{+1500}$ when the dynamical state of the globular cluster is probed with pulsars. The existence of an intermediate mass black hole in the centre of one of the densest clusters with no detectable electromagnetic counterpart suggests that the black hole is not accreting at a sufficient rate and therefore contrary to expectations is gas starved. This intermediate mass black hole might be a member of electromagnetically invisible population of black holes that are the elusive seeds leading to the formation of supermassive black holes in galaxies.

B. Kiziltan, H. Baumgardt and A. Loeb
Thu, 9 Feb 17
47/67

Comments: Published in Nature

# Method for estimating cycle lengths from multidimensional time series: Test cases and application to a massive "in silico" dataset [SSA]

Many real world systems exhibit cyclic behavior that is, for example, due to the nearly harmonic oscillations being perturbed by the strong fluctuations present in the regime of significant non-linearities. For the investigation of such sys- tems special techniques relaxing the assumption to periodicity are required. In this paper, we present the generalization of one of such techniques, namely the D2 phase dispersion statistic, to multidimensional datasets, especially suited for the analysis of the outputs from three-dimensional numerical simulations of the full magnetohydrodynamic equations. We present the motivation and need for the usage of such a method with simple test cases, and present an application to a solar-like semi-global numerical dynamo simulation covering nearly 150 magnetic cycles.

N. Olspert, M. Kapyla and J. Pelt
Wed, 7 Dec 16
21/67

# Learning an Astronomical Catalog of the Visible Universe through Scalable Bayesian Inference [CL]

Celeste is a procedure for inferring astronomical catalogs that attains state-of-the-art scientific results. To date, Celeste has been scaled to at most hundreds of megabytes of astronomical images: Bayesian posterior inference is notoriously demanding computationally. In this paper, we report on a scalable, parallel version of Celeste, suitable for learning catalogs from modern large-scale astronomical datasets. Our algorithmic innovations include a fast numerical optimization routine for Bayesian posterior inference and a statistically efficient scheme for decomposing astronomical optimization problems into subproblems.
Our scalable implementation is written entirely in Julia, a new high-level dynamic programming language designed for scientific and numerical computing. We use Julia’s high-level constructs for shared and distributed memory parallelism, and demonstrate effective load balancing and efficient scaling on up to 8192 Xeon cores on the NERSC Cori supercomputer.

J. Regier, K. Pamnany, R. Giordano, et. al.
Fri, 11 Nov 16
11/40

Comments: submitting to IPDPS’17

# The Non-homogeneous Poisson Process for Fast Radio Burst Rates [HEAP]

This paper presents the non-homogeneous Poisson process (NHPP) for modeling the rate of fast radio bursts (FRBs) and other infrequently observed astronomical events. The NHPP, well-known in statistics, can model changes in the rate as a function of both astronomical features and the details of an observing campaign. This is particularly helpful for rare events like FRBs because the NHPP can combine information across surveys, making the most of all available information. The goal of the paper is two-fold. First, it is intended to be a tutorial on the use of the NHPP. Second, we build an NHPP model that incorporates beam patterns and a power law flux distribution for the rate of FRBs. Using information from 12 surveys including 15 detections, we find an all-sky FRB rate of 586.88 events per sky per day above a flux of 1 Jy (95\% CI: 271.86, 923.72) and a flux power-law index of 0.91 (95\% CI: 0.57, 1.25). Our rate is lower than other published rates, but consistent with the rate given in Champion et al. 2016.

E. Lawrence, S. Wiel, C. Law, et. al.
Thu, 3 Nov 16
27/57

Comments: 19 pages, 2 figures

# Neutron stars in the light of SKA: Data, statistics, and science [IMA]

The Square Kilometre Array (SKA), when it becomes functional, is expected to enrich neutron star (NS) catalogues by at least an order of magnitude over their current state. This includes the discovery of new NS objects leading to better sampling of under-represented NS categories, precision measurements of intrinsic properties such as spin period and magnetic field, as also data on related phenomena such as microstructure, nulling, glitching, etc. This will present a unique opportunity to seek answers to interesting and fundamental questions about the extreme physics underlying these exotic objects in the universe. In this paper, we first present a meta-analysis (from a methodological viewpoint) of statistical analyses performed using existing NS data, with a two-fold goal: First, this should bring out how statistical models and methods are shaped and dictated by the science problem being addressed. Second, it is hoped that these analyses will provide useful starting points for deeper analyses involving richer data from SKA whenever it becomes available. We also describe a few other areas of NS science which we believe will benefit from SKA which are of interest to the Indian NS community.

M. Arjunwadkar, A. Kashikar and M. Bagchi
Thu, 27 Oct 16
46/59

Comments: To appear in Journal of Astrophysics and Astronomy (JOAA) special issue on “Science with the SKA: an Indian perspective”

|

# Period estimation for sparsely-sampled quasi-periodic light curves applied to Miras [SSA]

We develop a non-linear semi-parametric Gaussian process model to estimate periods of Miras with sparsely-sampled light curves. The model uses a sinusoidal basis for the periodic variation and a Gaussian process for the stochastic changes. We use maximum likelihood to estimate the period and the parameters of the Gaussian process, while integrating out the effects of other nuisance parameters in the model with respect to a suitable prior distribution obtained from earlier studies. Since the likelihood is highly multimodal for period, we implement a hybrid method that applies the quasi-Newton algorithm for Gaussian process parameters and search the period/frequency parameter over a dense grid.
A large-scale, high-fidelity simulation is conducted to mimic the sampling quality of Mira light curves obtained by the M33 Synoptic Stellar Survey. The simulated data set is publicly available and can serve as a testbed for future evaluation of different period estimation methods. The semi-parametric model outperforms an existing algorithm on this simulated test data set as measured by period recovery rate and quality of the resulting Period-Luminosity relations.

S. He, W. Yuan, J. Huang, et. al.
Thu, 22 Sep 16
22/62

Comments: Accepted for publication in The Astronomical Journal. Software package and test data set available at this http URL

# The Type Ia Supernova Color-Magnitude Relation and Host Galaxy Dust: A Simple Hierarchical Bayesian Model [CEA]

Conventional Type Ia supernova (SN Ia) cosmology analyses currently use a simplistic linear regression of magnitude versus color and light curve shape, which does not model intrinsic SN Ia variations and host galaxy dust as physically distinct effects, resulting in low color-magnitude slopes. We construct a probabilistic generative model for the distribution of dusty extinguished absolute magnitudes and apparent colors as a convolution of the intrinsic SN Ia color-magnitude distribution and the host galaxy dust reddening-extinction distribution. If the intrinsic color-magnitude (M_B vs. B-V) slope beta_int differs from the host galaxy dust law R_B, this convolution results in a specific curve of mean extinguished absolute magnitude vs. apparent color. The derivative of this curve smoothly transitions from beta_int in the blue tail to R_B in the red tail of the apparent color distribution. The conventional linear fit approximates this effective curve at this transition near the average apparent color, resulting in an apparent slope beta_app between beta_int and R_B. We incorporate these effects into a hierarchical Bayesian statistical model for SN Ia light curve measurements, and analyze a dataset of SALT2 optical light curve fits of a compilation of 277 nearby SN Ia at z < 0.10. The conventional linear fit obtains beta_app = 3. Our model finds a beta_int = 2.2 +/- 0.3 and a distinct dust law of R_B = 3.7 +/- 0.3, consistent with the average for Milky Way dust, while correcting a systematic distance bias of ~0.10 mag in the tails of the apparent color distribution.

K. Mandel, D. Scolnic, H. Shariff, et. al.
Fri, 16 Sep 16
17/63

Comments: 22 pages, 16 figures, submitted to ApJ

|

# astroABC: An Approximate Bayesian Computation Sequential Monte Carlo sampler for cosmological parameter estimation [IMA]

Given the complexity of modern cosmological parameter inference where we are faced with non-Gaussian data and noise, correlated systematics and multi-probe correlated data sets, the Approximate Bayesian Computation (ABC) method is a promising alternative to traditional Markov Chain Monte Carlo approaches in the case where the Likelihood is intractable or unknown. The ABC method is called “Likelihood free” as it avoids explicit evaluation of the Likelihood by using a forward model simulation of the data which can include systematics. We introduce astroABC, an open source ABC Sequential Monte Carlo (SMC) sampler for parameter estimation. A key challenge in astrophysics is the efficient use of large multi-probe datasets to constrain high dimensional, possibly correlated parameter spaces. With this in mind astroABC allows for massive parallelization using MPI, a framework that handles spawning of jobs across multiple nodes. A key new feature of astroABC is the ability to create MPI groups with different communicators, one for the sampler and several others for the forward model simulation, which speeds up sampling time considerably. For smaller jobs the Python multiprocessing option is also available. Other key features include: a Sequential Monte Carlo sampler, a method for iteratively adapting tolerance levels, local covariance estimate using scikit-learn’s KDTree, modules for specifying optimal covariance matrix for a component-wise or multivariate normal perturbation kernel, output and restart files are backed up every iteration, user defined metric and simulation methods, a module for specifying heterogeneous parameter priors including non-standard prior PDFs, a module for specifying a constant, linear, log or exponential tolerance level, well-documented examples and sample scripts. This code is hosted online at https://github.com/EliseJ/astroABC

E. Jennings and M. Madigan
Tue, 30 Aug 16
37/78

Comments: 19 pages, 3 figures. Comments welcome

|

# Bayesian isochrone fitting and stellar ages [GA]

Stellar evolution theory has been extraordinarily successful at explaining the different phases under which stars form, evolve and die. While the strongest constraints have traditionally come from binary stars, the advent of asteroseismology is bringing unique measures in well-characterised stars. For stellar populations in general, however, only photometric measures are usually available, and the comparison with the predictions of stellar evolution theory have mostly been qualitative. For instance, the geometrical shapes of isochrones have been used to infer ages of coeval populations, but without any proper statistical basis. In this chapter we provide a pedagogical review on a Bayesian formalism to make quantitative inferences on the properties of single, binary and small ensembles of stars, including unresolved populations. As an example, we show how stellar evolution theory can be used in a rigorous way as a prior information to measure the ages of stars between the ZAMS and the Helium flash, and their uncertainties, using photometric data only.

D. Valls-Gabaud
Tue, 12 Jul 16
11/71

Comments: 43 pages, Proceedings of the Evry Schatzman School of Stellar Astrophysics “The ages of stars”, EAS Publications Series, Volume 65

# Comparing cosmic web classifiers using information theory [CEA]

We introduce a decision scheme for optimally choosing a classifier, which segments the cosmic web into different structure types (voids, sheets, filaments, and clusters). Our framework, based on information theory, accounts for the design aims of different classes of possible applications: (i) parameter inference, (ii) model selection, and (iii) prediction of new observations. As an illustration, we use cosmographic maps of web-types in the Sloan Digital Sky Survey to assess the relative performance of the classifiers T-web, DIVA and ORIGAMI for: (i) analyzing the morphology of the cosmic web, (ii) discriminating dark energy models, and (iii) predicting galaxy colors. Our study substantiates a data-supported connection between cosmic web analysis and information theory, and paves the path towards principled design of analysis procedures for the next generation of galaxy surveys. We have made the cosmic web maps, galaxy catalog, and analysis scripts used in this work publicly available.

F. Leclercq, G. Lavaux, J. Jasche, et. al.
Thu, 23 Jun 16
23/49

Comments: 20 pages, 8 figures, 6 tables. Public data available from the first author’s website (currently this http URL)

|

# Tests for Comparing Weighted Histograms. Review and Improvements [CL]

Histograms with weighted entries are used to estimate probability density functions. Computer simulation is the main application of this type of histograms. A review on chi-square tests for comparing weighted histograms is presented in this paper. Improvements to these tests that have a size closer to its nominal value are proposed. Numerical examples are presented for evaluation and demonstration of various applications of the tests.

N. Gagunashvili
Wed, 22 Jun 16
21/50

Comments: 23 pages, 2 figures. arXiv admin note: text overlap with arXiv:0905.4221

# A Goldilocks principle for modeling radial velocity noise [EPA]

The doppler measurements of stars are diluted and distorted by stellar activity noise. Different choices of noise models and statistical methods have led to much controversy in the confirmation of exoplanet candidates obtained through analysing radial velocity data. To quantify the limitation of various models and methods, we compare different noise models and signal detection criteria for various simulated and real data sets in the Bayesian framework. According to our analyses, the white noise model tend to interpret noise as signal, leading to false positives. On the other hand, the red noise models are likely to interprete signal as noise, resulting in false negatives. We find that the Bayesian information criterion combined with a Bayes factor threshold of 150 can efficiently rule out false positives and confirm true detections. We further propose a Goldilocks principle aimed at modeling radial velocity noise to avoid too many false positives and too many false negatives. We propose that the noise model with RHK-dependent jitter is used in combination with the moving average model to detect planetary signals for M dwarfs. Our work may also shed light on the noise modeling for hotter stars, and provide a valid approach for finding similar principles in other disciplines.

F. Feng, M. Tuomi, H. Jones, et. al.
Fri, 17 Jun 16
36/65

Comments: 14 pages, 6 figures, accepted for publication in MNRAS

# Using Extreme Value Theory for Determining the Probability of Carrington-Like Solar Flares [CL]

Space weather events can negatively affect satellites, the electricity grid, satellite navigation systems and human health. As a consequence, extreme space weather has been added to the UK and other national risk registers. However, by their very nature, extreme events occur rarely and statistical methods are required to determine the probability of occurrence solar storms. Space weather events can be characterised by a number of natural phenomena such as X-ray (solar) flares, solar energetic particle (SEP) fluxes, coronal mass ejections and various geophysical indices (Dst, Kp, F10.7). Here we use extreme value theory (EVT) to investigate the probability of extreme solar flares. Previous work has suggested that the distribution of solar flares follows a power law. However such an approach can lead to overly “fat-tails” in the probability distribution function and thus to an under estimation of the return time of such events. Using EVT and GOES X-ray flux data we find that the expected 150 year return level is an X60 flare (6×10^(-3) Wm-2, 1-8 {\AA} X-ray flux). We also show that the EVT results are consistent with flare data from the Kepler space telescope mission.

S. Elvidge and M. Angling
Wed, 13 Apr 16
43/60

Comments: 10 pages, 3 figures, submitted to Nature

# Photo-z Estimation: An Example of Nonparametric Conditional Density Estimation under Selection Bias [CL]

Redshift is a key quantity for inferring cosmological model parameters. In photometric redshift estimation, cosmologists use the coarse data collected from the vast majority of galaxies to predict the redshift of individual galaxies. To properly quantify the uncertainty in the predictions, however, one needs to go beyond standard regression and instead estimate the full conditional density f(z|x) of a galaxy’s redshift z given its photometric covariates x. The problem is further complicated by selection bias: usually only the rarest and brightest galaxies have known redshifts, and these galaxies have characteristics and measured covariates that do not necessarily match those of more numerous and dimmer galaxies of unknown redshift. Unfortunately, there is not much research on how to best estimate complex multivariate densities in such settings. Here we describe a general framework for properly constructing and assessing nonparametric conditional density estimators under selection bias, and for combining two or more estimators for optimal performance. We propose new improved photo-z estimators and illus- trate our methods on data from the Sloan Data Sky Survey and an application to galaxy-galaxy lensing. Although our main application is photo-z estimation, our methods are relevant to any high-dimensional regression setting with complicated asymmetric and multimodal distributions in the response variable.

R. Izbicki, A. Lee and P. Freeman
Thu, 7 Apr 16
42/51

# Unfolding problem clarification and solution validation [CL]

The unfolding problem formulation for correcting experimental data distortions due to finite resolution and limited detector acceptance is discussed. A novel validation of the problem solution is proposed. Attention is drawn to fact that different unfolded distributions may satisfy the validation criteria, in which case a conservative approach using entropy is suggested. The importance of analysis of residuals is demonstrated.

N. Gagunashvili
Fri, 19 Feb 16
22/50

Comments: 9 pages,4 figures

# Bayesian Estimates of Astronomical Time Delays between Gravitationally Lensed Stochastic Light Curves [IMA]

The gravitational field of a galaxy can act as a lens and deflect the light emitted by a more distant object such as a quasar. If the galaxy is a strong gravitational lens, it can produce multiple images of the same quasar in the sky. Since the light in each gravitationally lensed image traverses a different path length from the quasar to the Earth, fluctuations in the source brightness are observed in the several images at different times. The time delay between these fluctuations can be used to constrain cosmological parameters and can be inferred from the time series of brightness data or light curves of each image. To estimate the time delay, we construct a model based on a state-space representation for irregularly observed time series generated by a latent continuous-time Ornstein-Uhlenbeck process. We account for microlensing, an additional source of independent long-term extrinsic variability, via a polynomial regression. Our Bayesian strategy adopts a Metropolis-Hastings within Gibbs sampler. We improve the sampler by using an ancillarity-sufficiency interweaving strategy and adaptive Markov chain Monte Carlo. We introduce a profile likelihood of the time delay as an approximation of its marginal posterior distribution. The Bayesian and profile likelihood approaches complement each other, producing almost identical results; the Bayesian method is more principled but the profile likelihood is simpler to implement. We demonstrate our estimation strategy using simulated data of doubly- and quadruply-lensed quasars, and observed data from quasars Q0957+561 and J1029+2623.

H. Tak, K. Mandel, D. Dyk, et. al.
Fri, 5 Feb 16
31/47

|

# Using hydrodynamical simulations of stellar atmospheres for periodogram standardization : application to exoplanet detection [CL]

Our aim is to devise a detection method for exoplanet signatures (multiple sinusoids) that is both powerful and robust to partially unknown statistics under the null hypothesis. In the considered application, the noise is mostly created by the stellar atmosphere, with statistics depending on the complicated interplay of several parameters. Recent progresses in hydrodynamic (HD) simulations show however that realistic stellar noise realizations can be numerically produced off-line by astrophysicists. We propose a detection method that is calibrated by HD simulations and analyze its performances. A comparison of the theoretical results with simulations on synthetic and real data shows that the proposed method is powerful and robust.

S. Sulis, D. Mary and L. Bigot
Fri, 29 Jan 16
10/52

Comments: 5 pages, 3 figures. This manuscript was submitted and accepted to the 41st IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2016

# Testing the anisotropy in the angular distribution of $Fermi$/GBM gamma-ray bursts [HEAP]

Gamma-ray bursts (GRBs) were confirmed to be of extragalactic origin due to their isotropic angular distribution, combined with the fact that they exhibited an intensity distribution that deviated strongly from the $-3/2$ power law. This finding was later confirmed with the first redshift, equal to at least $z=0.835$, measured for GRB970508. Despite this result, the data from $CGRO$/BATSE and $Swift$/BAT indicate that long GRBs are indeed distributed isotropically, but the distribution of short GRBs is anisotropic. $Fermi$/GBM has detected 1669 GRBs up to date, and their sky distribution is examined in this paper. A number of statistical tests is applied: nearest neighbour analysis, fractal dimension, dipole and quadrupole moments of the distribution function decomposed into spherical harmonics, binomial test, and the two point angular correlation function. Monte Carlo benchmark testing of each test is performed in order to evaluate its reliability. It is found that short GRBs are distributed anisotropically on the sky, and long ones have an isotropic distribution. The probability that these results are not a chance occurence is equal to at least 99.98\% and 30.68\% for short and long GRBs, respectively. The cosmological context of this finding and its relation to large-scale structures is briefly discussed.

M. Tarnopolski
Thu, 10 Dec 15
3/63

Comments: 13 pages, 5 figures; submitted; comments very welcome

# An integrative approach based on probabilistic modelling and statistical inference for morpho-statistical characterization of astronomical data [CL]

This paper describes several applications in astronomy and cosmology that are addressed using probabilistic modelling and statistical inference.

R. Stoica, S. Liu, L. Liivamagi, et. al.
Tue, 20 Oct 15
45/92

# Detecting Effects of Filaments on Galaxy Properties in the Sloan Digital Sky Survey III [GA]

We study the effects of filaments on galaxy properties in the Sloan Digital Sky Survey (SDSS) Data Release 12 using filaments from the Cosmic Web Reconstruction’ catalogue (Chen et al. 2015a), a publicly available filament catalogue for SDSS. Since filaments are tracers of medium-to-high density regions, we expect that galaxy properties associated with the environment are dependent on the distance to the nearest filament. Our analysis demonstrates a red galaxy or a high-mass galaxy tend to reside closer to filaments than a blue or low-mass galaxy. After adjusting the effect from stellar mass, on average, late-forming galaxies or large galaxies have a shorter distance to filaments than early-forming galaxies or small galaxies. For the Main galaxy sample, all signals are very significant ($> 5\sigma$). For the LOWZ and CMASS samples, most of the signals are significant (with $> 3\sigma$). The filament effects we observe persist until z = 0.7 (the edge of the CMASS sample). Comparing our results to those using the galaxy distances from redMaPPer galaxy clusters as a reference, we find a similar result between filaments and clusters. Our findings illustrate the strong correlation of galaxy properties with proximity to density ridges, strongly supporting the claim that density ridges are good tracers of filaments.

Y. Chen, S. Ho, R. Mandelbaum, et. al.
Wed, 23 Sep 15
41/63

Comments: 12 pages, 6 figures, 2 tables

# Cosmic Web Reconstruction through Density Ridges: Catalogue [CEA]

We construct a catalogue for filaments using a novel approach called SCMS (subspace constrained mean shift; Ozertem & Erdogmus 2011; Chen et al. 2015). SCMS is a gradient-based method that detects filaments through density ridges (smooth curves tracing high-density regions). A great advantage of SCMS is its uncertainty measure, which allows an evaluation of the errors for the detected filaments. To detect filaments, we use data from the Sloan Digital Sky Survey, which consist of three galaxy samples: the NYU main galaxy sample (MGS), the LOWZ sample and the CMASS sample. Each of the three dataset covers different redshift regions so that the combined sample allows detection of filaments up to z = 0.7. Our filament catalogue consists of a sequence of two-dimensional filament maps at different redshifts that provide several useful statistics on the evolution cosmic web. To construct the maps, we select spectroscopically confirmed galaxies within 0.050 < z < 0.700 and partition them into 130 bins. For each bin, we ignore the redshift, treating the galaxy observations as a 2-D data and detect filaments using SCMS. The filament catalogue consists of 130 individual 2-D filament maps, and each map comprises points on the detected filaments that describe the filamentary structures at a particular redshift. We also apply our filament catalogue to investigate galaxy luminosity and its relation with distance to filament. Using a volume-limited sample, we find strong evidence (6.1$\sigma$ – 12.3$\sigma$) that galaxies close to filaments are generally brighter than those at significant distance from filaments.

Y. Chen, S. Ho, J. Brinkmann, et. al.
Wed, 23 Sep 15
45/63

Comments: 14 pages, 12 figures, 4 tables

|

# Polarized CMB recovery with sparse component separation [CEA]

The polarization modes of the cosmological microwave background are an invaluable source of information for cosmology, and a unique window to probe the energy scale of inflation. Extracting such information from microwave surveys requires disentangling between foreground emissions and the cosmological signal, which boils down to solving a component separation problem. Component separation techniques have been widely studied for the recovery of CMB temperature anisotropies but quite rarely for the polarization modes. In this case, most component separation techniques make use of second-order statistics to discriminate between the various components. More recent methods, which rather emphasize on the sparsity of the components in the wavelet domain, have been shown to provide low-foreground, full-sky estimate of the CMB temperature anisotropies. Building on sparsity, the present paper introduces a new component separation technique dubbed PolGMCA (Polarized Generalized Morphological Component Analysis), which refines previous work to specifically tackle the estimation of the polarized CMB maps: i) it benefits from a recently introduced sparsity-based mechanism to cope with partially correlated components, ii) it builds upon estimator aggregation techniques to further yield a better noise contamination/non-Gaussian foreground residual trade-off. The PolGMCA algorithm is evaluated on simulations of full-sky polarized microwave sky simulations using the Planck Sky Model (PSM), which show that the proposed method achieve a precise recovery of the CMB map in polarization with low noise/foreground contamination residuals. It provides improvements with respect to standard methods, especially on the galactic center where estimating the CMB is challenging.

J. Bobin, F. Sureau and J. Starck
Mon, 31 Aug 15
3/63

Comments: Accepted to A&A, august 2015

|

# Detecting Abrupt Changes in the Spectra of High-Energy Astrophysical Sources [CL]

Variable-intensity astronomical sources are the result of complex and often extreme physical processes. Abrupt changes in source intensity are typically accompanied by equally sudden spectral shifts, i.e., sudden changes in the wavelength distribution of the emission. This article develops a method for modeling photon counts collected form observation of such sources. We embed change points into a marked Poisson process, where photon wavelengths are regarded as marks and both the Poisson intensity parameter and the distribution of the marks are allowed to change. We believe this is the first effort to embed change points into a marked Poisson process. Between the change points, the spectrum is modeled non-parametrically using a mixture of a smooth radial basis expansion and a number of local deviations from the smooth term representing spectral emission lines. Because the model is over parameterized we employ an $\ell_1$ penalty. The tuning parameter in the penalty and the number of change points are determined via the minimum description length principle. Our method is validated via a series of simulation studies and its practical utility is illustrated in the analysis of the ultra-fast rotating yellow giant star known as FK Com.

R. Wong, V. Kashyap, T. Lee, et. al.
Mon, 31 Aug 15
21/63

Comments: 27 pages, 5 figures

# Multivariate Approaches to Classification in Extragalactic Astronomy [GA]

Clustering objects into synthetic groups is a natural activity of any science. Astrophysics is not an exception and is now facing a deluge of data. For galaxies, the one-century old Hubble classification and the Hubble tuning fork are still largely in use, together with numerous mono-or bivariate classifications most often made by eye. However, a classification must be driven by the data, and sophisticated multivariate statistical tools are used more and more often. In this paper we review these different approaches in order to situate them in the general context of unsupervised and supervised learning. We insist on the astrophysical outcomes of these studies to show that multivariate analyses provide an obvious path toward a renewal of our classification of galaxies and are invaluable tools to investigate the physics and evolution of galaxies.

D. Fraix-Burnet, M. Thuillard and A. Chattopadhyay
Fri, 28 Aug 15
15/49

Comments: Open Access paper. this http URL&gt;. \&lt;10.3389/fspas.2015.00003 \&gt

# Investigating Galaxy-Filament Alignments in Hydrodynamic Simulations using Density Ridges [CEA]

In this paper, we study the filamentary structures and the galaxy alignment along filaments at redshift $z=0.06$ in the MassiveBlack-II simulation, a state-of-the-art, high-resolution hydrodynamical cosmological simulation which includes stellar and AGN feedback in a volume of (100 Mpc$/h$)$^3$. The filaments are constructed using the subspace constrained mean shift (SCMS; Ozertem & Erdogmus (2011) and Chen et al. (2015a)). First, we show that reconstructed filaments using galaxies and reconstructed filaments using dark matter particles are similar to each other; over $50\%$ of the points on the galaxy filaments have a corresponding point on the dark matter filaments within distance $0.13$ Mpc$/h$ (and vice versa) and this distance is even smaller at high-density regions. Second, we observe the alignment of the major principal axis of a galaxy with respect to the orientation of its nearest filament and detect a $2.5$ Mpc$/h$ critical radius for filament’s influence on the alignment when the subhalo mass of this galaxy is between $10^9M_\odot/h$ and $10^{12}M_\odot/h$. Moreover, we find the alignment signal to increase significantly with the subhalo mass. Third, when a galaxy is close to filaments (less than $0.25$ Mpc$/h$), the galaxy alignment toward the nearest galaxy group depends on the galaxy subhalo mass. Finally, we find that galaxies close to filaments or groups tend to be rounder than those away from filaments or groups.

Y. Chen, S. Ho, A. Tenneti, et. al.
Wed, 19 Aug 15
5/32

Comments: 11 pages, 10 figures

|

# Improving the precision matrix for precision cosmology [CEA]

The estimation of cosmological constraints from observations of the large scale structure of the Universe, such as the power spectrum or the correlation function, requires the knowledge of the inverse of the associated covariance matrix, namely the precision matrix, $\mathbf{\Psi}$. In most analyses, $\mathbf{\Psi}$ is estimated from a limited set of mock catalogues. Depending on how many mocks are used, this estimation has an associated error which must be propagated into the final cosmological constraints. For future surveys such as Euclid and DESI, the control of this additional uncertainty requires a prohibitively large number of mock catalogues. In this work we test a novel technique for the estimation of the precision matrix, the covariance tapering method, in the context of baryon acoustic oscillation measurements. Even though this technique was originally devised as a way to speed up maximum likelihood estimations, our results show that it also reduces the impact of noisy precision matrix estimates on the derived confidence intervals, without introducing biases on the target parameters. The application of this technique can help future surveys to reach their true constraining power using a significantly smaller number of mock catalogues.

D. Paz and A. Sanchez
Fri, 14 Aug 15
43/49

Comments: 9 pages, 7 figures, submitted to MNRAS

|

# Trans-Dimensional Bayesian Inference for Gravitational Lens Substructures [IMA]

We introduce a Bayesian solution to the problem of inferring the density profile of strong gravitational lenses when the lens galaxy may contain multiple dark or faint substructures. The source and lens models are based on a superposition of an unknown number of non-negative basis functions (or “blobs”) whose form was chosen with speed as a primary criterion. The prior distribution for the blobs’ properties is specified hierarchically, so the mass function of substructures is a natural output of the method. We use reversible jump Markov Chain Monte Carlo (MCMC) within Diffusive Nested Sampling (DNS) to sample the posterior distribution and evaluate the marginal likelihood of the model, including the summation over the unknown number of blobs in the source and the lens. We demonstrate the method on a simulated data set with a single substructure, which is recovered well with moderate uncertainties. We also apply the method to the g-band image of the “Cosmic Horseshoe” system, and find some hints of potential substructures. However, we caution that such results could also be caused by misspecifications in the model (such as the shape of the smooth lens component or the point spread function), which are difficult to guard against in full generality.

B. Brewer, D. Huijser and G. Lewis
Wed, 5 Aug 15
5/46

Comments: Submitted. 10 pages, 10 figures

|

# Weighted ABC: a new strategy for cluster strong lensing cosmology with simulations [CEA]

Comparisons between observed and predicted strong lensing properties of galaxy clusters have been routinely used to claim either tension or consistency with $\Lambda$CDM cosmology. However, standard approaches to such cosmological tests are unable to quantify the preference for one cosmology over another. We advocate using a weighted’ variant of approximate Bayesian computation (ABC), whereby the parameters of the scaling relation between Einstein radii and cluster mass, $\alpha$ and $\beta$, are treated as summary statistics. We demonstrate, for the first time, a method of estimating the likelihood of the data under the $\Lambda$CDM framework, using the X-ray selected $z>0.5$ MACS clusters as a case in point and employing both N-body and hydrodynamic simulations of clusters. We investigate the uncertainty in the calculated likelihood, and consequential ability to compare competing cosmologies, that arises from incomplete descriptions of baryonic processes, discrepancies in cluster selection criteria, redshift distribution, and dynamical state. The relation between triaxial cluster masses at various overdensities provide a promising alternative to the strong lensing test.

M. Killedar, S. Borgani, D. Fabjan, et. al.
Wed, 22 Jul 15
34/59

Comments: 15 pages, 6 figures, 1 table, submitted to MNRAS, comments welcome

|

# The Overlooked Potential of Generalized Linear Models in Astronomy-III: Bayesian Negative Binomial Regression and Globular Cluster Populations [IMA]

In this paper, the third in a series illustrating the power of generalized linear models (GLMs) for the astronomical community, we elucidate the potential of the class of GLMs which handles count data. The size of a galaxy’s globular cluster population $N_{\rm GC}$ is a prolonged puzzle in the astronomical literature. It falls in the category of count data analysis, yet it is usually modelled as if it were a continuous response variable. We have developed a Bayesian negative binomial regression model to study the connection between $N_{\rm GC}$ and the following galaxy properties: central black hole mass, dynamical bulge mass, bulge velocity dispersion, and absolute visual magnitude. The methodology introduced herein naturally accounts for heteroscedasticity, intrinsic scatter, errors in measurements in both axes (either discrete or continuous), and allows modelling the population of globular clusters on their natural scale as a non-negative integer variable. Prediction intervals of 99% around the trend for expected $N_{\rm GC}$comfortably envelope the data, notably including the Milky Way, which has hitherto been considered a problematic outlier. Finally, we demonstrate how random intercept models can incorporate information of each particular galaxy morphological type. Bayesian variable selection methodology allows for automatically identifying galaxy types with different productions of GCs, suggesting that on average S0 galaxies have a GC population 35% smaller than other types with similar brightness.

R. Souza, J. Hilbe, B. Buelens, et. al.
Wed, 17 Jun 15
42/47

Comments: 14 pages, 12 figures. Comments are welcome

|

# Cosmic web-type classification using decision theory [CEA]

We propose a decision criterion for segmenting the cosmic web into different structure types (voids, sheets, filaments and clusters) on the basis of their respective probabilities and the strength of data constraints. Our approach is inspired by an analysis of games of chance where the gambler only plays if a positive expected net gain can be achieved based on some degree of privileged information. The result is a general solution for classification problems in the face of uncertainty, including the option of not committing to a class for a candidate object. As an illustration, we produce high-resolution maps of web-type constituents in the nearby Universe as probed by the Sloan Digital Sky Survey main galaxy sample. Other possible applications include the selection and labeling of objects in catalogs derived from astronomical survey data.

F. Leclercq, J. Jasche and B. Wandelt
Wed, 4 Mar 15
45/45

Comments: 5 pages, 2 figures, submitted to A&A Letters

|

# Constrained correlation functions from the Millennium Simulation [CEA]

Context. In previous work, we developed a quasi-Gaussian approximation for the likelihood of correlation functions, which, in contrast to the usual Gaussian approach, incorporates fundamental mathematical constraints on correlation functions. The analytical computation of these constraints is only feasible in the case of correlation functions of one-dimensional random fields.
Aims. In this work, we aim to obtain corresponding constraints in the case of higher-dimensional random fields and test them in a more realistic context.
Methods. We develop numerical methods to compute the constraints on correlation functions which are also applicable for two- and three-dimensional fields. In order to test the accuracy of the numerically obtained constraints, we compare them to the analytical results for the one-dimensional case. Finally, we compute correlation functions from the halo catalog of the Millennium Simulation, check whether they obey the constraints, and examine the performance of the transformation used in the construction of the quasi-Gaussian likelihood.
Results. We find that our numerical methods of computing the constraints are robust and that the correlation functions measured from the Millennium Simulation obey them. Despite the fact that the measured correlation functions lie well inside the allowed region of parameter space, i.e. far away from the boundaries of the allowed volume defined by the constraints, we find strong indications that the quasi-Gaussian likelihood yields a substantially more accurate description than the Gaussian one.

P. Wilking, R. Roseler and P. Schneider
Tue, 17 Feb 15
58/60

Comments: 11 pages, 13 figures, submitted to A&A

|

# Fast Bayesian Inference for Exoplanet Discovery in Radial Velocity Data [IMA]

Inferring the number of planets $N$ in an exoplanetary system from radial velocity (RV) data is a challenging task. Recently, it has become clear that RV data can contain periodic signals due to stellar activity, which can be difficult to distinguish from planetary signals. However, even doing the inference under a given set of simplifying assumptions (e.g. no stellar activity) can be difficult. It is common for the posterior distribution for the planet parameters, such as orbital periods, to be multimodal and to have other awkward features. In addition, when $N$ is unknown, the marginal likelihood (or evidence) as a function of $N$ is required. Rather than doing separate runs with different trial values of $N$, we propose an alternative approach using a trans-dimensional Markov Chain Monte Carlo method within Nested Sampling. The posterior distribution for $N$ can be obtained with a single run. We apply the method to $\nu$ Oph and Gliese 581, finding moderate evidence for additional signals in $\nu$ Oph with periods of 36.11 $\pm$ 0.034 days, 75.58 $\pm$ 0.80 days, and 1709 $\pm$ 183 days; the posterior probability that at least one of these exists is 85%. The results also suggest Gliese 581 hosts many (7-15) “planets” (or other causes of other periodic signals), but only 4-6 have well determined periods. The analysis of both of these datasets shows phase transitions exist which are difficult to negotiate without Nested Sampling.

B. Brewer and C. Donovan
Thu, 29 Jan 15
36/49

Comments: Accepted for publication in MNRAS. 9 pages, 12 figures. Code at this http URL

|

# Cosmic Web Reconstruction through Density Ridges: Method and Algorithm [CEA]

The detection and characterization of filamentary structures in the cosmic web allows cosmologists to constrain parameters that dictates the evolution of the Universe. While many filament estimators have been proposed, they generally lack estimates of uncertainty, reducing their inferential power. In this paper, we demonstrate how one may apply the Subspace Constrained Mean Shift (SCMS) algorithm (Ozertem and Erdogmus (2011); Genovese et al. (2012)) to uncover filamentary structure in galaxy data. The SCMS algorithm is a gradient ascent method that models filaments as density ridges, one-dimensional smooth curves that trace high-density regions within the point cloud. We also demonstrate how augmenting the SCMS algorithm with bootstrap-based methods of uncertainty estimation allows one to place uncertainty bands around putative filaments. We apply the SCMS method to datasets sampled from the P3M N-body simulation, with galaxy number densities consistent with SDSS and WFIRST-AFTA and to LOWZ and CMASS data from the Baryon Oscillation Spectroscopic Survey (BOSS). To further assess the efficacy of SCMS, we compare the relative locations of BOSS filaments with galaxy clusters in the redMaPPer catalog, and find that redMaPPer clusters are significantly closer (with p-values $< 10^{-9}$) to SCMS-detected filaments than to randomly selected galaxies.

Y. Chen, S. Ho, P. Freeman, et. al.
Fri, 23 Jan 15
65/65

Comments: 15 pages, 15 figures, 1 table

|

# Relative distribution of dark matter and stellar mass in three massive galaxy clusters [CEA]

This work observationally addresses the relative distribution of total and optically luminous matter in galaxy clusters by computing the radial profile of the stellar-to-total mass ratio. We adopt state-of-the-art accurate lensing masses free from assumptions about the mass radial profile and we use extremely deep multicolor wide–field optical images to distinguish star formation from stellar mass, to properly calculate the mass in galaxies of low mass, those outside the red sequence, and to allow a contribution from galaxies of low mass that is clustercentric dependent. We pay special attention to issues and contributions that are usually underrated, yet are major sources of uncertainty, and we present an approach that allows us to account for all of them. Here we present the results for three very massive clusters at $z\sim0.45$, MACSJ1206.2-0847, MACSJ0329.6-0211, and RXJ1347.5-1145. We find that stellar mass and total matter are closely distributed on scales from about 150 kpc to 2.5 Mpc: the stellar-to-total mass ratio is radially constant. We find that the characteristic mass stays constant across clustercentric radii and clusters, but that the less-massive end of the galaxy mass function is dependent on the environment.

S. Andreon
Fri, 9 Jan 15
15/49

Comments: A&A, in press

|

# A Unifying Theory for Scaling Laws of Human Populations [CL]

The spatial distribution of people exhibits clustering across a wide range of scales, from household (~$10^{-2}$ km) to continental (~$10^4$ km) scales. Empirical data indicates simple power-law scalings for the size distribution of cities (known as Zipf’s law), the geographic distribution of friends, and the population density fluctuations as a function of scale. We derive a simple statistical model that explains all of these scaling laws based on a single unifying principle involving the random spatial growth of clusters of people on all scales. The model makes important new predictions for the spread of diseases and other social phenomena.

H. Lin and A. Loeb
Wed, 7 Jan 15
32/67

Comments: 13 pages, 2 figures, press embargo until published

# Bayesian inference of CMB gravitational lensing [IMA]

The Planck satellite, along with ground based telescopes such as the Atacama Cosmology Telescope (ACT) and the South Pole Telescope (SPT), have mapped the cosmic microwave background (CMB) at such an unprecedented resolution as to allow a detection of the subtle distortions due to the gravitational influence of intervening dark matter. This distortion is called gravitational lensing and has become a powerful probe of cosmology and dark matter. Estimating gravitational lensing is important for two reasons. First, the weak lensing estimates can be used to construct a map of dark matter which would be invisible otherwise. Second, weak lensing estimates can, in principle, un-lense the observed CMB to construct the original CMB radiation fluctuations. Both of these maps, the unlensed CMB radiation field and the dark matter field, are deep probes of cosmology and cosmic structure. Bayesian techniques seem a perfect fit for the statistical analysis of lensing and the CMB. One reason is that the priors for the unlensed CMB and the lensing potential are—very nearly—Gaussian random fields. The Gaussianity coming from physically predicted quantum randomness in the early universe. However, challenges associated with a full Bayesian analysis have prevented previous attempts at developing a working Bayesian prototype. In this paper we solve many of these obstacles with a re-parameterization of CMB lensing. This allows us to obtain draws from the Bayesian lensing posterior of both the lensing potential and the unlensed CMB which converges remarkably fast.

E. Anderes, B. Wandelt and G. Lavaux
Mon, 15 Dec 14
46/53

|

# Bayesian Evidence and Model Selection [CL]

In this paper we review the concept of the Bayesian evidence and its application to model selection. The theory is presented along with a discussion of analytic, approximate and numerical techniques. Application to several practical examples within the context of signal processing are discussed.

K. Knuth, M. Habeck, N. Malakar, et. al.
Thu, 13 Nov 14
38/49

Comments: 39 pages, 8 figures. Submitted to DSP. Features theory, numerical methods and four applications

# Densities mixture unfolding for data obtained from detectors with finite resolution and limited acceptance [CL]

A procedure based on a Mixture Density Model for correcting experimental data for distortions due to finite resolution and limited detector acceptance is presented. Addressing the case that the solution is known to be non-negative, in the approach presented here the true distribution is estimated by a weighted sum of probability density functions with positive weights and with the width of the densities acting as a regularisation parameter responsible for the smoothness of the result. To obtain better smoothing in less populated regions, the width parameter scales inversely proportional to the square root of estimated density. Furthermore, the non-negative garrotte method is used to find the most economic representation of the solution. Cross-validation is employed to determine the optimal values of the resolution and garrotte parameters. The proposed approach is directly applicable to multidimensional problems. Numerical examples in one and two dimensions are presented to illustrate the procedure.

N. Gagunashvili
Wed, 8 Oct 14
68/68

Comments: 25 pages, 14 figures. arXiv admin note: text overlap with arXiv:1209.3766

# Proceedings of the First Astrostatistics School: Bayesian Methods in Cosmology [GA]

These are the proceedings of the First Astrostatistics School: Bayesian Methods in Cosmology, held in Bogot\’a D.C., Colombia, June 9-13, 2014. The first astrostatistics school has been the first event in Colombia where statisticians and cosmologists from some universities in Bogot\’a met to discuss the statistic methods applied to cosmology, especially the use of Bayesian statistics in the study of Cosmic Microwave Background (CMB), Baryonic Acoustic Oscillations (BAO), Large Scale Structure (LSS) and weak lensing.

H. Hortua
Tue, 16 Sep 14
21/63

Comments: Proceedings (in Spanish), supplemental electronic material accessible via links within the PDF, Cuadernos en Estadistica aplicada, Edicion especial, (2014)

# The impact of super-survey modes on cosmological constraints from cosmic shear fields [CEA]

Owing to the mass-sheet degeneracy, cosmic shear maps do not probe directly the Fourier modes of the underlying mass distribution on scales comparable to the survey size and larger. To assess the corresponding effect on attainable cosmological parameter constraints, we quantify the information on super-survey modes in a lognormal model and, when interpreted as nuisance parameters, their degeneracies to cosmological parameters. Our analytical and numerical calculations clarify the central role of super-sample covariance (SSC) in shaping the statistical power of cosmological observables. Reconstructing the background modes from their non-Gaussian statistical dependence to small scales modes yields the renormalized convergence. This diagonalizes the spectrum covariance matrix, and the information content of the corresponding power spectrum is increased by a factor of two over standard methods. Unfortunately, careful calculation of the Cramer-Rao bound shows that the information recovery can never be made complete, any observable built from shear fields, including optimal sufficient statistics, are subject to severe information loss, typically $80\%$ to $90\%$ below $\ell \sim 3000$ for generic cosmological parameters. The lost information can only be recovered from additional, non-shear based data. Our predictions hold just as well for a tomographic analysis, and/or full sky surveys.

J. Carron and I. Szapudi
Mon, 11 Aug 14
43/55

Comments: 11 pages, 4 figures, submitted

|

# Estimating the distribution of Galaxy Morphologies on a continuous space [GA]

The incredible variety of galaxy shapes cannot be summarized by human defined discrete classes of shapes without causing a possibly large loss of information. Dictionary learning and sparse coding allow us to reduce the high dimensional space of shapes into a manageable low dimensional continuous vector space. Statistical inference can be done in the reduced space via probability distribution estimation and manifold estimation.

G. Vinci, P. Freeman, J. Newman, et. al.
Tue, 1 Jul 14
57/70

Comments: 4 pages, 3 figures, Statistical Challenges in 21st Century Cosmology, Proceedings IAU Symposium No. 306, 2014

# Fitting FFT-derived Spectra: Theory, Tool, and Application to Solar Radio Spike Decomposition [SSA]

Spectra derived from fast Fourier transform (FFT) analysis of time-domain data intrinsically contain statistical fluctuations whose distribution depends on the number of accumulated spectra contributing to a measurement. The tail of this distribution, which is essential for separation of the true signal from the statistical fluctuations, deviates noticeably from the normal distribution for a finite number of the accumulations. In this paper we develop a theory to properly account for the statistical fluctuations when fitting a model to a given accumulated spectrum. The method is implemented in software for the purpose of automatically fitting a large body of such FFT-derived spectra. We apply this tool to analyze a portion of a dense cluster of spikes recorded by our FST instrument during a record-breaking event that occurred on 06 Dec 2006. The outcome of this analysis is briefly discussed.

G. Nita, G. Fleishman, D. Gary, et. al.
Tue, 10 Jun 14
32/60

Comments: Accepted to ApJ, 57 pages, 16 figures

# The insignificant evolution of the richness-mass relation of galaxy clusters [GA]

We analysed the richness-mass scaling of 23 very massive clusters at $0.15<z<0.55$ with homogenously measured weak-lensing masses and richnesses within a fixed aperture of $0.5$ Mpc radius. We found that the richness-mass scaling is very tight (the scatter is $<0.09$ dex with 90 \% probability) and independent on cluster evolutionary status and morphology. This implies a close association between infall/evolution of dark matter and galaxies in the central region of clusters. We also found that the evolution of the richness-mass intercept is minor at most, and, given the small mass evolution across the studied redshift range, also the richness evolution of individual massive clusters turns out to be very small. Finally, we found that it is of paramount importance to account for the cluster mass function and the selection function. Ignoring them would led to biases larger than the (otherwise quoted) errors. Our study benefits from: a) weak-lensing masses instead of proxy-based masses, removing the ambiguity between a real trend and one induced by an accounted evolution of the used mass proxy; b) the use of projected masses simplifies the statistical analysis, not requiring consideration of the unknown covariance induced by the cluster orientation/triaxiality; c) the use of aperture masses, free from the pseudo-evolution of mass definitions anchored to the evolving Universe density; d) accounting for the sample selection function and for the Malmquist-like effect induced by the cluster mass function; e) cosmological numerical simulations for the computation of the cluster mass function, its evolution, and the mass growth of each individual cluster.

S. Andreon and P. Congdon
Mon, 9 Jun 14
16/40

Comments: A&A, in press

# Astrophysical data analysis with information field theory [IMA]

Non-parametric imaging and data analysis in astrophysics and cosmology can be addressed by information field theory (IFT), a means of Bayesian, data based inference on spatially distributed signal fields. IFT is a statistical field theory, which permits the construction of optimal signal recovery algorithms. It exploits spatial correlations of the signal fields even for nonlinear and non-Gaussian signal inference problems. The alleviation of a perception threshold for recovering signals of unknown correlation structure by using IFT will be discussed in particular as well as a novel improvement on instrumental self-calibration schemes. IFT can be applied to many areas. Here, applications in in cosmology (cosmic microwave background, large-scale structure) and astrophysics (galactic magnetism, radio interferometry) are presented.

T. Ensslin
Mon, 2 Jun 14
43/56

Comments: 4 pages, 2 figures, accepted chapter to the conference proceedings for MaxEnt 2013, to be published by AIP

|

# Statistical characterization of polychromatic absolute and differential squared visibilities obtained from AMBER/VLTI instrument [IMA]

In optical interferometry, the visibility squared modulus are generally assumed to follow a Gaussian distribution and to be independent of each other. A quantitative analysis of the relevance of such assumptions is important to help improving the exploitation of existing and upcoming multi-wavelength interferometric instruments. Analyze the statistical behaviour of both the absolute and the colour-differential squared visibilities: distribution laws, correlations and cross-correlations between different baselines. We use observations of stellar calibrators obtained with AMBER instrument on VLTI in different instrumental and observing configurations, from which we extract the frame-by-frame transfer function. Statistical hypotheses tests and diagnostics are then systematically applied. For both absolute and differential squared visibilities and under all instrumental and observing conditions, we find a better fit for the Student distribution than for the Gaussian, log-normal and Cauchy distributions. We find and analyze clear correlation effects caused by atmospheric perturbations. The differential squared visibilities allow to keep a larger fraction of data with respect to selected absolute squared visibilities and thus benefit from reduced temporal dispersion, while their distribution is more clearly characterized. The frame selection based on the criterion of a fixed SNR value might result in either a biased sample of frames or in a too severe selection.

A. Schutz, M. Vannier, D. Mary, et. al.
Thu, 15 May 14
12/55

Comments: A&A, 13 pages and 9 figures

|

# Functional Regression for Quasar Spectra [CL]

The Lyman-alpha forest is a portion of the observed light spectrum of distant galactic nuclei which allows us to probe remote regions of the Universe that are otherwise inaccessible. The observed Lyman-alpha forest of a quasar light spectrum can be modeled as a noisy realization of a smooth curve that is affected by a damping effect’ which occurs whenever the light emitted by the quasar travels through regions of the Universe with higher matter concentration. To decode the information conveyed by the Lyman-alpha forest about the matter distribution, we must be able to separate the smooth continuum’ from the noise and the contribution of the damping effect in the quasar light spectra. To predict the continuum in the Lyman-alpha forest, we use a nonparametric functional regression model in which both the response and the predictor variable (the smooth part of the damping-free portion of the spectrum) are function-valued random variables. We demonstrate that the proposed method accurately predicts the unobservable continuum in the Lyman-alpha forest both on simulated spectra and real spectra. Also, we introduce distribution-free prediction bands for the nonparametric functional regression model that have finite sample guarantees. These prediction bands, together with bootstrap-based confidence bands for the projection of the mean continuum on a fixed number of principal components, allow us to assess the degree of uncertainty in the model predictions.

M. Ciollaro, J. Cisewski, P. Freeman, et. al.
Mon, 14 Apr 14
4/41

# Use of spatial cross correlation function to study formation mechanism of massive elliptical galaxies [CL]

Spatial clustering nature of galaxies have been studied previously through auto correlation function. The same type of cross correlation function has been used to investigate parametric clustering nature of galaxies e.g. with respect to masses and sizes of galaxies.
Here formation and evolution of several components of nearby massive early type galaxies have been envisaged through cross correlations, in the mass-size parametric plane, with high redshift early type galaxies (hereafter ETG).It is found that the inner most components of nearby ETGs have significant correlation with ETGs in the highest redshift range called red nuggets whereas intermediate components are highly correlated with ETGs in the redshift range with z value greater than 0.5 and less than 0.75. The outer most part has no correlation in any range, suggesting a scenario through in situ accretion. The above formation scenario is consistent with the previous results obtained for NGC5128 (Chattopadhyay et al. (2009), Chattopadhyay et al. (2013)) and to some extent for nearby elliptical galaxies (Huang et al. (2013)) after considering a sample of ETGs at high redshift with stellar masses greater than or equal to 108.73 M-Sun. So the present work indicates a three phase formation of massive nearby elliptical galaxies instead of two as discussed in previous works.

T. De, T. Chattopadhyay and A. Chattopadhyay
Thu, 6 Mar 14
20/53

# Type Ia Supernova Colors and Ejecta Velocities: Hierarchical Bayesian Regression with Non-Gaussian Distributions [CEA]

We investigate the correlations between the peak intrinsic colors of Type Ia supernovae (SN Ia) and their expansion velocities at maximum light, measured from the Si II 6355 A spectral feature. We construct a new hierarchical Bayesian regression model and Gibbs sampler to estimate the dependence of the intrinsic colors of a SN Ia on its ejecta velocity, while accounting for the random effects of intrinsic scatter, measurement error, and reddening by host galaxy dust. The method is applied to the apparent color data from BVRI light curves and Si II velocity data for 79 nearby SN Ia. Comparison of the apparent color distributions of high velocity (HV) and normal velocity (NV) supernovae reveals significant discrepancies in B-V and B-R, but not other colors. Hence, they are likely due to intrinsic color differences originating in the B-band, rather than dust reddening. The mean intrinsic B-V and B-R color differences between HV and NV groups are 0.06 +/- 0.02 and 0.09 +/- 0.02 mag, respectively. Under a linear model for intrinsic B-V and B-R colors versus velocity, we find significant slopes of -0.021 +/- 0.006 and -0.030 +/- 0.009 mag/(1000 km/s), respectively. Since the ejecta velocity distribution is skewed towards high velocities, these effects imply non-Gaussian intrinsic color population distributions with skewness up to +0.3. Accounting for the intrinsic color-velocity correlation results in corrections in A_V extinction estimates as large as -0.12 mag for HV SN Ia and +0.06 mag for NV events. Deviance information criteria strongly favor simple functions for intrinsic colors versus velocity over no trend, while higher-order polynomials are disfavored. Velocity measurements from SN Ia spectra have potential to diminish systematic errors from the confounding of intrinsic colors and dust reddening affecting supernova distances.

K. Mandel, R. Foley and R. Kirshner
Mon, 3 Mar 14
19/55

|