Surveillance of dengue vectors using spatio-temporal Bayesian modeling

C. Costa, Ana Carolina; Codeço, Cláudia T.; Honório, Nildimar A.; Pereira, Gláucio R.; N. Pinheiro, Carmen Fátima; Nobre, Aline A.

doi:10.1186/s12911-015-0219-6

Research Article
Open access
Published: 13 November 2015

BMC Medical Informatics and Decision Making volume 15, Article number: 93 (2015)

Abstract

Background

At present, dengue control focuses on reducing the density of the primary vector for the disease, Aedes aegypti, which is the only vulnerable link in the chain of transmission. The use of new approaches for dengue entomological surveillance is extremely important, since present methods are inefficient. With this in mind, the present study seeks to analyze the spatio-temporal dynamics of A. aegypti infestation with oviposition traps, using efficient computational methods. These methods will allow for the implementation of the proposed model and methodology into surveillance and monitoring systems.

Methods

The study area includes a region in the municipality of Rio de Janeiro, characterized by high population density, precarious domicile construction, and a general lack of infrastructure around it. Two hundred and forty traps were distributed in eight different sentinel areas, in order to continually monitor immature Aedes aegypti and Aedes albopictus mosquitoes. Collections were done weekly between November 2010 and August 2012. The relationship between egg number and climate and environmental variables was considered and evaluated through Bayesian zero-inflated spatio-temporal models. Parametric inference was performed using the Integrated Nested Laplace Approximation (INLA) method.

Results

Infestation indexes indicated that ovipositing occurred during the entirety of the study period. The distance between each trap and the nearest boundary of the study area, minimum temperature and accumulated rainfall were all significantly related to the number of eggs present in the traps. Adjusting for the interaction between temperature and rainfall led to a more informative surveillance model, as such thresholds offer empirical information about the favorable climatic conditions for vector reproduction. Data were characterized by moderate time (0.29 – 0.43) and spatial (21.23 – 34.19 m) dependencies. The models also identified spatial patterns consistent with human population density in all sentinel areas. The results suggest the need for weekly surveillance in the study area, using traps allocated between 18 and 24 m, in order to understand the dengue vector dynamics.

Conclusions

Aedes aegypti, due to its short generation time and strong response to climate triggers, tends to show eruptive dynamics that are difficult to predict and understand through temporal or spatial models alone. The proposed methodology allowed for the rapid and efficient implementation of spatio-temporal models that considered zero-inflation and the interaction between climate variables and patterns in oviposition, in such a way that the final model parameters contribute to the identification of priority areas for entomological surveillance.

Background

The complexity of dengue transmission has motivated the development of many studies to assess the numerous factors related to the circulation and persistence of this disease in human populations. While significant progress has been made, numerous questions remain unanswered, and an effective dengue control plan is still an open problem. No effective vaccines or antiviral drugs are currently available, impeding a direct intervention on the human link in the dengue transmission cycle. As such, current control measures are still focused on reducing the vectorial capacity of Aedes aegypti.

The Household Infestation index (HI) and the Breteau index (BI) are the standard mosquito abundance measurements used to evaluate the effectiveness of vector control strategies [1]. However, these indices, based on the survey of breeding sites with mosquito larvae, are poorly correlated with the abundance of the adult mosquito population, which is directly responsible for the disease transmission [2]. For example, a study has shown high dengue incidence when HI was below 3 % in Salvador city, Northeastern Brazil [3].

The ovitrap has been proposed as an alternative tool for Aedes aegypti monitoring in areas with low mosquito abundance [4]. This trap was created by Fay and Eliason [5] and perfected by Reiter and Gubler [6], and provides measures of ovipositing activity. Although egg counts are not a direct measure of mosquito abundance, studies have shown a strong correlation between egg count and the density of the female mosquito population [7].

Despite the advantages of trapping over larval surveys, this approach is still scarcely applied to surveillance. One of the main reasons for this is that the sampling and statistical properties of the measurements produced are not yet entirely understood. In this study, data from a long-term surveillance program carried out under very controlled conditions in eight sentinel areas provide an opportunity to develop and test innovative models and contribute to the development of an analytical framework to be implemented in Aedes aegypti monitoring systems. Because the proposed models are computationally intensive, more efficient methods that allow for their implementation in surveillance and monitoring systems were investigated. Such information contributes to the development of Aedes control activities focused on areas and time periods affected by the most severe mosquito infestations.

Methods

Study area

The study area is the Manguinhos campus of the Oswaldo Cruz Foundation (Fiocruz), located in the city of Rio de Janeiro, Brazil (22°52′30″S, 43°14′53″W; 697,000 m²). The area surrounding the campus is characterized by densely populated urban zones, precarious living conditions and a general lack of infrastructure [8]. Eight sentinel areas were identified for continuous monitoring of immature Aedes aegypti and Aedes albopictus, each one representing different degrees of forest cover, distances to neighbouring residential areas, and intensity of human movement and presence. Sentinel Areas (SA) were designated as SA1–SA8 (Fig. 1). Table 1 describes the area and the vegetation-type percentage for each SA.

Table 1 Descriptors

Entomological monitoring

Thirty ovitraps were randomly placed in each SA with a minimum distance of 20 m between traps. Traps consisted of black plastic pots containing water, hay infusion and a eucatex paddle. Egg collections occurred weekly from November 2010 to August 2012, for a total of 89 uninterrupted weeks. The collected material was transported to the Sentinel Operational Unit of Mosquito Vectors (NOSMOVE/Fiocruz). Each paddle was carefully inspected for egg positivity and, when confirmed, the number of eggs was quantified. The sampling using ovitraps is part of the project “Monitoring of populations of Aedes aegypti on the Fiocruz campus” which is coordinated by Dr. Honório and was approved by the Vice-Presidency of Environment, Healthcare and Health Promotion (VPAAPS) of Fiocruz.

Climate and environmental variables

Maximum and minimum temperatures were collected from the São Cristóvão meteorological station, located at a 3 km-distance from the study area, while accumulated rainfall was extracted from the Penha station, roughly 5 km away (source: Sistema Alerta Rio website [9], operated by the city of Rio de Janeiro).

To delimit the area of each SA, 50 m-radius influence zones were created for each trap, based on its geographical location, using ArcGIS 10.0 software. In order to calculate the percent green cover for each of the eight areas, a campus map was intersected with the oviposition traps by vectorizing a high-resolution satellite image. Finally, the distance between each trap and the nearest boundary of the study area was calculated and is hereafter referred to as Border Distance.

Spatio-temporal modeling

Figure 2a shows weekly average egg density in each of the eight sentinel areas, together with zero-egg frequency, which varied from 24 to 58 %. Ignoring zero-inflation has two possible consequences: 1) biased estimation of model parameters and standard errors, and 2) overdispersion. Due to the high frequency of zeros, a zero-inflated model was considered. The modeling approach was as follows: first, a Binomial (Bin) distribution was used to model the zero occurrence probability; then, the non-null observations were modeled using a Zero-Altered Poisson (ZAP) distribution [10]. The underlying assumption is that two separate ecological processes are occurring: the presence of eggs is driven by the mosquito's choice of the ovitrap for ovipositing, and the abundance of eggs is driven by the number of females that chose the ovitrap. This can be formalized as follows:

$$f_{ZAP}(y;\zeta,\mu)=\begin{cases} 1-\zeta, & y=0\\ \zeta \times f_{ZAP}\left(y;\mu\right), & y>0, \end{cases}$$

in which 1−ζ represents the probability of the absence of oviposition. Analogously, ζ is the probability of the occurrence of oviposition and will be referred to as positivity.
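
The two-part density above can be sketched numerically. The following is a minimal illustration, assuming that the positive-count component is a zero-truncated Poisson, which is the usual construction for a zero-altered (hurdle) Poisson model; the function names and the values of ζ and μ are hypothetical and chosen only for illustration.

```python
import math


def zero_truncated_poisson_pmf(y: int, mu: float) -> float:
    """P(Y = y | Y > 0) for a Poisson(mu) count; zero for y < 1."""
    if y < 1:
        return 0.0
    log_pmf = y * math.log(mu) - mu - math.lgamma(y + 1)
    # Renormalize by the probability of a positive count, 1 - exp(-mu).
    return math.exp(log_pmf) / (-math.expm1(-mu))


def zap_pmf(y: int, zeta: float, mu: float) -> float:
    """Hurdle Poisson: mass 1 - zeta at zero, zeta times the truncated Poisson above."""
    if y == 0:
        return 1.0 - zeta
    return zeta * zero_truncated_poisson_pmf(y, mu)


# Hypothetical positivity and mean, for illustration only.
zeta, mu = 0.6, 12.0
total = sum(zap_pmf(y, zeta, mu) for y in range(0, 200))
print(f"P(Y=0) = {zap_pmf(0, zeta, mu):.2f}, total mass = {total:.6f}")
```

Because the zero mass is set by ζ alone, the probability of an empty trap can be modeled separately from the egg count in positive traps, which is the separation of ecological processes described above.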

Let the random variable Y(s_i,t) represent the number of eggs in trap i, i=1,…,30, at week t, for t=1,…,89. Moreover, let y(s_i,t) be the realization of the spatio-temporal process Y(s_i,t) when oviposition occurs. It is assumed that y(s_i,t) has a ZAP distribution with mean μ(s_i,t), described by the following equations:

$$y(s_{i},t) \mid y(s_{i},t) > 0 \sim ZAP(\mu(s_{i},t)) \tag{1}$$

$$\log(\mu(s_{i},t)) = z(s_{i},t)\beta+\xi(s_{i},t)+\varepsilon(s_{i},t), \tag{2}$$

$$\xi(s_{i},t) = a\,\xi(s_{i},t-1)+\omega(s_{i},t) \tag{3}$$

for t>1, where z(s_i,t)=(z₁(s_i,t),…,z_p(s_i,t)) denotes the vector of p covariates for trap i at time t, and β=(β₁,…,β_p)^′ is the vector of coefficients representing their effects. Additionally, $\varepsilon (s_{i},t) \sim N\left (0,\sigma ^{2}_{\varepsilon }\right)$ is the measurement error defined by a Gaussian white noise, both serially and spatially uncorrelated. In the geostatistics literature, the term z(s_i,t)β is referred to as the large-scale component (in this case depending on meteorological and environmental covariates), while the variance $\sigma ^{2}_{\varepsilon }$ is called the nugget effect [11]. Finally, ξ(s_i,t) is a space-time Gaussian Field (GF) that follows first-order autoregressive dynamics, with temporal correlation coefficient a and evolution error given by ω(s_i,t), in which |a| <1 and ξ(s_i,1) is derived from the stationary distribution $N\left (0,\sigma ^{2}_{\omega }/(1-a^{2})\right)$. Additionally, ω(s_i,t) has a Gaussian distribution with zero mean, no time dependence and characterized by the following covariance function

$$\text{Cov}(\omega(s_{i},t),\omega(s_{j},t^{\prime})) = \begin{cases} 0 & \text{if}\; t \neq t^{\prime}\\ \sigma^{2}_{\omega}\, \mathcal{C}(h) & \text{if}\; t=t^{\prime}, \end{cases}$$

for i≠j. The spatial correlation function $\mathcal {C}(h)$ depends on the spatial Euclidean distance between locations s_i and s_j, such that $h=\,\mid \mid s_{i} - s_{j} \mid \mid \;\in \mathbb {R}$. This way, the process is assumed to be second-order stationary and isotropic [11]. It follows immediately that $Var(\omega (s_{i},t))=\sigma ^{2}_{\omega }$, for each s_i and t. The spatial correlation function $\mathcal {C}(h)$ is defined by the Matérn covariance function

$$\mathcal{C}(h)=\frac{1}{\Gamma(\nu)2^{\nu-1}}(\kappa h)^{\nu} K_{\nu}(\kappa h), \tag{4}$$

with K_ν denoting the modified Bessel function of the second kind and order ν>0. The parameter ν, which is usually kept fixed, measures the smoothness of the process. In other words, ν controls the behavior of the covariance function for measurements that are separated by small distances. On the other hand, κ>0 is a scaling parameter related to the range ρ, i.e., a distance at which spatial correlation becomes small. In particular, we use the empirically derived definition $\rho =\frac {\sqrt {8\nu }}{\kappa }$, with ρ corresponding to the distance at which spatial correlation is approximately 0.1, for each ν (see Lindgren et al. [12] for further details).
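
To make the relation between κ, ν and the range ρ concrete, the sketch below evaluates the Matérn correlation of Eq. (4) using only the Python standard library. The smoothness ν = 1 is an assumption (the value used in the study is not stated here), and K_ν is computed from its integral representation purely to keep the example dependency-free; in practice a library routine such as `scipy.special.kv` would normally be used instead.

```python
import math


def bessel_k(nu: float, x: float, steps: int = 4000) -> float:
    """Modified Bessel function of the second kind, K_nu(x), for x > 0.

    Uses K_nu(x) = integral over t >= 0 of exp(-x cosh t) cosh(nu t),
    evaluated with the trapezoidal rule on a truncated interval.
    """
    upper = math.acosh(1.0 + 60.0 / x)  # integrand is negligible beyond this point
    step = upper / steps
    total = 0.5 * math.exp(-x)          # half weight at t = 0; far endpoint ignored
    for k in range(1, steps):
        t = k * step
        total += math.exp(-x * math.cosh(t)) * math.cosh(nu * t)
    return total * step


def matern_correlation(h: float, kappa: float, nu: float) -> float:
    """Matern correlation of Eq. (4); equals 1 at h = 0."""
    if h == 0.0:
        return 1.0
    x = kappa * h
    return x ** nu * bessel_k(nu, x) / (math.gamma(nu) * 2.0 ** (nu - 1.0))


def kappa_from_range(rho: float, nu: float) -> float:
    """Invert the range definition rho = sqrt(8 nu) / kappa."""
    return math.sqrt(8.0 * nu) / rho


nu = 1.0     # assumed smoothness, not taken from the text
rho = 21.23  # metres, the smallest range reported in the abstract
kappa = kappa_from_range(rho, nu)
for h in (5.0, 10.0, rho, 2.0 * rho):
    print(f"h = {h:6.2f} m  correlation = {matern_correlation(h, kappa, nu):.3f}")
```

Under these assumptions the correlation printed at h = ρ should be small, on the order of 0.1, which is the sense in which ρ is used as an empirical range.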

In order to identify the climate and environmental covariates that best predicted the number of eggs, ZAP models were fitted by area for each of the following climate variables: accumulated rainfall, minimum temperature and maximum temperature. As the effect of these variables on egg density may not be immediate, three-week lag periods were investigated for each variable, and the best lag was chosen using biological plausibility and the Deviance Information Criterion (DIC) [13], available in the R-INLA package [14]. Then, each selected climate variable was tested for interaction with every other climate variable in the model. Figure 2b shows the climate variables composing the model. Besides the aforementioned variables, the only environmental covariate considered for inclusion in the spatio-temporal model was border distance. Because the covariates were measured on different scales, each one was standardized by subtracting its mean and dividing by its standard deviation.
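
The covariate preparation described above (lagging a weekly series, then standardizing it) can be sketched as follows. This is an illustrative sketch only: the helper names and the rainfall values are hypothetical, and the use of the sample standard deviation is an assumption.

```python
from statistics import mean, stdev


def standardize(values):
    """Centre on the mean and scale by the sample standard deviation."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]


def lagged(values, lag):
    """Align a weekly series so that position t holds the value from week t - lag.

    The first `lag` weeks have no predecessor and are returned as None.
    """
    return [None] * lag + list(values[: len(values) - lag])


# Hypothetical weekly rainfall totals in mm, for illustration only.
rainfall = [0.0, 12.5, 30.1, 4.2, 0.0, 18.7, 25.3, 2.0]
rain_lag2 = lagged(rainfall, 2)
observed = [v for v in rain_lag2 if v is not None]
z = standardize(observed)
print([round(v, 2) for v in z])
```

After standardization, each regression coefficient refers to a change of one standard deviation in the corresponding covariate, which makes effects of rainfall and temperature comparable.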

We used 24 traps for model fitting (blue dots in Fig. 3) and the remaining six traps to validate the model (red triangles in Fig. 3). The predictive performance of the models was evaluated by calculating the percentage of validation observations that fell within the 95 % credibility intervals.
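
The validation criterion amounts to an interval coverage calculation, sketched below. The function name and all egg counts and interval bounds are hypothetical and serve only to illustrate the computation.

```python
def interval_coverage(observed, lower, upper):
    """Percentage of observations lying inside [lower, upper], element-wise."""
    if not (len(observed) == len(lower) == len(upper)):
        raise ValueError("all three sequences must have the same length")
    inside = sum(lo <= y <= hi for y, lo, hi in zip(observed, lower, upper))
    return 100.0 * inside / len(observed)


# Hypothetical held-out egg counts with 95 % interval bounds.
eggs = [0, 14, 52, 7, 120]
lo95 = [0, 5, 20, 0, 30]
hi95 = [10, 40, 60, 25, 95]
coverage = interval_coverage(eggs, lo95, hi95)
print(f"coverage = {coverage:.1f} %")
```

In this toy example only the largest count falls outside its interval, the kind of miss that occurs when a model underestimates extreme egg counts.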

In addition to the adjustment of individual models per SA, a hierarchical model was also considered for the whole set of sentinel areas.

Hierarchical model

This model is similar to the one presented above, but includes an area-specific random effect. In this analysis, from the total of 240 traps, 192 traps were used for model fitting and 48 for validation. As before, the interaction among climate variables was also tested, and the model was validated in the same way.

SPDE approach

ξ is a spatially structured GF through the effect of ω, so that it follows a multivariate Normal distribution. $\tilde {\Sigma }$ is the dense covariance matrix of ξ. The factorization of $\tilde {\Sigma }$ has a computational cost of order O(n³), which can pose a problem for large datasets. Thus, the suggestion is to represent the GF as a Gaussian Markov Random Field (GMRF) based on Stochastic Partial Differential Equations (SPDE) [15].

The GMRF is a process that models the spatial dependence of the data by areal unit, such as regular/irregular grids, or by geographic region. The primary advantage of using a GMRF instead of a GF lies in its computational properties. The computational advantage of making inference with a GMRF stems directly from the sparsity of the precision matrix $\tilde {\Sigma }^{-1}$, so that linear algebra operations can be performed using numerical methods for sparse matrices, resulting in a substantial computational gain [16].
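
The contrast between a dense covariance matrix and a sparse precision matrix can be illustrated with a much simpler Markov process than the SPDE construction: the stationary first-order autoregressive process that also governs the temporal dynamics in Eq. (3). The sketch below is not the SPDE approach itself; the parameter values are hypothetical, and the process length n is assumed to be at least 2.

```python
def ar1_covariance(n, a, sigma2):
    """Dense covariance of a stationary AR(1): sigma2 / (1 - a^2) * a^|i-j|."""
    scale = sigma2 / (1.0 - a * a)
    return [[scale * a ** abs(i - j) for j in range(n)] for i in range(n)]


def ar1_precision(n, a, sigma2):
    """Tridiagonal precision matrix (inverse covariance) of the same process."""
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        q[i][i] = (1.0 if i in (0, n - 1) else 1.0 + a * a) / sigma2
        if i + 1 < n:
            q[i][i + 1] = q[i + 1][i] = -a / sigma2
    return q


def matmul(x, y):
    """Plain matrix product for lists of rows."""
    return [[sum(xi * yj for xi, yj in zip(row, col)) for col in zip(*y)] for row in x]


n, a, sigma2 = 6, 0.4, 1.5  # hypothetical values
cov = ar1_covariance(n, a, sigma2)
prec = ar1_precision(n, a, sigma2)
product = matmul(prec, cov)  # should be the identity matrix
nonzeros = sum(v != 0.0 for row in prec for v in row)
print(f"non-zero entries: precision {nonzeros} of {n * n}, covariance {n * n} of {n * n}")
```

Every entry of the covariance matrix is non-zero, whereas the precision matrix has non-zero entries only between neighbours. The same kind of sparsity, defined over a spatial neighbourhood rather than consecutive time points, is what makes GMRF computations cheap.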

The SPDE approach identifies the GMRF, with local neighborhood structure and a sparse precision matrix, that best represents the Matérn field, i.e., a GF with a Matérn covariance function. Given this representation, inference can be carried out on the GMRF, exploiting its good computational properties. Essentially, the SPDE approach uses a finite element representation to define the Matérn field as a linear combination of basis functions defined on a triangulation of the domain $\mathcal {D}$. This consists of subdividing $\mathcal {D}$ into a set of non-intersecting triangles that share at most one edge or vertex [15]. Figure 3 illustrates the concept of triangulation in SA1.

Integrated nested Laplace approximation (INLA)

Let $\theta =\left \{\zeta,\beta,\sigma ^{2}_{\varepsilon },a,\sigma ^{2}_{\omega },\kappa \right \}$ denote the parameter vector to be estimated. The joint posterior distribution is given by:

$$\pi(\theta,\xi,\mu|{y}) \propto \pi({y}|\mu)\,\pi(\mu|\xi,\theta)\,\pi(\xi|\theta)\,\pi(\theta) \tag{5}$$

where π(·) denotes the probability density function, y={y_t}, μ={μ_t} and ξ={ξ_t} with t=1,…,T. Usually, independent prior distributions are chosen for the parameters, so that $\pi (\boldsymbol {\theta })=\prod ^{\text {dim}\left (\boldsymbol {\theta }\right)}_{i=1}\pi (\boldsymbol {\theta }_{i})$. Considering that, conditionally on μ, the observations y_t are serially independent and that the state process follows Markovian time dynamics, Eq. (5) can be written as:

$$\begin{array}{@{}rcl@{}} \pi(\boldsymbol{\theta},\boldsymbol{\xi},\boldsymbol{\mu}|{\boldsymbol{y}}) &\propto& \left(\prod^{T}_{t=1}\pi({\boldsymbol{y}}_{t}|\boldsymbol{\mu}_{t})\right) \left(\prod^{T}_{t=1}\pi(\boldsymbol{\mu}_{t}|\boldsymbol{\xi}_{t},\boldsymbol{\theta})\right)\\&\times& \left(\pi(\boldsymbol{\xi}_{1}|\boldsymbol{\theta})\prod^{T}_{t=2}\pi(\boldsymbol{\xi}_{t} \mid \boldsymbol{\xi}_{t-1},\boldsymbol{\theta})\right)\pi(\boldsymbol{\theta}) \end{array} $$

As the distribution π(θ,ξ,μ|y) has no analytic solution, approximation methods are needed to make inference on it. From a Bayesian perspective, the most common approach is to base inference on Markov Chain Monte Carlo (MCMC) methods [17]. However, it is possible to use the Integrated Nested Laplace Approximation (INLA) method, proposed by Rue et al. [18], as an alternative to MCMC methods. The main advantage of INLA over MCMC is computational, as the algorithm rapidly produces accurate approximations to the posterior marginal distributions of the latent variables, as well as of the hyperparameters.

Unlike MCMC, where posterior inference is based on simulations, the INLA method approximates the distributions of interest directly with closed-form expressions. Therefore, the convergence diagnostics required by MCMC methods are not an issue. The main objective of the INLA approach is to approximate the posterior marginal distributions of the latent field and of the hyperparameters, given by:

$$\pi(\xi_{i}|\boldsymbol{y}) = \int \pi(\xi_{i}|\boldsymbol{\theta},\boldsymbol{y})\,\pi(\boldsymbol{\theta}|\boldsymbol{y})\,d\boldsymbol{\theta} \tag{6}$$

$$\pi(\theta_{j}|\boldsymbol{y}) = \int \pi(\boldsymbol{\theta}|\boldsymbol{y})\,d\boldsymbol{\theta}_{-j}. \tag{7}$$

This approach is based on an efficient combination of Laplace approximations for the full conditional distributions π(θ|y) and π(ξ_i|θ,y), i=1,…,n, and numerical integration routines to integrate out the hyperparameters θ.

The INLA method as proposed in Rue et al. [18] includes three main approximation steps to obtain the marginal posteriors in (6) and (7). The first step entails approximating the posterior of the hyperparameters, π(θ|y). First, it is necessary to obtain an approximation of the full conditional distribution of ξ, π(ξ|y,θ), using a multivariate Gaussian density $\widetilde {\pi }_{G}(\boldsymbol {\xi }|\boldsymbol {y},\boldsymbol {\theta })$ (see Rue and Held [16] for further details) and evaluate it at its mode. Then, the posterior density of θ is approximated using the Laplace approximation

$$\widetilde{\pi}(\boldsymbol{\theta}|\boldsymbol{y})\propto \left.\frac{\pi(\boldsymbol{\xi},\boldsymbol{\theta},\boldsymbol{y})}{\widetilde{\pi}_{G}(\boldsymbol{\xi}|\boldsymbol{\theta},\boldsymbol{y})}\right|_{\boldsymbol{\xi}=\boldsymbol{\xi}^{*}\left(\boldsymbol{\theta}\right)},$$

where ξ^∗(θ) is the mode of the full conditional of ξ for a given θ. Since there is no exact closed form for ξ^∗(θ), an optimization scheme is necessary. Rue et al. [18] calculated this mode using the Newton-Raphson algorithm. The posterior $\widetilde {\pi }(\boldsymbol {\theta }|\boldsymbol {y})$ will be used later to integrate out the uncertainty with respect to θ when approximating the posterior marginal of ξ_i.
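
The role of the Newton-Raphson search in locating the mode, and of the curvature at the mode in defining the Gaussian approximation, can be sketched on a one-dimensional toy problem. The model below (a single latent ξ with a Gaussian prior and one Poisson observation with log link) is an assumption made for illustration; it is far simpler than the latent field in the study, and the function name and input values are hypothetical.

```python
import math


def gaussian_approximation(y, tau, tol=1e-10, max_iter=100):
    """Mode and curvature-based variance of pi(xi | y) for a toy model.

    Toy model (assumed for illustration): xi ~ N(0, 1/tau) and
    y | xi ~ Poisson(exp(xi)), so that
    log pi(xi | y) = y*xi - exp(xi) - tau*xi^2/2 + const.
    """
    xi = 0.0
    for _ in range(max_iter):
        grad = y - math.exp(xi) - tau * xi
        hess = -math.exp(xi) - tau   # always negative: the target is log-concave
        step = grad / hess
        xi -= step                   # Newton-Raphson update
        if abs(step) < tol:
            break
    variance = 1.0 / (math.exp(xi) + tau)  # inverse of the negative Hessian at the mode
    return xi, variance


mode, variance = gaussian_approximation(y=7, tau=1.0)
print(f"mode = {mode:.4f}, variance = {variance:.4f}")
```

The Gaussian density with this mean and variance plays the role of the Gaussian approximation in the denominator of the Laplace approximation; in the full method the same idea is applied to the whole latent vector ξ for each value of θ.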

The second step involves calculating the Laplace approximation of the full conditionals π(ξ_i|y,θ) for some values of θ. These values will be used as evaluation points in the numerical integration to obtain the posterior marginals of ξ_i in (6). The distribution of π(ξ_i|θ,y) is approximated using the Laplace approximation defined by

$$\widetilde{\pi}_{LA}\left(\xi_{i}|\boldsymbol{\theta},\boldsymbol{y}\right)\propto \left.\frac{\pi\left(\boldsymbol{\xi},\boldsymbol{\theta},\boldsymbol{y}\right)}{\widetilde{\pi}_{G}\left(\boldsymbol{\xi}_{-i}|\xi_{i},\boldsymbol{\theta},\boldsymbol{y}\right)}\right|_{\boldsymbol{\xi}_{-i}=\boldsymbol{\xi}^{*}_{-i}\left(\xi_{i},\boldsymbol{\theta}\right)}, \tag{8}$$

where ξ_−i is the vector ξ with the i-th component omitted, $\widetilde {\pi }_{G}\left (\boldsymbol {\xi }_{-i}|\xi _{i},\boldsymbol {\theta },\boldsymbol {y}\right)$ is the Gaussian approximation of π(ξ_−i|ξ_i,θ,y), considering ξ_i as fixed (observed) and $\boldsymbol {\xi }^{*}_{-i}\left (\xi _{i},\boldsymbol {\theta }\right)$ is the mode of π(ξ_−i|ξ_i,θ,y).

The approximation of π(ξ_i|θ,y) using (8) can be expensive, since $\widetilde {\pi }_{G}\left (\boldsymbol {\xi }_{-i}|\xi _{i},\boldsymbol {\theta },\boldsymbol {y}\right)$ has to be recalculated for each value of ξ_i and θ. Rue et al. [18] proposed two cheaper alternatives to obtain these distributions. The first one is the Gaussian approximation $\widetilde {\pi }_{G}\left (\xi _{i}|\boldsymbol {\theta },\boldsymbol {y}\right)$, which provides reasonable results in a short computational time; however, according to Rue and Martino [19], its accuracy can be affected by several factors. These problems can be corrected at moderate computational cost using the second alternative, a simplified version of the Laplace approximation, defined as the series expansion of $\widetilde {\pi }_{\textit {LA}}\left (\xi _{i}|\boldsymbol {\theta },\boldsymbol {y}\right)$ around ξ_i=μ_i(θ), the mean of $\widetilde {\pi }_{G}\left (\xi _{i}|\boldsymbol {y},\boldsymbol {\theta }\right)$ [18].

Finally, the approximations obtained in the two previous steps are combined, and the marginal densities of ξ_i and θ_j are obtained by integrating out the irrelevant terms. The approximation for the marginal of the latent variables can be obtained by

$$\begin{aligned} \pi\left(\xi_{i}|\boldsymbol{y}\right) &= \int\pi\left(\xi_{i}|\boldsymbol{y},\boldsymbol{\theta}\right)\pi\left(\boldsymbol{\theta}|\boldsymbol{y}\right) d\boldsymbol{\theta}\\ &\approx\sum_{k}\widetilde{\pi}\left(\xi_{i}|\theta_{k},\boldsymbol{y}\right)\widetilde{\pi} \left(\theta_{k}|\boldsymbol{y}\right)\;\Delta_{k}, \end{aligned} \tag{9}$$

which is evaluated on a set of grid points θ_k with weights Δ_k, for k=1,2,…,K. According to Rue et al. [18], since the integration points are selected in a regular grid, it is feasible to assume all the weights Δ_k to be equal. A similar numerical integration procedure is used to evaluate the marginals π(θ_j|y). Since the dimension of θ is small (less than or equal to seven), these numerical routines are effective in returning a discretized representation of the marginal posteriors.
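
The weighted sum in Eq. (9) can be illustrated with a toy model in which both ingredients are available in closed form. The model, the grid and the observation value below are assumptions chosen for illustration and are not the model fitted in this study.

```python
import math


def normal_pdf(x, mean, var):
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def marginal_of_xi(y, theta_grid):
    """Return (density, weights), where density(xi) approximates pi(xi | y) as in Eq. (9).

    Toy model (assumed for illustration): y | xi ~ N(xi, 1),
    xi | theta ~ N(0, exp(theta)), theta ~ N(0, 1).
    """
    # Unnormalized pi(theta_k | y) = pi(y | theta_k) * pi(theta_k) on the grid.
    unnorm = [normal_pdf(y, 0.0, 1.0 + math.exp(t)) * normal_pdf(t, 0.0, 1.0)
              for t in theta_grid]
    total = sum(unnorm)
    weights = [u / total for u in unnorm]  # equal Delta_k cancel on normalization

    def density(xi):
        value = 0.0
        for t, w in zip(theta_grid, weights):
            post_var = 1.0 / (1.0 + math.exp(-t))  # Var(xi | theta, y)
            value += w * normal_pdf(xi, post_var * y, post_var)
        return value

    return density, weights


theta_grid = [-4.0 + 0.5 * k for k in range(17)]  # regular grid on [-4, 4]
density, weights = marginal_of_xi(y=1.3, theta_grid=theta_grid)
print(f"approximate pi(xi = 0.5 | y) = {density(0.5):.4f}")
```

The resulting marginal is a finite mixture of the conditional densities, with mixture weights proportional to the approximate posterior of θ at each grid point.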

A good choice of the set θ_k of evaluation points is important for the accuracy of the above numerical integration steps. Rue et al. [18] suggest computing the negative Hessian matrix S at the mode θ^∗ of $\widetilde {\pi }\left (\boldsymbol {\theta }|\boldsymbol {y}\right)$ and considering the eigendecomposition S⁻¹=QΛQ^T. Then, θ is defined through a standardized variable z, such that:

$$z = \Lambda^{-1/2}Q^{T}\left(\theta-\theta^{*}\right) \quad \text{or} \quad \theta(z)=\theta^{*}+Q\Lambda^{1/2}z$$

and a collection Z of z values is obtained, such that the corresponding θ(z) points are located around the mode θ^∗. Starting from z=0 (θ=θ^∗), each component entry of z is searched in the positive and negative directions in step sizes of η_z. All z points that satisfy

$$\begin{array}{@{}rcl@{}} \text{log}\;\widetilde{\pi}\left(\boldsymbol{\theta}(0)|\boldsymbol{y}\right)-\text{log}\;\widetilde{\pi} \left(\boldsymbol{\theta}(z)|\boldsymbol{y}\right)<\eta_{\pi} \end{array} $$

are included in Z. The set of evaluation points is based on the values in Z. The values of η_z and η_π must be calibrated appropriately in order to produce accurate approximations.
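
The selection of evaluation points can be sketched as follows. This is a simplified illustration rather than the exact search used by the method: it scans a full regular grid in z instead of exploring each axis outward from the mode, it takes Q and Λ as already computed, and it uses a toy Gaussian log-posterior. The step size, threshold and all numerical values are hypothetical.

```python
import itertools
import math


def explore_grid(log_post, theta_star, q, lam, eta_z=1.0, eta_pi=3.0, max_steps=5):
    """Collect z-grid points whose log-density drop from the mode is below eta_pi.

    q holds the eigenvectors (as columns) and lam the eigenvalues of the inverse
    negative Hessian at the mode. Simplified full-grid scan, for illustration.
    """
    dim = len(theta_star)

    def theta_of(z):
        # theta(z) = theta* + Q Lambda^{1/2} z
        return [theta_star[i] + sum(q[i][j] * math.sqrt(lam[j]) * z[j] for j in range(dim))
                for i in range(dim)]

    reference = log_post(theta_star)
    offsets = [k * eta_z for k in range(-max_steps, max_steps + 1)]
    kept = []
    for z in itertools.product(offsets, repeat=dim):
        theta = theta_of(z)
        if reference - log_post(theta) < eta_pi:
            kept.append((z, theta))
    return kept


# Toy Gaussian log-posterior with known mode and covariance Q Lambda Q^T.
angle = math.pi / 6
q = [[math.cos(angle), -math.sin(angle)], [math.sin(angle), math.cos(angle)]]
lam = [0.25, 4.0]
theta_star = [1.0, -2.0]


def log_post(theta):
    d = [theta[0] - theta_star[0], theta[1] - theta_star[1]]
    # Rotate into the eigenbasis, then scale: u = Lambda^{-1/2} Q^T d.
    u = [(q[0][j] * d[0] + q[1][j] * d[1]) / math.sqrt(lam[j]) for j in range(2)]
    return -0.5 * (u[0] ** 2 + u[1] ** 2)


points = explore_grid(log_post, theta_star, q, lam)
print(len(points), "integration points")
```

Working in the standardized variable z means that a single step size covers directions in which the posterior is narrow and directions in which it is wide, so the retained points follow the shape of the posterior around the mode.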

In the present work, the SPDE approach was used together with the INLA method. All analyses were conducted in R version 3.0.1 [20] using the R-INLA package [21].

Results

Summaries of the posterior means of the model’s fixed effects and their respective 95 % credibility intervals (CI) are shown in Table 2. The “interaction” component, whenever shown, represents the effect of the interaction between accumulated rainfall in the two weeks prior to collection and the minimum temperature in the week prior to collection on the number of eggs. In the presence of statistically significant interaction, the primary effects of variables involved in the interaction terms are not interpreted, in order to avoid erroneous conclusions. All the models fit the data reasonably well, with 58 to 88 % of the validation dataset encompassed by the 95 % CI. The validation analysis could capture the oviposition pattern for all SAs, except for the highest number of eggs. Additional file 1 shows the validation analysis per trap for one particular SA (SA6).

Table 2 Individual models

The positivity index ranged from 0.38 to 0.78. The area with the lowest positivity was SA8, and the most “attractive” for oviposition was SA2. Border distance was significantly associated with a reduction in the number of eggs in SA2 and with an increase in SA7. Total accumulated rainfall (Rainfall) was important in explaining the increase in the number of eggs in all SAs except SA4. Minimum temperature (Tmin) also contributed significantly to explaining egg frequency in all SAs except SA8.

Interaction effects between climate variables in SA1 and SA3 suggest that the effect of the quantity of accumulated rainfall (lag 2) on egg abundance changes as a function of the minimum temperature (lag 1). Figure 4 shows how the interaction between these variables influenced egg density, highlighting the change in direction of the effects for values above or below 26.3 mm of rain and 24.6 °C minimum temperature in SA1. In this area, an increase in the number of eggs could be expected only if the total rainfall was more than 26.3 mm, and minimum temperatures fell below 24.6 °C. In SA3, the effects changed direction according to the thresholds of 14.5 mm and 23.8 °C, so that rainfall greater than 14.5 mm and temperatures below 23.8 °C favored a reduction in egg density.
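
One common way to obtain thresholds of this kind from a model with an interaction term is to find the value of one covariate at which the slope of the other changes sign. The sketch below shows that calculation under stated assumptions: the source does not describe how its thresholds were computed, and the coefficients and scaling constants used here are hypothetical, not taken from Table 2.

```python
def sign_change_threshold(beta_main, beta_interaction, mean, sd):
    """Value of the other covariate at which a main effect changes sign.

    With log(mu) = ... + b1*x1 + b2*x2 + b12*x1*x2 on standardized covariates,
    the slope in x1 is b1 + b12*x2, which is zero at x2 = -b1 / b12.
    The result is converted back to the original measurement scale.
    """
    if beta_interaction == 0.0:
        raise ValueError("no interaction, so the slope never changes sign")
    return mean + sd * (-beta_main / beta_interaction)


# Hypothetical coefficients and scaling constants, not taken from Table 2.
b_rain, b_tmin, b_inter = 0.10, 0.20, -0.08
tmin_mean, tmin_sd = 21.5, 2.4   # degrees Celsius
rain_mean, rain_sd = 20.0, 25.0  # millimetres

tmin_threshold = sign_change_threshold(b_rain, b_inter, tmin_mean, tmin_sd)
rain_threshold = sign_change_threshold(b_tmin, b_inter, rain_mean, rain_sd)
print(f"rainfall effect changes sign at Tmin = {tmin_threshold:.1f} C")
print(f"Tmin effect changes sign at rainfall = {rain_threshold:.1f} mm")
```

Because covariates were standardized before fitting, the threshold found on the standardized scale has to be mapped back with the mean and standard deviation of the original variable, as done in the last line of the function.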

The temporal correlation between the weekly average number of eggs was moderate, varying from 0.29 (SA8) to 0.46 (SA2 and SA4). The variance of the nugget effect, or measurement error, ranged from 0.02 (SA7) to 0.09 (SA5 and SA8). The posterior mean of the spatial effect variance ranged from 1.04 (SA8) to 1.79 (SA4). In all SAs, more variation is explained by the spatial term than by the measurement error. The empirical range varied from 21.23 m in SA2 to 34.19 m in SA8 (the maximum distances between traps in these areas were 182.6 and 623.8 m, respectively). As these are the distances at which correlation is close to 0.1, we can conclude that the data feature moderate spatial correlation, which slowly decreases with distance.

The mean spatial egg distribution during the study period is presented in Fig. 5. In each area, filled circles indicate the trap locations. In general, the highest concentration of eggs was found in areas that bordered settlements with high population densities (slums), and in those close to campus buildings, where pedestrian traffic was heavier. Within each area, egg distribution was heterogeneous. Additional file 2 shows weekly changes in the number of eggs in SA1 throughout the study period using an animated map.

The hierarchical model estimated an average positivity of 62.2 %. Border distance was not important in explaining the variation in the number of eggs, so it was not included in the final model. The accumulated rainfall in the two weeks prior to trap installation was statistically significant, but neither the minimum temperature in the previous week nor their interaction was. As expected, higher rainfall (β₂=0.08 [ 95 % CI: 0.01−0.16]) was associated with larger quantities of eggs. The variability attributed to the spatially structured effect ($\sigma ^{2}_{\omega } = 8.98$ [ 95 % CI: 6.98−12.42]) was greater than that attributed to measurement error ($\sigma ^{2}_{\varepsilon } = 0.17$ [ 95 % CI: 0.15−0.19]), while the temporal correlation (a) was 0.77 (0.71 - 0.83). The variance attributable to the random effect was 0.02 (0.02 - 0.03), suggesting homogeneity among SAs. The estimated empirical range (ρ=37.27 [ 95 % CI: 32.25−42.13]) suggests that the results from the hierarchical model are similar to those of the individual models. The proportion of the validation dataset encompassed by the 95 % CI was 57.8 %.
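
Since the model uses a log link (Eq. 2) and covariates were standardized, the rainfall coefficient can be read as a multiplicative effect on the expected egg count, given oviposition, per one standard deviation of rainfall. The short sketch below converts the reported posterior mean and interval bounds to percentage changes; the function name is hypothetical, and the conversion is a standard reading of a log-link coefficient rather than a result stated in the source.

```python
import math


def percent_change(beta):
    """Percentage change in the expected count per one-unit covariate increase."""
    return 100.0 * math.expm1(beta)


# Posterior mean and 95 % interval bounds for rainfall reported in the text.
for label, beta in (("mean", 0.08), ("lower", 0.01), ("upper", 0.16)):
    print(f"{label}: {percent_change(beta):.1f} %")
```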

Discussion

Due to the complexity of estimation involved in spatio-temporal modeling, these dimensions of variation inherent to many epidemiological processes are rarely analyzed together. The present methodology for estimation allows for the rapid and efficient implementation of these models, while considering 1) zero-inflation within SA, 2) interaction among climate variables and 3) different oviposition patterns over the course of the study period. This methodology can contribute to the identification of priority areas for entomological surveillance and targeted fieldwork.

Oviposition activity occurred over the entire course of the study but varied among SAs. The highest ovitrap positivity was 78 % (76–79 %) in SA2, while the lowest one was 38 % (36–38 %) in SA8. The remaining SAs presented ovitrap positivities varying between 55 and 67 % (Table 2). As expected, summertime featured the greatest infestation.

The distance between ovitraps and the border of the campus was assessed based on the hypothesis that mosquitoes came from outside the campus [8]. If this were true, a significant negative effect would be found. However, such a result was only observed in SA2, which borders a densely populated slum. All the other SAs presented a nonsignificant or inverted (SA7) relationship. This opposite trend suggests that the eggs captured in SA7 are not from outside mosquitoes, but from mosquitoes established inside. Within the campus, the greatest egg collection occurred in areas close to buildings and passages, in comparison to more isolated areas. Indeed, on the same campus, Honório et al. [8] found a statistically significant relationship between the number of mosquito larvae in artificial breeding sites within the campus and the distance to the border in both the wet and dry seasons. The anthropophilic behavior of Aedes aegypti is well documented.

A model containing the interaction between temperature and rain was more informative and provided thresholds that could be used to issue alerts. However, the threshold values differed between areas, and the interaction was significant in only two areas, SA1 and SA3 (Fig. 4). This variation suggests local microclimate effects not captured by the single meteorological station and can be attributed to the particular characteristics of each area (Table 1). Minh An and Rocklöv [22] evaluated the effect of the interaction between rain and temperature on the number of dengue cases in Hanoi, Vietnam. They showed that rain occurrence and temperatures between 15 and 30 °C correlated with an increase in the number of dengue cases.

Total precipitation accumulated over the two weeks prior to collection contributed to the increase in the number of eggs in five of the SAs, located in areas with both high and low levels of vegetation. SA1 and SA3 also featured the effect of rain, but this effect interacted with temperature. Only SA4 showed no association between the number of eggs and rainfall. The relationship between precipitation and the proliferation of A. aegypti is likely to vary on small geographical scales [23]. Rain contributes to the formation of breeding habitats, but this effect will depend on the local availability of containers [24]. During heavy rainfall, these water containers offer favorable conditions for oviposition and the development of immature mosquitoes.
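The lagged rainfall covariate described above can be sketched as follows. This is an illustration under stated assumptions: rainfall is taken as weekly totals, the two-week window is assumed to exclude the collection week itself, and the function name and values are hypothetical.

```python
def accumulated_rainfall(weekly_rain, n_weeks=2):
    """Rainfall summed over the `n_weeks` weeks before each collection week.

    weekly_rain[i] is the total rainfall (mm) in week i. The value for
    week i sums weeks i - n_weeks .. i - 1; weeks that lack a full
    window of preceding data get None.
    """
    out = []
    for i in range(len(weekly_rain)):
        if i < n_weeks:
            out.append(None)  # not enough history yet
        else:
            out.append(sum(weekly_rain[i - n_weeks:i]))
    return out

# Illustrative weekly totals in mm (not data from the study).
weekly_rain_mm = [10.0, 0.0, 35.5, 4.5, 0.0]
rain_lagged = accumulated_rainfall(weekly_rain_mm)
```

The same helper with `n_weeks=1` gives a one-week lag, which is the kind of shift implied when a covariate is related to eggs deposited over the following week.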

A significant positive relationship between minimum temperature and the number of eggs deposited over the following week was found in almost all SAs. High temperatures increase the rate at which mosquito larvae develop, leading to the rapid subsequent development of adult life forms of the mosquito. Under such conditions, the frequency of mosquito bites in humans also increases [25, 26]. Honório et al. [27] also found a significant positive effect of temperature on A. aegypti egg density in three neighborhoods of the municipality of Rio de Janeiro.

In the present study, we identified spatial patterns consistent with human population in all sentinel areas. Duncombe et al. [28] showed that vector density tends to correlate directly with high population density. A Colombian study also reinforced these findings [29]. The presence or absence of spatial correlation could be influenced by the distance between traps. Our findings suggest that ideal entomological surveillance should occur with weekly visits to traps located between 18 and 24 m apart. This result is important to guide the implementation of ovitrap surveillance systems, but also presents a challenge due to the high sampling effort required. Surveillance is often carried out using more sparsely distributed traps, from 50 m to 200 m apart [7]. One way to overcome this problem is to cluster traps in sentinel areas, as done in this study.
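To illustrate why trap spacing matters when spatial correlation is short-ranged, the sketch below evaluates a Matérn correlation function at several distances. It rests on assumptions not confirmed by the text: that the spatial field is Matérn with smoothness ν = 1 (a common choice for two-dimensional SPDE models), that the range is defined as ρ = sqrt(8ν)/κ, and that a range of 20 m is a reasonable illustrative value within the 18–24 m interval. The Bessel function is integrated numerically so that the example needs only the standard library.

```python
import math

def bessel_k1(x, upper=12.0, steps=6000):
    """Modified Bessel function K_1(x) by trapezoidal integration of
    K_1(x) = integral_0^inf exp(-x cosh t) cosh t dt.
    Adequate for roughly x >= 0.01 with these settings."""
    h = upper / steps
    total = 0.0
    for i in range(steps + 1):
        t = i * h
        term = math.exp(-x * math.cosh(t)) * math.cosh(t)
        total += term if 0 < i < steps else 0.5 * term
    return total * h

def matern_correlation(distance_m, range_m):
    """Matérn correlation with smoothness nu = 1 (assumed), written in
    terms of the range rho = sqrt(8 * nu) / kappa."""
    if distance_m == 0:
        return 1.0
    kappa = math.sqrt(8.0) / range_m
    return kappa * distance_m * bessel_k1(kappa * distance_m)

# Correlation between two traps at increasing separations,
# for an illustrative range of 20 m.
for d in (5, 10, 20, 40):
    print(d, round(matern_correlation(d, range_m=20.0), 3))
```

With this definition the correlation has dropped to a low value (on the order of 0.1) once the separation equals the range, so traps spaced much farther apart than the range would carry almost no information about each other.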

The joint analysis of the sentinel areas (hierarchical model) confirmed only rainfall as a driver of the number of eggs. Nevertheless, the estimates of the spatial and temporal dependence parameters were similar.

The present study had some limitations. Other data, such as wind direction and speed, were not available for analysis, nor were other potentially confounding variables, such as the number and location of breeding sites. Moreover, precipitation data were collected from a weather station 5 km away from the study area. This may introduce bias, as the quantity of rainfall can vary substantially even between nearby areas.

Aside from these limitations, the results suggest that border distance, minimum temperature and precipitation are all associated with the population density of A. aegypti. Maps describing the abundance of eggs identified areas with high potential for transmission, so that control and prevention activities could be developed. The results also indicate the ideal spacing for traps, which constitutes an important aspect of sampling.

Conclusions

Aedes aegypti, due to its short generation time and strong response to climate triggers, tends to show eruptive dynamics that are difficult to predict and understand through purely temporal or purely spatial models. Spatio-temporal modeling has been prohibitive due to the computational costs involved in MCMC-based parameter estimation. Our results suggest that INLA-based inference increases the efficiency of the estimation process enough to allow its calculation within the time frame expected for any surveillance program. Unlike other ovitrap surveys, the studied dataset consisted of a high density of traps placed at close distances and surveyed very frequently. This design allowed us to assess the spatial and temporal autocorrelation structure of the oviposition process. The short range of these correlations supports the notion that a high sampling effort is necessary to capture the spatio-temporal patterns of mosquito activity, for example, the identification of hotspots. The extrapolation of these results to other areas must be done with care. Ideally, this study should be replicated in other settings. However, despite the specific results obtained, we believe this framework (zero-inflation + truncated Poisson model + INLA-based inference) is an efficient way to model oviposition dynamics anywhere.
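The "zero-inflation + truncated Poisson" component of this framework can be illustrated with a zero-altered (hurdle) Poisson probability function. The sketch below is a generic textbook form under assumed parameter names (`p_zero`, `mu`); it does not include the covariates or the spatio-temporal random effects of the fitted models, and the parameter values are illustrative.

```python
import math

def zap_pmf(y, p_zero, mu):
    """Zero-altered (hurdle) Poisson probability of observing y eggs.

    p_zero: probability that a trap holds no eggs.
    mu: mean of the Poisson distribution before truncation at zero.
    """
    if y == 0:
        return p_zero
    # Poisson probability computed on the log scale for stability.
    poisson = math.exp(-mu + y * math.log(mu) - math.lgamma(y + 1))
    # Rescale so that the positive counts share probability 1 - p_zero.
    return (1.0 - p_zero) * poisson / (1.0 - math.exp(-mu))

def zap_mean(p_zero, mu):
    """Expected count: (1 - p_zero) * mu / (1 - exp(-mu))."""
    return (1.0 - p_zero) * mu / (1.0 - math.exp(-mu))

# Sanity check with illustrative values: probabilities sum to one.
total = sum(zap_pmf(y, p_zero=0.4, mu=25.0) for y in range(0, 200))
```

Separating the zero probability from the positive counts lets the two parts respond to different drivers, which is the property that makes this family convenient for trap data with many empty traps.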

Abbreviations

BI: Breteau index
Bin: Binomial distribution
CI: Credibility intervals
DIC: Deviance information criterion
Fiocruz: Oswaldo Cruz Foundation
GF: Gaussian field
GMRF: Gaussian Markov random field
HI: Household infestation index
ICICT: Institute of Scientific and Technological Communication and Information in Health
INLA: Integrated nested Laplace approximation
MCMC: Markov chain Monte Carlo methods
NOSMOVE: Sentinel Operational Unit of Mosquito Vectors
Rainfall: Total accumulated rainfall
SA: Sentinel areas
SPDE: Stochastic partial differential equations
Tmin: Minimum temperature
VPAAPS: Vice-Presidency of Environment, Healthcare and Health Promotion
ZAP: Zero-altered Poisson distribution

References

Ministério da Saúde. Secretaria de vigilância em Saúde. Diretoria Técnica de Gestão: Levantamento Rápido de índices Para Aedes Aegypti - LIRAa - Para Vigilância Entomológica do Aedes Aegypti No Brasil: Metodologia Para Avaliação Dos índices de Breteau e Predial e Tipo de Recipientes. Brasil; 2013.
Braga IA, Valle D. Aedes aegypti: vigilância, monitoramento da resistência e alternativas de controle no brasil. Epidemiologia e Serviços de Saúde. 2007; 16(4):295–302.
Teixeira MG, Barreto ML, Costa MCN, Ferreira LDA, Vasconcelos PFC. Avaliação de impacto de ações de combate ao Aedes aegypti na cidade de Salvador, Bahia. Rev Bras Epidemiol. 2002; 5:108–15.
Gomes AC. Medidas dos níveis de infestação urbana para aedes (stegomyia) aegypti e aedes (stegomyia) albopictus em programa de vigilância entomológica. Informativo Epidemiológico do SUS. 1998; 5:49–57.
Fay RW, Eliason DA. A preferred oviposition site as a surveillance method for aedes aegypti. Mosq News. 1966; 26:531–5.
Westaway EG, Blok J. Taxonomy and evolutionary relationships of flaviviruses. In: Gubler DJ, Kuno G, editors. Dengue and Dengue Hemorrhagic Fever. Wallingford, UK: CAB International; 1997. p. 147–174.
Codeço CT, Lima AWS, Araújo SC, Lima JBP, Maciel-de-Freitas R, Honório NA, et al. Surveillance of Aedes aegypti: Comparison of house index with four alternative traps. PLoS Neglected Tropical Diseases. 2015; 9(2):e0003475.
Honório NA, Castro MG, Barros FS, Magalhães MA, Sabroza PC. The spatial distribution of Aedes aegypti and Aedes albopictus in a transition zone, Rio de Janeiro, Brazil. Cad Saude Publica. 2008; 25(6):1203–14.
Sistema Alerta Rio. http://alertario.rio.rj.gov.br/.
Zuur AF. Zero Inflated Models and Generalized Linear Mixed Models with R. United Kingdom: Highland Statistics Limited; 2012.
Cressie NAC. Statistics for Spatial Data. Revised Edition. Hoboken, NJ, USA: John Wiley & Sons, Inc; 1993.
Lindgren F, Rue H, Lindström J. An explicit link between gaussian fields and gaussian markov random fields: the stochastic partial differential equation approach. J R Stat Soc Ser B (Stat Methodol). 2011; 73(4):423–98.
Spiegelhalter DJ, Best NG, Carlin BP, Van Der Linde A. Bayesian measures of model complexity and fit. J R Stat Soc Ser B (Stat Methodol). 2002; 64(4):583–639. doi:10.1111/1467-9868.00353.
How Are the Deviance Information Criteria (DIC) and The Watanabe-Akaike Information Criterion (WAIC) computed? http://www.r-inla.org/faq#TOC-How-are-the-Devicance-Information-Criteria-DIC-and-The-Watanabe-Akaike-information-criterion-WAIC-computed-/.
Cameletti M, Lindgren F, Simpson D, Rue H. Spatio-temporal modeling of particulate matter concentration through the spde approach. Adv Stat Anal. 2013; 97(2):109–31.
Rue H, Held L. Gaussian Markov Random Fields: Theory and Applications. Monographs on Statistics and Applied Probability, vol. 104. London: Chapman & Hall; 2005.
Gamerman D, Lopes HF. Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference. London, UK: Chapman & Hall; 2006.
Rue H, Martino S, Chopin N. Approximate bayesian inference for latent gaussian models by using integrated nested laplace approximations. J R Stat Soc Ser B (Stat Methodol). 2009; 71(2):319–92.
Rue H, Martino S. Approximate Bayesian inference for hierarchical Gaussian Markov random field models. Journal of Statistical Planning and Inference. 2007; 137(10):3177–3192.
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2013. http://www.R-project.org/.
Rue H, Martino S. INLA: Functions Which Allow to Perform a Full Bayesian Analysis of Structured Additive Models Using Integrated Nested Laplace Approximation. 2009. http://www.r-inla.org/.
Minh An DT, Rocklöv J. Epidemiology of dengue fever in hanoi from 2002 to 2010 and its meteorological determinants. Global Health Action. 2014; 7:23074.
Halstead SB. Dengue virus - mosquito interactions. Annu Rev Entomol. 2008; 53:273–91.
Alto BW, Juliano SA. Precipitation and temperature effects on populations of aedes albopictus (diptera: Culicidae): implications for range expansion. J Med Entomol. 2001; 38(5):646–56.
Focks DA, Haile DG, Daniels E, Mount GA. Dynamic life table model for aedes aegypti (diptera: Culicidae): simulation results and validation. J Med Entomol. 1993; 30:1018–28.
Jansen CC, Beebe NW. The dengue vector aedes aegypti: what comes next. Microbes Infect. 2010; 12:272–9.
Honório NA, Codeço CT, Alves FC, Magalhães MA, Lourenço-D-Oliveira R. Temporal distribution of aedes aegypti in different districts of rio de janeiro, brazil, measured by two types of traps. J Med Entomol. 2009; 46(5):1001–1014.
Duncombe J, Clements A, Davis J, Hu W, Weinstein P, Ritchie S. Spatiotemporal patterns of aedes aegypti populations in cairns, Australia: assessing drivers of dengue transmission. Tropical Med Int Health. 2013; 18(7):839–49.
Padmanabha H, Durham D, Correa F, Diuk-Wasser M, Galvani A. The interactive roles of aedes aegypti super-production and human density in dengue transmission. PLoS Negl Trop Dis. 2012; 6(8):1799.

Acknowledgements

The authors would like to thank the Sentinel Operational Unit of Mosquito Vectors (NOSMOVE/Fiocruz) for data collection, processing and tabulation, as well as the technical team from the Geoprocessing Laboratory at the Health Institute of Scientific and Technological Communication and Information (ICICT) at the Oswaldo Cruz Foundation for generating the campus map, border distance and percent ground cover. This project was funded by the Dengue Fiocruz Network.

Author information

Authors and Affiliations

Sergio Arouca National School of Public Health, Oswaldo Cruz Foundation, Rua Leopoldo Bulhões 1.480, Rio de Janeiro, Brazil
Ana Carolina C. Costa
National Institute of Women, Children and Adolescents Health Fernandes Figueira, Department of Clinical Research Oswaldo Cruz Foundation, Avenida Rui Barbosa 716, Rio de Janeiro, Brazil
Ana Carolina C. Costa
Scientific Computing Program, Oswaldo Cruz Foundation, Avenida Brasil 4365, Rio de Janeiro, Brazil
Cláudia T. Codeço & Aline A. Nobre
Laboratory of Transmitters of Hematozoa, Oswaldo Cruz Institute, Oswaldo Cruz Foundation, Avenida Brasil 4365, Rio de Janeiro, Brazil
Nildimar A. Honório
Sentinel Operational Unit of Mosquito Vectors, Oswaldo Cruz Foundation, Avenida Brasil 4365, Rio de Janeiro, Brazil
Nildimar A. Honório, Gláucio R. Pereira & Carmen Fátima N. Pinheiro

Corresponding author

Correspondence to Ana Carolina C. Costa.

Additional information

Competing interests

The authors declare that there is no conflict of interests.

Authors’ contributions

ACCC performed all analyses, interpreted results, and wrote the first and final drafts of the manuscript. AAN and CTC advised on the statistical analyses and the drafting of the manuscript. NAH performed the study design, coordinated the study and critically revised the manuscript. GRP and CFNP participated in the study design and were responsible for data collection and tabulation. All authors read and approved the final manuscript.

Additional files

Additional file 1

Validation analysis for SA6. Validation analysis per trap for SA6 throughout the study period. (PDF 23.6 Kb)

Additional file 2

Animated map for SA1. Animated map of weekly changes in the number of eggs in SA1 throughout the study period. This file can be viewed with QuickTime Player. (MP4 11980 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

C. Costa, A.C., Codeço, C.T., Honório, N.A. et al. Surveillance of dengue vectors using spatio-temporal Bayesian modeling. BMC Med Inform Decis Mak 15, 93 (2015). https://doi.org/10.1186/s12911-015-0219-6

Received: 22 May 2015
Accepted: 03 November 2015
Published: 13 November 2015
DOI: https://doi.org/10.1186/s12911-015-0219-6