- Research article
- Open Access
- Open Peer Review
- Published:

# Measurement error adjustment in essential fatty acid intake from a food frequency questionnaire: alternative approaches and methods

*BMC Medical Research Methodology*
**volume 7**, Article number: 41 (2007)

## Abstract

### Background

We aimed at assessing the degree of measurement error in essential fatty acid intakes from a food frequency questionnaire and the impact of correcting for such an error on precision and bias of odds ratios in logistic models. To assess these impacts, and for illustrative purposes, alternative approaches and methods were used with the binary outcome of cognitive decline in verbal fluency.

### Methods

Using the Atherosclerosis Risk in Communities (ARIC) study, we conducted a sensitivity analysis. The error-prone exposure – visit 1 fatty acid intake (1987–89) – was available for 7,814 subjects 50 years or older at baseline with complete data on cognitive decline between visits 2 (1990–92) and 4 (1996–98). Our binary outcome of interest was clinically significant decline in verbal fluency. Point estimates and 95% confidence intervals were compared between naïve and measurement-error adjusted odds ratios of decline with every SD increase in fatty acid intake as % of energy. Two approaches were explored for adjustment: (A) External validation against biomarkers (plasma fatty acids in cholesteryl esters and phospholipids) and (B) Internal repeat measurements at visits 2 and 3. The main difference between the two is that Approach B makes a stronger assumption regarding lack of error correlations in the structural model. Additionally, we compared results from regression calibration (RCAL) to those from simulation extrapolation (SIMEX). Finally, using structural equations modeling, we estimated attenuation factors associated with each dietary exposure to assess degree of measurement error in a bivariate scenario for regression calibration of logistic regression model.

### Results and conclusion

Attenuation factors for Approach A were smaller than B, suggesting a larger amount of measurement error in the dietary exposure. Replicate measures (Approach B) unlike concentration biomarkers (Approach A) may lead to imprecise odds ratios due to larger standard errors. Using SIMEX rather than RCAL models tends to preserve precision of odds ratios. We found in many cases that bias in naïve odds ratios was towards the null. RCAL tended to correct for a larger amount of effect bias than SIMEX, particularly for Approach A.

## Background

Food frequency questionnaires (FFQ) have been historically used in large epidemiological studies to assess usual dietary intake of specific nutrients and food groups. They have been repeatedly validated against another dietary assessment method assumed to be more accurate such as multiple 24-hour recalls or food records of food intake [1–3]. The problem inherent in this approach is that the same factors that affect these reference methods (*R*) may also affect the FFQ-based assessments (*Q*), which include over or under-reporting biases of subjects with certain socio-demographic or health-related characteristics. This problem would make it impossible to presume independent random errors in the two methods, which in turn leads to over-estimation of the correlation between the reference method and the FFQ [4]. Hence, to consider a method a gold standard, it should ideally reflect the true values of what we are trying to measure. In nutritional epidemiology, very few biomarkers can be considered as gold standards and are often very expensive to carry out even on relatively small samples. Alloyed gold standards are used instead which are considered to be accurate depictions of the truth, being unbiased in expectation. Moreover, any error associated with them can be assumed to be random and independent of the true unknown value of intake. Compared to self-reports (e.g. multiple 24 hr. recalls), biomarkers are alloyed gold standards that can additionally be assumed to have independent measurement errors from those of the test measure itself (i.e. FFQ). Such restriction on measurement error associations makes biomarkers a desirable target for validation studies. A number of epidemiologic studies however, consider repeat measures of the FFQ as alternative means to correct for measurement error, given that biomarkers have also some drawbacks including their associated random and systematic error components. In fact, a previous study favored this method over external validation in terms of preserving the precision of estimates [5].

The present study aims first at assessing the degree of measurement error in the intake of essential fatty acids using a food frequency questionnaire, by estimating attenuation factors for each exposure. Second, we studied the impact of correcting for such measurement error on bias and precision of odds ratios from a multivariate logistic regression model. For illustrative purposes, we examined the effect of dietary essential fatty acids on clinically significant cognitive decline among older adults, controlling for other potential confounders. For both objectives, two alternative approaches were used: (A) *External validation against two biomarkers* measured at visit 1 among a subset of the Atherosclerosis Risk in Communities (ARIC) study. These concentration biomarkers are the percent of each fatty acid or group of fatty acids out of total fatty acid concentration in the cholesteryl ester (*M*) and phospholipids (*N*) fractions of plasma which were previously shown to be linearly related to dietary intake as assessed by more reliable reference methods such as multiple 24-hour recalls, food records or diet history [6–8]. In fact, one study conducted in UK (Whitehall II) showed that correlation between 7-day food records estimated fatty acids and cholesteryl ester concentration was 0.38 for EPA (one major n-3 highly unsaturated fatty acid), 0.67 for linoleic acid and 0.56 for all polyunsaturated fatty acids. The other studies came to a similar conclusion. Hence, despite the issue of transportability, it can be concluded that M and N are moderately correlated with reference measures of dietary intake (B) *Internal repeat measurement of the FFQ* values at visits 2 and 3. While the former approach eliminates the possibility of correlated errors between test and reference method (one being a self-report and the other a biomarker), the second approach is often more readily available in many large cohort studies. In addition, for the second objective, we contrasted regression calibration with another method known as simulation extrapolation or SIMEX. Previous research indicated that dietary n-3 fatty acids were suggestive of a protective effect against cognitive aging [9–13]. Other studies of concentration biomarkers for n-3 fatty acids indicated a similar association [14–18]. However, a number of other reports could not replicate these findings [19, 20]. An evidence-based report suggested a need to look for the effect of *n*-3 fatty acids on cognitive decline and to define exposure in terms of absolute value of medium chain and long chain fatty acids, as well as the ratio between n-3 and n-6 fatty acids in diet and plasma [21].

Findings from our study and its overall methodology may be used in subsequent analyses to adjust for measurement error in primary regression linking intake of essential fatty acids with various disease outcomes.

## Results

### Fatty Acid Exposure Distribution

Table 1 presents the distribution of fatty acid groups and ratios for the main study as well as the sub-samples with complete biomarker and repeat measure data. Fatty acid groups are expressed as % of total energy for *Q*
_{1}, *Q*
_{2} and *Q*
_{3}. For *M* and *N*, they were expressed as % of total fatty acids in the plasma fraction. A decreasing trend in reported consumption of groups *6P* and *6* was noted. Similarly, the ratio of reported 3H/6H declined almost linearly over visits 1, 2 and 3. Standard deviations of *Q*
_{1} are instrumental in interpreting results for logistic regression model analyses.

### Measurement error in fatty acid exposures: attenuation factors estimation with two alternative approaches

Table 2 presents the attenuation factor estimates from the measurement error models using Approaches A (external validation with biomarkers *M/N*) and B (Internal repeat measurements *Q*
_{2}
*/Q*
_{3}). Our findings suggested that attenuation of bivariate effects was more pronounced for approach A with factors ranging between 0.046 for 6H and 0.426 for 3H/6H. As for Approach B, attenuation factors ranged between 0.456 for 3P and 0.655 for 3H/6H and 3/6. Further, a sensitivity analysis with assumed naïve odds ratios of two for all *j* variables (Log_{e}(OR_{naive}) = 0.693 with SE = 0.10) was conducted to get a feel of the magnitude in attenuation in a measure of association that is commonly used in epidemiologic studies. Equations **4.1–4.4 (Methods section)** were used for this purpose. In many cases, approach A yielded corrected odds ratios that were outside the reliable range (The two odds ratios differed by more than 10-folds) while approach B gave adjusted odds ratios within a reliable range. However, further sensitivity analysis with other values of the naïve estimate of the odds ratio indicated that a value between 1.0 and 1.5 gave more reliable corrected estimates compared to values of 1.5 or higher, particularly for approach A. It is worth noting that these computations are only applicable to bivariate models with one error prone variable.

### Measurement error adjustment in multivariate logistic models: alternative approaches

Figure 1. shows a LOWESS smoothed representation of the behavior of clinically significant cognitive decline in verbal fluency between visits 2 and 4 in our study population (n = 7,814) by age, stratified by sex, education and ethnicity [22]. In general, cognitive decline was shown to affect a greater proportion of individuals for subjects whose baseline age was 60 years or more, and that was particularly true for those with education exceeding high school and among African-Americans. The same patterns were observed when continuous cognitive change was considered (Visit 4 – Visit 2).

As an application to multivariate logistic models, we conducted further analysis using available data on cognitive decline and essential fatty acids and controlling for relevant confounders (Equations 4 and 5). Table 3 presents naïve and corrected odds ratios for each fatty acid group/ratio in relation to decline in WFT using two approaches (external validation and internal repeat measurements) as well as two measurement error correction methods (RCAL and SIMEX). In general, the results showed that RCAL tended to yield wider confidence intervals as compared to SIMEX when contrasting confidence limit ratios, particularly when the naïve estimate was close to the null value of 1. The point estimate indicated bias towards the null value of 1, when comparing the corrected to the naïve for 28 out of the 36 models that were ran. Most of the 8 associations that did not fit this finding showed bias through the null and had naïve odds ratios very close to the null value of 1.00. On average, the absolute bias that is corrected in odds ratios for each approach/method dyad was as follows: Approach A/RCAL (|Δ*bias*| = 0.12); Approach A/SIMEX (|Δ*bias*| = 0.06); Approach B/RCAL (|Δ*bias*| = 0.08); Approach B/SIMEX (|Δ*bias*| = 0.06). Hence, in general, RCAL corrected for more bias than SIMEX did and this was more obvious for Approach A.

For internal repeat measurements (Approach B), results indicated an appreciable loss in precision, given that replicate *Q*
_{2} was measured for only 657 subjects out of the 7,814 who were eligible. Comparing the two approaches, many of the unexpected results with bias through or away from the null occurred in Approach B. In addition, in terms of precision, approach A yielded an average for ratios of confidence limit ratios (CLRs) of 1.22 (1.41 for RCAL and 1.14 for SIMEX), compared to 1.62 (1.75 for RCAL and 1.48 for SIMEX) for approach B, which indicates a greater overall precision in approach A.

To illustrate graphically the SIMEX procedure that allows for correction of the point estimate of effect (*β*
_{1} = Log_{e}OR), Figure 2 shows a SIMEX plot of the association between *3H* (z-score) and odds of decline in WFT corrected using both approaches A (concentration biomarkers M and N) and B (replicate measures of Q_{1}: Q_{2} and/or Q_{3}). In both cases, *6H* was included as an error prone confounder and the effect of measurement error on its associated regression coefficient is also presented. It is clear that while approach A revealed an underestimation of the protective effect (O R< 1; *β*
_{1} = Log_{e}(OR) < 0, assuming the true value of 1) of *3H* on WFT decline by an absolute value of 0.09 between the naïve and the SIMEX coefficients, approach B had a discrepancy of no more than 0.05. The same difference in underestimation of adverse effect (OR > 1; *β*
_{1} = Log_{e}(OR) > 0) is noted for *6H*.

## Discussion

Our main findings suggested that estimates were appreciably less precise for internal repeat measurements when compared to external validation with biomarkers. However, they gave more conservative results regarding the association between fatty acid groups and ratio with cognitive decline in WFT. For instance, based on biomarker/RCAL results, one could conclude that a 1 SD increase in 3H (n-3 highly unsaturated fatty acids) may reduce the odds of decline by as much as 72% or as little as 10%, with an average of 27%. While the point estimate for the repeat measures approach still indicated a protective effect of 3H, both RCAL and SIMEX analyses resulted in broad confidence intervals that crossed the null value of 1. These findings can be contrasted with the study by Duffy and colleagues [5], who found that repeat measurements actually lead to a better precision when compared to external validation. This finding may be caused by an artifact of sample size difference between *Q*
_{2} (n = 634) and *Q*
_{1} (n = 7,814). However, it is important to note that the main study population differed significantly from the sub-study with external validation data (i.e. biomarkers). In contrast, demographic and lifestyle differences were not as striking for the repeat measurements sub-study, probably due to the relative preservation of racial and ethnic diversity in this sub-group. Hence, transportability of attenuation factors from the validation sub-study is more questionable for the subgroup with available biomarker data (MN whites, n = 2,251). In addition, although loss of precision is an issue, it may also be viewed by some as yielding more conservative estimates of effect particularly if this loss is not substantial, as is the case for SIMEX. Hence, we recommend using SIMEX rather than RCAL whether the choice is to use biomarker measures results (Approach A) or replicate measures (Approach B). As expected, while RCAL gave less precise estimates, however, it corrected for a larger amount of bias in odds ratios, particularly for Approach A.

The present study is also one of the very few attempts to estimate an attenuation factor for essential fatty acids as exposures that can be used subsequently by other researchers for the purpose of correcting for measurement error in bivariate generalized linear models. The approach used was similar to previous research [23–25]. While this article focused on RCAL and SIMEX, other measurement error models utilize the measurement error variance matrix (${\widehat{\Sigma}}_{uu}$), including methods with instrumental variables and maximum-likelihood methods [26]. Aside from cognitive decline, health outcomes that have traditionally been of interest in relation to essential fatty acids and the balance between them include coronary heart disease [27, 28], stroke [29, 30], type II diabetes [31], breast and prostate cancer [32, 33], depression [34, 35], a hypercoagulable profile [36, 37] and COPD [38].

One major implication to measurement error, as stated earlier, is loss of statistical power to detect an exposure-disease association. In fact, the sample size required to detect a specific odds ratio (e.g. OR = 2) is inflated proportionally to the inverse squared attenuation factor. For instance, if the true *λ*
_{j} was 0.2, the sample size, calculated by assuming that *λ*
_{j} is equal to 0.4, should be multiplied by 0.4^{2}/0.2^{2} = 4 to achieve the same nominal power [39]. It is worth noting that because latent variable T (true intake of fatty acids as % of energy intake) is on a z-score standardized scale, the use of the attenuation factor would lead to a calibrated standardized regression model. For a logistic model, an odds ratio is interpreted as increase or decrease in odds of disease with every SD increase in the continuous z-scored exposure.

Some of the main limitations of this study include the lack of a reference method that is known to be more reliable than FFQs in the ARIC study (e.g. multiple 24-hour recalls or food records). However, because of correlated errors between self-report methods, the use of biomarkers has often been cited as a more adequate means to assess the extent of measurement error in a test instrument. Another drawback is the fact that plasma levels of fatty acids in both fractions studied constitute a short-term measure of intake although they have been shown to correlate well with long-term intake [40]. In addition, the lack of certainty as to the nature of the relationship between the biomarkers considered and the intake variables and the potential interaction of these dietary exposures with other nutritional, environmental and genetic factors constitutes a major challenge for interpretation. For this reason, and using structural equations modeling, estimation of measurement error in FFQ derived nutrients took into consideration two approaches, by including repeat measurements at visit 2 and 3 of the FFQ in one and two concentration biomarkers with assumed linear relationship with intake in another. Finally, although there has been evidence of correlation between intake of fatty acids and their levels in the substrates considered in our study, such a correlation does not necessarily render these biomarkers an adequate reflection of long-term fatty acid intake. In fact, the only substrate that has been shown to work as a gold standard is adipose tissue. However, because of the elevated cost and invasiveness of the procedure, studies using adipose tissue fatty acid concentration as an intake biomarker were often of limited sample size and hence correlations obtained had insufficient levels of precision [41, 42]. Another potentially adequate biomarker that was often used to validate medium-term intake of fatty acids is erythrocyte membrane concentration [43, 44].

## Conclusion

Future endeavors to correct for error should make use of structural equations modeling and include as many concentration biomarkers as is available along with other self-reported or biomarker-based reference methods of dietary assessment. However, the choice of biomarkers and interpretation of their variability must be made as to account for biochemical and physiological interactions between dietary, environmental and genetic factors. Moreover, one must be cautious of coupled errors between biological markers and must take into account these correlations when specifying the structural model. Finally, because structural equations modeling makes a strong assumption about joint multivariate normality, often not present, it is crucial for future studies to use newly developed methodologies which appear to be more flexible in many ways [45].

## Methods

### Study subjects

ARIC is a prospective cohort study which aimed at investigating the etiology of atherosclerosis and its clinical sequelae and the longitudinal impact of variation in cardiovascular risk factors, medical care, and disease by race, sex, place, and time. In each of four US communities – Forsyth County (NC), Jackson (MS), suburbs of Minneapolis (MN), and Washington County (MD) – 4,000 adults aged 45–64 years were examined four times, three years apart (visits 1 through 4). Three out of the four cohorts represented the ethnic mix of their communities, while at Jackson, MS, only African American residents were recruited [46]. Out of the total sample examined at baseline (N = 15,792) we restricted these analyses to 11,557 individuals aged 50 years or older at baseline. Eligibility for these analyses further required complete data on cognitive functioning at visits 2 (1990–92) and 4 (1996–98) and also complete dietary intake at visit 1 (1987–89), which yielded n = 7,814 men and women. Of these, plasma fatty acid data at visit 1 was available on a sub-set of the Minneapolis cohort, MN (n = 2,251). Additionally, repeat measures using the same FFQ were conducted among 657 at visit 2 and 7,482 at visit 3 of the 7,814 eligible subjects who had exposure data at baseline and complete outcome assessment. Repeat measures on both visits 2 and 3 were available for 634 of the eligible subjects.

### Outcome assessment

Measures of cognitive functioning were made for visits 2 and 4 of the ARIC study. In our present study, we focus on decline in Word Fluency Test (WFT). This test requires subjects to record as many words as possible using the initial letters F, A and S and to list these words, the subject is given only 60 seconds per letter. The total score corresponds to the total number of words generated during these three trials. The test is particularly sensitive to linguistic impairment [47, 48] and early mental decline in older persons [49]. It is also a sensitive marker of damage in the left lateral frontal lobe [47, 48]. The immediate test-retest correlation coefficient based on an alternate test form has been found to be high (r = 0.82)[50].

Cutoff points were determined for decline in cognitive status WFT test using the Reliable Change Index (RCI) method in order to correct for measurement error and practice effects [51]. RCI is defined as ((X_{2}-X_{1})-(M_{2}-M_{1}))/S.D., where X_{1} is the individual's score at baseline, X_{2} the individual's score at follow-up, M_{1} and M_{2} are the group mean pretest and follow-up scores respectively, and S.D. the observed standard deviation of the difference scores. Scoring below an RCI of -1.645 was regarded as a "statistically reliable" deterioration in the test scores.

### Error-prone dietary exposure

Usual dietary intake was estimated from an interviewer-administered 61-item semi-quantitative food frequency questionnaire (FFQ) previously developed and validated by W. Willet and colleagues against multiple food records among a sub-sample of the Nurse's Health Study cohort [52].

In our study, dietary intake of essential fatty acids and their elongated and desaturated products were expressed as percent of total energy intake and grouped under four main categories, as suggested by Lands and colleagues [53, 54]: **(3P)** n-3 C_{18} polyunsaturated fatty acids: 18:3+18:4n-3 **(6P)** n-6 C_{18} polyunsaturated fatty acids: 18:2+18:3n-6 **(3H)** n-3 C_{20} and C_{22} highly unsaturated fatty acids (HUFAs): 20:5+22:5+22:6n-3 and **(6H)** n-6 HUFAs: 20:3+20:4+22:4+22:5n-6. Sums of fatty acid intake as percent of energy included (3) = (3P)+(3H) and (6) = (6P)+(6H). Ratios of interest included (3P)/(6P), (3H)/(6H) and (3P+3H)/(6P+6H) also denoted as 3/6. In multivariate models, all exposure variables were standardized by subtracting each observation from the variable mean and dividing the difference by the standard deviation. Adjustment was made for the other fatty acid variables when appropriate, and total energy intake was considered as a potential confounder to emulate a multivariate nutrient density model [55]. While other energy adjustment methods were possible, the latter was considered as more amenable to public health implications.

### Concentration biomarkers

Twelve-hour fasting blood was collected according to the ARIC study wide protocol. The Minneapolis field center conducted fatty acid analysis in plasma phospholipid and cholesteryl ester fractions for visit 1 blood specimens (1987–89) among the white segment of the study population in that center. The procedure is described in detail elsewhere [38]. The identity of 28 fatty acid peaks were revealed by gas chromatography by comparing each peak's retention time to the retention times of fatty acids in synthetic standards of known compositions. The relative amount of each fatty acid (as a percent of all fatty acids) could be calculated by integrating the area under the peak and dividing the result by the total area for all fatty acids and multiplying by 100. To minimize transcription errors, data from the chromatogram was transferred electronically to a computer for analysis. Two concentration biomarkers, consisting of the plasma phospholipids and cholesteryl ester level of fatty acids in each of the groups described above, were used to assess measurement error in the FFQ and correct for that error.

### Repeat FFQ measures

Dietary intake was assessed among the surviving ARIC sample at visit 3 (1992–94), using the same FFQ that was administered at baseline. At visit 2 (i.e. 1990–92), a sub-sample of ARIC (around 10% of the original sample) was asked to repeat the FFQ, unlike visits 1 and 3 in which the whole ARIC sample was covered. As stated earlier, of our eligible subset with baseline data on exposure and complete outcome data (n = 7,814), 657 had data on visit 2 exposure, 7,482 had complete data at visit 3, while 634 had both.

### Covariates

Most covariates considered as potential confounders were measured at visits 1 or 2. These included sociodemographics (age, gender, ethnicity, education), genetic (Apo E *ε*4 carrier status), behavioral (smoking, alcohol and caffeine consumption and physical activity), nutritional (body mass index, intake of antioxidants and other micronutrients mainly Vitamins B_{6}, B_{12} and folate). All these were previously shown to be independently predictive of the outcome and associated with our exposure. The distributions of these covariates within each sub-sample considered are presented in Table 4.

### Statistical analysis

**(1)** ** Bivariate scenario: estimation of attenuation factors**. In this part, each essential fatty acid using the two alternative approaches and a structural equations modeling technique.

**(2)**

**. In this part, the association between essential fatty acid intake and clinically significant decline in WFT as an outcome was studied and adjusted for measurement error in exposures. We examined changes in 95% CI and point estimates after correction for error using alternative approaches and methods.**

*Multivariate scenario: regression calibration and Simulation Extrapolation*#### (1) Bivariate scenario: estimation of attenuation factors

Estimating the attenuation factor associated with the effect of each exposure variable or covariate on an outcome constitutes the first step for measurement error adjustment using regression calibration. In our example, and for attenuation factor estimation, the structural models were considered for each of the two approaches **(Eq. 1 and 2)**.

*Approach A*

*Approach B*

This set of equations allowed us to estimate the attenuation factor *λ*
_{j} with its approximate SE, using the delta method (**Eq. 3**).

when Var(Q_{1}) = Var(T) = 1. This is the case when all measured variables (Q_{k}, M, N) are standardized z-scores.

The lower the attenuation factor, the higher the measurement error in the error-prone exposure variable. *λ*
_{j} = 1

if there is no measurement error or Q_{1} = T.

Assuming non-differential misclassification, dietary measurement error often attenuates diet-disease relationship and thus biases the effect towards the null. The amount by which the association is biased can be estimated by the attenuation factor [25]. According to Rosner and colleagues [56], if a primary regression model **(Eq. 4.1)** with a response variable *Y* contains only one error-prone explanatory variable *Q*
_{
j
}(bivariate scenario) correction for attenuation, consists simply of dividing the regression coefficient *β*
_{(naïve, j)} of that variable *Q*
_{
j
}by the attenuation factor *λ*
_{j} **(Eq. 4.2)**. The variance estimates for the corrected regression coefficient of effect as well as 95 percent CI are estimated using Eqs. **4.3.1, 4.3.2** and **4.4**. *Q*
_{1ij
}and biomarkers *M*
_{
ij
}and *N*
_{
ij
}are entered into the model as z-scored variables. The same applies to the repeat measures *Q*
_{2ij
}and *Q*
_{3ij.
}Note that *Q*
_{1ij
}is the manifest variable denoting visit 1 dietary exposures, which are the test exposures of interest (n = 7,814).

For logistic regression, the assumptions made are linear homoscedastic regression of T on Q with a normally distributed error term and a rare disease requirement [56]. Subsequently Kuha [57] introduced two key requirements for approximate unbiasedness of *β*
_{RC} : **(i)** *β*
^{2}
_{1} * *σ*
^{2} product is small; where *σ*
^{2} = Var (*T*|*Q*, *Z*
_{
k
}); **(ii)** Pr(*Y* = 1|*T*) is small and f(*T*|*Q*) is normal.

In cases where more than one error-prone variable and several perfectly measured variables are introduced into a primary regression model **(5)**, the attenuation factor is no longer the parameter to be used for calibration.

#### (2) Multivariate scenario: regression calibration and simulation extrapolation

For the multivariate scenario, we computed odds ratios of clinically significant decline in WFT (RCI < -1.645) with increase in each exposure by 1 SD through a multivariate logistic regression analysis. Control for confounding was accomplished using backward elimination whereby covariates that changed the estimated effect of the exposure by more than 5% were retained in the final model [58]. The parsimonious model which provided a non-confounded estimate of the effect of a fatty acid exposure on the outcome (decline in WFT over a period of six years: a binary variable) was represented as the naïve estimate of effect. Subsequently, regression calibration and simulation extrapolation were conducted on this same parsimonious model, as alternative methods for two approaches (A and B) described earlier.

Under such a multivariate scenario, regression calibration becomes reliant on the variance-covariance matrix of error in the measurement of different error-prone variables *Q*
_{
j
}(${\widehat{\Sigma}}_{\text{uu}}$) as well as the variance-covariance matrices of the variables themselves. This relationship termed "method of moments" can be summarized in equation **(6)** [26].

The detailed explanations of each moment is laid out in equations **7.1** through **7.5** [59], and table 5.

**Example**: FFQ visit 1 z-score for fatty acid exposures (Q1) are error-prone variables and have at most two replicates: 1 ≤ k ≤ 3 for each observation. Z (e.g. age) is available for all observations and is not error-prone.

k replicates for each individual i. W = {Q_{1}, Q_{2}/Q_{3}} or {Q_{1}, M/N}.

Alternatively, one can use simulation extrapolation (SIMEX) which also relies on the method of moments with an estimate of ${\widehat{\Sigma}}_{\text{uu}}$ or replicate measures of the error-prone variable. This method can be summarized as follows[60]:

**Step 1** : Simulation step:

• Create additional datasets with increasingly larger amounts of measurement error, after estimating error variance for each error-prone variable (${\widehat{\sigma}}_{u}^{2}$):

(1 + *θ*)${\widehat{\sigma}}_{u}^{2}$ with: *θ* = {0.5,1.0,1.5,2.0}

• Regression coefficients estimated using method of moments **(Eq. 6–7.5)**

**Step 2**: Extrapolation step:

After plotting each of the estimated coefficients with *θ* = {0.5,1.0,1.5,2.0}on the x-axis, the coefficient *β*
_{
T, SIMEX
}is estimated by extrapolating backwards to a value of *θ* = -1, where covariate Q becomes error-free,

**Step 3**: Variance and SE of the regression coefficients for SIMEX point estimates are estimated either with an asymptotic or bootstrap method.

## References

- 1.
Subar AF, Thompson FE, Kipnis V, Midthune D, Hurwitz P, McNutt S, McIntosh A, Rosenfeld S: Comparative validation of the Block, Willett, and National Cancer Institute food frequency questionnaires : the Eating at America's Table Study. Am J Epidemiol. 2001, 154 (12): 1089-1099. 10.1093/aje/154.12.1089.

- 2.
Shatenstein B, Nadon S, Godin C, Ferland G: Development and validation of a food frequency questionnaire. Can J Diet Pract Res. 2005, 66 (2): 67-75. 10.3148/66.2.2005.67.

- 3.
Date C, Fukui M, Yamamoto A, Wakai K, Ozeki A, Motohashi Y, Adachi C, Okamoto N, Kurosawa M, Tokudome Y, Kurisu Y, Watanabe Y, Ozasa K, Nakagawa S, Tokui N, Yoshimura T, Tamakoshi A: Reproducibility and validity of a self-administered food frequency questionnaire used in the JACC study. J Epidemiol. 2005, 15 (Suppl 1): S9-23. 10.2188/jea.15.S9.

- 4.
Kabagambe EK, Baylin A, Allan DA, Siles X, Spiegelman D, Campos H: Application of the method of triads to evaluate the performance of food frequency questionnaires and biomarkers as indicators of long-term dietary intake. Am J Epidemiol. 2001, 154 (12): 1126-1135. 10.1093/aje/154.12.1126.

- 5.
Duffy SW, Maximovitch DM, Day NE: External validation, repeat determination, and precision of risk estimation in misclassified exposure data in epidemiology. J Epidemiol Community Health. 1992, 46 (6): 620-624.

- 6.
Amiano P, Dorronsoro M, de Renobales M, Ruiz de Gordoa JC, Irigoien I: Very-long-chain omega-3 fatty acids as markers for habitual fish intake in a population consuming mainly lean fish: the EPIC cohort of Gipuzkoa. European Prospective Investigation into Cancer and Nutrition. Eur J Clin Nutr. 2001, 55 (10): 827-832. 10.1038/sj.ejcn.1601242.

- 7.
Brunner E, Stallone D, Juneja M, Bingham S, Marmot M: Dietary assessment in Whitehall II: comparison of 7 d diet diary and food-frequency questionnaire and validity against biomarkers. Br J Nutr. 2001, 86 (3): 405-414.

- 8.
Kobayashi M, Sasaki S, Kawabata T, Hasegawa K, Akabane M, Tsugane S: Single measurement of serum phospholipid fatty acid as a biomarker of specific fatty acid intake in middle-aged Japanese men. Eur J Clin Nutr. 2001, 55 (8): 643-650. 10.1038/sj.ejcn.1601194.

- 9.
Morris MC, Evans DA, Bienias JL, Tangney CC, Bennett DA, Wilson RS, Aggarwal N, Schneider J: Consumption of fish and n-3 fatty acids and risk of incident Alzheimer disease. Arch Neurol. 2003, 60 (7): 940-946. 10.1001/archneur.60.7.940.

- 10.
Kalmijn S, van Boxtel MP, Ocke M, Verschuren WM, Kromhout D, Launer LJ: Dietary intake of fatty acids and fish in relation to cognitive performance at middle age. Neurology. 2004, 62 (2): 275-280.

- 11.
Morris MC, Evans DA, Tangney CC, Bienias JL, Wilson RS: Fish Consumption and Cognitive Decline With Age in a Large Community Study. Arch Neurol. 2005

- 12.
Kotani S, Sakaguchi E, Warashina S, Matsukawa N, Ishikura Y, Kiso Y, Sakakibara M, Yoshimoto T, Guo J, Yamashima T: Dietary supplementation of arachidonic and docosahexaenoic acids improves cognitive dysfunction. Neurosci Res. 2006, 56 (2): 159-164. 10.1016/j.neures.2006.06.010.

- 13.
Beydoun MA, Kaufman JS, Sloane PD, Heiss G, Ibrahim J: n-3 fatty acids, hypertension and risk of cognitive decline among older adults in the Atherosclerosis Risk in Communities (ARIC) study. Public Health Nutr. 2007

- 14.
Conquer JA, Tierney MC, Zecevic J, Bettger WJ, Fisher RH: Fatty acid analysis of blood plasma of patients with Alzheimer's disease, other types of dementia, and cognitive impairment. Lipids. 2000, 35 (12): 1305-1312. 10.1007/s11745-000-0646-3.

- 15.
Tully AM, Roche HM, Doyle R, Fallon C, Bruce I, Lawlor B, Coakley D, Gibney MJ: Low serum cholesteryl ester-docosahexaenoic acid levels in Alzheimer's disease: a case-control study. Br J Nutr. 2003, 89 (4): 483-489. 10.1079/BJN2002804.

- 16.
Heude B, Ducimetiere P, Berr C: Cognitive decline and fatty acid composition of erythrocyte membranes – The EVA Study. Am J Clin Nutr. 2003, 77 (4): 803-808.

- 17.
Laurin D, Verreault R, Lindsay J, Dewailly E, Holub BJ: Omega-3 fatty acids and risk of cognitive impairment and dementia. J Alzheimers Dis. 2003, 5 (4): 315-322.

- 18.
Beydoun MA, Kaufman JS, Satia JA, Rosamond W, Folsom AR: Plasma n-3 fatty acids and risk of cognitive decline among older adults: The Atherosclerosis Risk in Communities (ARIC) study. Am J Clin Nutr. 2007, 85 (4): 1103-1111.

- 19.
Kalmijn S, Launer LJ, Ott A, Witteman JC, Hofman A, Breteler MM: Dietary fat intake and the risk of incident dementia in the Rotterdam Study. Ann Neurol. 1997, 42 (5): 776-782. 10.1002/ana.410420514.

- 20.
Engelhart MJ, Geerlings MI, Ruitenberg A, Van Swieten JC, Hofman A, Witteman JC, Breteler MM: Diet and risk of dementia: Does fat matter?: The Rotterdam Study. Neurology. 2002, 59 (12): 1915-1921. 10.1001/archneur.59.12.1915.

- 21.
Maclean CH, Issa AM, Newberry SJ, Mojica WA, Morton SC, Garland RH, Hilton LG, Traina SB, Shekelle PG: Effects of omega-3 fatty acids on cognitive function with aging, dementia, and neurological diseases. Evid Rep Technol Assess (Summ). 2005, 114: 1-3.

- 22.
Schimek MG: Smoothing and regression: approaches, computations and application. 2000

- 23.
Budtz-Jorgensen E, Keiding N, Grandjean P, Weihe P, White RF: Consequences of exposure measurement error for confounder identification in environmental epidemiology. Stat Med. 2003, 22 (19): 3089-3100. 10.1002/sim.1541.

- 24.
Fraser GE, Butler TL, Shavlik D: Correlations between estimated and true dietary intakes: using two instrumental variables. Ann Epidemiol. 2005, 15 (7): 509-518. 10.1016/j.annepidem.2004.12.012.

- 25.
Kipnis V, Subar AF, Midthune D, Freedman LS, Ballard-Barbash R, Troiano RP, Bingham S, Schoeller DA, Schatzkin A, Carroll RJ: Structure of dietary measurement error: results of the OPEN biomarker study. Am J Epidemiol. 2003, 158 (1): 14-21. 10.1093/aje/kwg091.

- 26.
Carroll RJ, Ruppert D, Stefanski LA: Measurement error in Nonlinear Models. 1995, CRC Press

- 27.
Whelton SP, He J, Whelton PK, Muntner P: Meta-analysis of observational studies on fish intake and coronary heart disease. Am J Cardiol. 2004, 93 (9): 1119-1123. 10.1016/j.amjcard.2004.01.038.

- 28.
He K, Song Y, Daviglus ML, Liu K, Van Horn L, Dyer AR, Greenland P: Accumulated evidence on fish consumption and coronary heart disease mortality: a meta-analysis of cohort studies. Circulation. 2004, 109 (22): 2705-2711. 10.1161/01.CIR.0000132503.19410.6B.

- 29.
He K, Song Y, Daviglus ML, Liu K, Van Horn L, Dyer AR, Goldbourt U, Greenland P: Fish consumption and incidence of stroke: a meta-analysis of cohort studies. Stroke. 2004, 35 (7): 1538-1542. 10.1161/01.STR.0000130856.31468.47.

- 30.
Mozaffarian D, Longstreth WT, Lemaitre RN, Manolio TA, Kuller LH, Burke GL, Siscovick DS: Fish consumption and stroke risk in elderly individuals: the cardiovascular health study. Arch Intern Med. 2005, 165 (2): 200-206. 10.1001/archinte.165.2.200.

- 31.
Feskens EJ, Virtanen SM, Rasanen L, Tuomilehto J, Stengard J, Pekkanen J, Nissinen A, Kromhout D: Dietary factors determining diabetes and impaired glucose tolerance. A 20-year follow-up of the Finnish and Dutch cohorts of the Seven Countries Study. Diabetes Care. 1995, 18 (8): 1104-1112. 10.2337/diacare.18.8.1104.

- 32.
Wakai K, Tamakoshi K, Date C, Fukui M, Suzuki S, Lin Y, Niwa Y, Nishio K, Yatsuya H, Kondo T, Tokudome S, Yamamoto A, Toyoshima H, Tamakoshi A: Dietary intakes of fat and fatty acids and risk of breast cancer: a prospective study in Japan. Cancer Sci. 2005, 96 (9): 590-599. 10.1111/j.1349-7006.2005.00084.x.

- 33.
Terry PD, Terry JB, Rohan TE: Long-chain (n-3) fatty acid intake and risk of cancers of the breast and the prostate: recent epidemiological studies, biological mechanisms, and directions for future research. J Nutr. 2004, 134 (12 Suppl): 3412S-3420S.

- 34.
Hibbeln JR, Salem N: Dietary polyunsaturated fatty acids and depression: when cholesterol does not satisfy. Am J Clin Nutr. 1995, 62 (1): 1-9.

- 35.
Hibbeln JR, Umhau JC, George DT, Salem N: Do plasma polyunsaturates predict hostility and depression?. World Rev Nutr Diet. 1997, 82: 175-186.

- 36.
Shahar E, Folsom AR, Wu KK, Dennis BH, Shimakawa T, Conlan MG, Davis CE, Williams OD: Associations of fish intake and dietary n-3 polyunsaturated fatty acids with a hypocoagulable profile. The Atherosclerosis Risk in Communities (ARIC) Study. Arterioscler Thromb. 1993, 13 (8): 1205-1212.

- 37.
Conquer JA, Cheryk LA, Chan E, Gentry PA, Holub BJ: Effect of supplementation with dietary seal oil on selected cardiovascular risk factors and hemostatic variables in healthy male subjects. Thromb Res. 1999, 96 (3): 239-250. 10.1016/S0049-3848(99)00106-1.

- 38.
Shahar E, Boland LL, Folsom AR, Tockman MS, McGovern PG, Eckfeldt JH: Docosahexaenoic acid and smoking-related chronic obstructive pulmonary disease. The Atherosclerosis Risk in Communities Study Investigators. Am J Respir Crit Care Med. 1999, 159 (6): 1780-1785.

- 39.
Kaaks R, Riboli E, van Staveren W: Calibration of dietary intake measurements in prospective cohort studies. Am J Epidemiol. 1995, 142 (5): 548-556.

- 40.
Arab L: Biomarkers of fat and fatty acid intake. J Nutr. 2003, 133 (Suppl 3): 925S-932S.

- 41.
Knutsen SF, Fraser GE, Beeson WL, Lindsted KD, Shavlik DJ: Comparison of adipose tissue fatty acids with dietary fatty acids as measured by 24-hour recall and food frequency questionnaire in Black and White Adventists: the Adventist Health Study. Ann Epidemiol. 2003, 13 (2): 119-127. 10.1016/S1047-2797(02)00260-0.

- 42.
Cantwell MM, Gibney MJ, Cronin D, Younger KM, O'Neill JP, Hogan L, Flynn MA: Development and validation of a food-frequency questionnaire for the determination of detailed fatty acid intakes. Public Health Nutr. 2005, 8 (1): 97-107. 10.1079/PHN2005668.

- 43.
Feunekes GI, Van Staveren WA, De Vries JH, Burema J, Hautvast JG: Relative and biomarker-based validity of a food-frequency questionnaire estimating intake of fats and cholesterol. Am J Clin Nutr. 1993, 58 (4): 489-496.

- 44.
Patch C, Murphy K, Mansour J, Tapsel L, Meyer B, Mori T, Noakes M, Clifton P, Puddey I, Howe P: Erythrocyte biomarker-based validation of a diet history method used in a dietary intervention trial. Asia Pac J Clin Nutr. 2004, 13 (Suppl): S60-

- 45.
Spiegelman D, Zhao B, Kim J: Correlated errors in biased surrogates: study designs and methods for measurement error correction. Stat Med. 2005, 24 (11): 1657-1682. 10.1002/sim.2055.

- 46.
The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. Am J Epidemiol. 1989, 129 (4): 687-702.

- 47.
Lezak MD: Neuropsychological assessment. 1983, New York: Oxford University Press, 2

- 48.
Tranel D: Neuropsychological assessment. Psychiatr Clin North Am. 1992, 15 (2): 283-299.

- 49.
Benton AL, Eslinger PJ, Damasio AR: Normative observations on neuropsychological test performances in old age. J Clin Neuropsychol. 1981, 3 (1): 33-42.

- 50.
Franzen MD, (Ed): Mutlilingual aphasia examination. 1986, Kansas City, MO: Test Corporation of America, 5:

- 51.
Frerichs RJ, Tuokko HA: Reliable change scores and their relation to perceived change in memory: Implications for the diagnosis of mild cognitive impairment. Arch Clin Neuropsychol. 2005

- 52.
Willett WC, Sampson L, Stampfer MJ, Rosner B, Bain C, Witschi J, Hennekens CH, Speizer FE: Reproducibility and validity of a semiquantitative food frequency questionnaire. Am J Epidemiol. 1985, 122 (1): 51-65.

- 53.
Lands WE, Libelt B, Morris A, Kramer NC, Prewitt TE, Bowen P, Schmeisser D, Davidson MH, Burns JH: Maintenance of lower proportions of (n - 6) eicosanoid precursors in phospholipids of human plasma in response to added dietary (n - 3) fatty acids. Biochim Biophys Acta. 1992, 1180 (2): 147-162.

- 54.
Lands WE: Long-term fat intake and biomarkers. Am J Clin Nutr. 1995, 61 (3 Suppl): 721S-725S.

- 55.
Willet WC: Nutritional epidemiology. 1990, New York: Oxford University Press

- 56.
Rosner B, Willett WC, Spiegelman D: Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. Stat Med. 1989, 8 (9): 1051-1069. discussion 1071–1053

- 57.
Kuha J: Corrections for exposure measurement error in logistic regression models with an application to nutritional data. Stat Med. 1994, 13 (11): 1135-1148. 10.1002/sim.4780131105.

- 58.
Maldonado G, Greenland S: Simulation study of confounder-selection strategies. Am J Epidemiol. 1993, 138 (11): 923-936.

- 59.
Hardin JW, Schmiediche H, Carroll RJ: The Regression Calibration Method for fitting Generalized Linear Models with Additive Measurement Error. The STATA Journal. 2003, 4 (1–11):

- 60.
Hardin JW, Schemiediche H, Carroll RJ: The simulation extrapolation method for fitting generalized linear models with additive measurement error. The STATA Journal. 2003, 4: 373-385.

### Pre-publication history

The pre-publication history for this paper can be accessed here:http://0-www.biomedcentral.com.brum.beds.ac.uk/1471-2288/7/41/prepub

## Acknowledgements

The authors gratefully acknowledge Aaron Folsom (University of Minnesota, Twin Cities, Department of Epidemiology and ARIC principal investigator) for making available to us the plasma fatty acid data for the Minneapolis baseline population of ARIC. We would like to thank William E. M. Lands for his assistance in clarifying concepts related to his previously published empirical equations.

The Atherosclerosis Risk in Communities Study is carried out as a collaborative study supported by National Heart, Lung, and Blood Institute contracts N01-HC-55015, N01-HC-55016, N01-HC-55018, N01-HC-55019, N01-HC-55020, N01-HC-55021, and N01-HC-55022. The authors thank the staff and participants of the ARIC study for their important contributions.

## Author information

## Additional information

### Competing interests

The author(s) declare that they have no competing interests.

### Authors' contributions

MAB: Conception, literature review, methods, plan of analysis, data analysis, interpretation of results.

JSK: Conception, methods, plan of analysis, literature review, revision of manuscript.

JI: Methods, plan of analysis, revision of manuscript.

JS: Conception, literature review, plan of analysis, revision of manuscript.

GH: Provision of data, plan of analysis, revision of manuscript.

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

## About this article

### Cite this article

Beydoun, M.A., Kaufman, J.S., Ibrahim, J. *et al.* Measurement error adjustment in essential fatty acid intake from a food frequency questionnaire: alternative approaches and methods.
*BMC Med Res Methodol* **7, **41 (2007) doi:10.1186/1471-2288-7-41

#### Received

#### Accepted

#### Published

#### DOI

### Keywords

- Essential Fatty Acid
- Cholesteryl Ester
- Food Frequency Questionnaire
- Attenuation Factor
- Reliable Change Index