Does the number of manifest measures affect the model result

jahoosafat · Post by **jahoosafat** » Tue May 01, 2012 3:26 am

In factor analysis, it's well known that if you generate a factor from, for instance, 8 variables and include it in a regression with another factor generated from, let's say, 2 variables ... then the first factor is almost definitely going to have a much stronger association with the dependent variable. Does something similar happen in PLS? Does the number of manifest variables in an outer model for a latent variable affect how strong of a predictor it will be in the PLS causal model?

Hengkov · Post by **Hengkov** » Thu May 03, 2012 9:38 am

Hi,
Relative complex model.
But number of indicators 5-8 is enough.
Regards,
Hengky

jahoosafat · Post by **jahoosafat** » Thu May 03, 2012 12:38 pm

Hi Hengky - Thanks, but I don't think that answers my question. I'm not asking what is a sufficient number of indicators for a good model. I'm asking if PLS "favors" latent variables that have large numbers of indicators over latents with smaller numbers of indicators in the same model. So, if we had a 3 latent variable model:

X -> Y <- Z

And X had 10 indicators while Z had 2 indicators. If I did a principal components regression, X would almost certainly be found to be the stronger predictor of Y. Empirically, do we find the same sort of thing in PLS or no?

Cheers,
Jeff

Hengkov · Post by **Hengkov** » Thu May 03, 2012 7:16 pm

Hi Jeffry,
If you have models is complex => large indicators is good (optimal predict),
But If you have models is simple => small indicators no problem (with minimal indicator 3 items for each construct, some literature suggest > 5 items for evaluation outer model).
Note: number of resampling is large for correct SE.
Regards,
Hengky

jahoosafat · Post by **jahoosafat** » Thu May 03, 2012 8:14 pm

Still not the question. What I am asking about is the comparative predictive strength of latents with many indicators vs. latents with few indicators IN THE SAME MODEL. Again, empirically, if you do a simple principle components regression, you'll almost always find that the latents with many indicators would be stronger predictors of some dependent variable as compared to latents with very few indicators. The question is whether PLS has the same tendency. Maybe someone else could address this question.

Hengkov · Post by **Hengkov** » Thu May 03, 2012 11:08 pm

Hi Jeffry,
Maybe => PLS and PCR is similar result.
See, Garson, D. 2012. Partial Least Squares, "Blue Book".

Marcel_Dijkstra · Post by **Marcel_Dijkstra** » Thu May 17, 2012 8:01 am

It all depends on the structure in the data, i.e. whether high scores in X corresponds with high (or low) scores in Y. To illustrate this take it to the extreme; a signle indicator latent X as compared to a 10 indicator latent Z.
Now suppose X is smoking behaviour, Z is a schizofrenia scale and your dependent latent Y is lung cancer, then most probably the relationship between X and Y is likely to be stronger than the relationship between Z and Y.
But you are right in general a latent with more indicator represents a more 'stable' measurement than a single latent indicator. The latter has just a larger error term by the very definition. And hence it will be more difficult for a single indicator latent to reach significance. The fluctuations in the regression coefficient around the 'true' value is just greater. That is the same as saying that one case has more influence over the latent score in a signle indicator latent than one case has when you have 10 measurements. A fluke in one of the indicators in a 10 indicator latent will have less effect because the latent score is the weighted average of all 10 scores.
I hope this gives you an answer to your question.

jahoosafat · Post by **jahoosafat** » Thu May 17, 2012 10:37 am

This makes sense. Thanks for the explanation.

Diogenes · Post by **Diogenes** » Mon May 21, 2012 12:28 pm

Hi,
In others words:

1) LISREL (SEM – covariance based) is not biased = the correlation between two LV will be estimated correctly.
2) PLS-PM is “consistent at large” = correlation between LV are underestimated. The correlations (or path coefficients) are estimated from the LV scores, that are a weighted mean of the indicators, and the measurement error are included in the score calculation.
Nunnally and Bernstein named this effect as “attenuation of the correlation”.
Lohmöller, Wold and Chin remember us, in many articles, that PLS is “consistent at large”. With more indicators, greater will be the reliability e lesser will be the attenuation.

Best regards,

Bido