sign changes and cases in bootstrap

ruchi
PLS Expert User
Posts: 75
Joined: Wed Dec 22, 2010 3:09 pm

sign changes and cases in bootstrap

Post by ruchi »

I have recently started using PLS and want to check the significance of my path coefficients, so I ran bootstrapping. My data set has 200 respondents and 5 latent variables, each measured by 3 manifest variables (items).

3 variables have reflective items and 2 have formative items.

I don't know what to enter for "Cases" in the bootstrap settings. For "Samples" I think I can enter 500 or even more.

What do "No sign changes", "Construct level changes" and "Individual level changes" mean, and which one should be selected in the sign change dropdown box?


Please help me to clear this doubt.

Thanks
Regards
Ruchi
Diogenes
PLS Super-Expert
Posts: 899
Joined: Sat Oct 15, 2005 5:13 pm
Location: São Paulo - BRAZIL

Post by Diogenes »

Hi,

Using the search function...


Bootstrap in general:
viewtopic.php?t=1337&highlight=bootstrap
viewtopic.php?t=1396&highlight=bootstrap

Difference between Bootstrap and Jackknife
viewtopic.php?t=1062&highlight=bootstrap

Sign change:
No sign changes --> not recommended
Individual changes --> recommended by Henseler, Ringle, Sinkovics (2009, p.307)
Construct level changes --> recommended by Tenenhaus et al. (2005, p.177)
http://php.portals.mbs.ac.uk/Portals/49 ... cs-PLS.pdf
http://www.stat.uni-muenchen.de/institu ... deling.pdf
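
To make the difference between the two sign-change options concrete, here is a minimal sketch. It is one reading of the rules, not SmartPLS's actual implementation: the function names, the example numbers, and the construct-level criterion (flip the whole block when |sum(orig - boot)| > |sum(orig + boot)|) are assumptions made for illustration.

```python
def individual_sign_change(original, boot):
    """Flip each bootstrap estimate whose sign differs from the original estimate."""
    return [-b if o * b < 0 else b for o, b in zip(original, boot)]


def construct_level_sign_change(original, boot):
    """Flip the whole block of one construct if it points away from the original.

    Assumed criterion: flip when |sum(o - b)| > |sum(o + b)|.
    """
    diff = abs(sum(o - b for o, b in zip(original, boot)))
    summ = abs(sum(o + b for o, b in zip(original, boot)))
    return [-b for b in boot] if diff > summ else list(boot)


# Hypothetical outer estimates of one construct in the original sample.
original = [0.80, 0.75, 0.10]
# A resample in which the whole construct came out reversed.
boot_flipped = [-0.78, -0.70, 0.05]
# A resample in which only one weak estimate changed sign.
boot_noisy = [0.82, 0.74, -0.04]

for name, boot in [("flipped", boot_flipped), ("noisy", boot_noisy)]:
    print(name, "individual:", individual_sign_change(original, boot))
    print(name, "construct: ", construct_level_sign_change(original, boot))
```

In this sketch the construct-level rule reverses the whole block for the flipped resample and leaves the noisy resample alone, while the individual rule forces every estimate to the sign of the original, including the weak third one.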

Cases = N in your dataset --> see:
viewtopic.php?t=846&highlight=bootstrap

Samples = number of resamples
viewtopic.php?t=813&highlight=bootstrap

Running bootstrap in SmartPLS
viewtopic.php?t=838&highlight=bootstrap

Degree of freedom to compute p-values:

viewtopic.php?t=1317&highlight=bootstrap

Best regards,

Bido

bootstrap case

Post by ruchi »

Thanks a lot

It is helping me clear my doubts.

But I have one small query. My data set has 149 respondents. If I enter 149 under "Cases" in the bootstrap settings with 1000 samples, I get one set of values.

If I enter 300 cases with 1000 samples for the same data set of 149 respondents, I get different results: many more of my variables come out significant than when I entered 149 cases.

Why is it so?

Thanks
Ruchi

Post by Diogenes »

Hi,

Results differ from one bootstrap run to the next because the resamples are drawn at random, with replacement.
With more resamples, the results (t-values) become more stable.

If you use more cases than the original data set contains, the bootstrap standard deviations become smaller and the t-values larger. For this reason, the correct procedure is to set cases = sample size (the number of rows in the data set).
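
The effect of the "cases" setting can be reproduced outside SmartPLS. The following is a minimal sketch under stated assumptions: the data are simulated, a simple regression slope stands in for a path coefficient, and the function names `slope` and `bootstrap_se` are made up for this illustration.

```python
import random
import statistics


def slope(xs, ys):
    """OLS slope of y on x (a stand-in for a path coefficient)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


def bootstrap_se(xs, ys, cases, samples, rng):
    """Standard deviation of the slope over `samples` resamples of size `cases`."""
    n = len(xs)
    estimates = []
    for _ in range(samples):
        # Draw row indices at random, with replacement.
        idx = [rng.randrange(n) for _ in range(cases)]
        estimates.append(slope([xs[i] for i in idx], [ys[i] for i in idx]))
    return statistics.stdev(estimates)


rng = random.Random(42)
n = 149  # rows in the (simulated) original data set
xs = [rng.gauss(0, 1) for _ in range(n)]
ys = [0.15 * x + rng.gauss(0, 1) for x in xs]

b = slope(xs, ys)
se_n = bootstrap_se(xs, ys, cases=149, samples=1000, rng=rng)
se_big = bootstrap_se(xs, ys, cases=300, samples=1000, rng=rng)

print(f"estimate = {b:.3f}")
print(f"cases=149: SE = {se_n:.3f}, t = {b / se_n:.2f}")
print(f"cases=300: SE = {se_big:.3f}, t = {b / se_big:.2f}")
```

With 300 cases each resample pretends the study had about twice as many respondents as it really had, so the standard error shrinks and the t-value grows even though no new information was collected.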

Best regards,

Bido

Post by ruchi »

Thanks a lot for clearing doubts
pha
PLS Junior User
Posts: 3
Joined: Wed Aug 25, 2010 8:28 am

Post by pha »

Hi Bido,
In bootstrapping, is the sample selected at random? If so, does this mean that if I choose 149 cases for each resample, the same number as in my data set, some of the 149 will appear in the resample twice or more?
Otherwise I can't see why, if we choose only 149 cases for each resample, we wouldn't get the same result each time.

Best regards,

Post by Diogenes »

Hi,
Yes, the same case can appear more than once in a resample.

See an example: http://bcs.whfreeman.com/ips5e/content/ ... oore14.pdf
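
A quick way to see this is to draw one resample and count the repeats. This is a minimal sketch: the seed is arbitrary and row indices stand in for respondents.

```python
import random
from collections import Counter

rng = random.Random(1)
n = 149  # rows in the original data set

# One bootstrap resample: n row indices drawn with replacement.
resample = [rng.randrange(n) for _ in range(n)]
counts = Counter(resample)

distinct = len(counts)                                  # rows drawn at least once
repeated = sum(1 for c in counts.values() if c > 1)     # rows drawn two or more times
left_out = n - distinct                                 # rows never drawn

print(f"distinct rows drawn: {distinct} of {n}")
print(f"rows drawn more than once: {repeated}")
print(f"rows never drawn: {left_out}")

# Expected share of distinct rows is 1 - (1 - 1/n)**n, roughly 0.63 here.
print(f"expected distinct share: {1 - (1 - 1 / n) ** n:.3f}")
```

Because each resample repeats some respondents and leaves others out, every resample is a slightly different data set, which is why the estimates vary from one resample to the next.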

Best regards,

Bido
Alexandersen
PLS Junior User
Posts: 1
Joined: Tue Mar 01, 2011 1:12 pm

Contribution to r^2

Post by Alexandersen »

Hi professor,
How can I calculate the LV contribution to the overall r^2?

best regards,

Post by Diogenes »

Hi,

Using an example:

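LVy = b1 * LV1 + b2 * LV2 + b3 * LV3 + b4 * LV4

R2 = b1 * r1y + b2 * r2y + b3 * r3y + b4 * r4y

where b1..b4 are the standardized path coefficients and r1y..r4y are the correlations between each predictor LV and LVy.

Then the contribution of LV3 to the explanation of y is (b3 * r3y).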
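
The decomposition can be checked numerically. The sketch below is an illustration under assumptions, not SmartPLS output: it uses two simulated predictors instead of four, ordinary least squares on standardized scores, and made-up variable names.

```python
import random
import statistics


def standardize(values):
    """Center and scale to mean 0 and (population) standard deviation 1."""
    m, s = statistics.fmean(values), statistics.pstdev(values)
    return [(x - m) / s for x in values]


def corr(a, b):
    """Pearson correlation of two series already standardized with pstdev."""
    return sum(x * y for x, y in zip(a, b)) / len(a)


rng = random.Random(7)
n = 200
lv1 = [rng.gauss(0, 1) for _ in range(n)]
lv2 = [0.5 * u + rng.gauss(0, 1) for u in lv1]  # correlated predictors
lvy = [0.4 * u + 0.3 * v + rng.gauss(0, 1) for u, v in zip(lv1, lv2)]
z1, z2, zy = standardize(lv1), standardize(lv2), standardize(lvy)

r12, r1y, r2y = corr(z1, z2), corr(z1, zy), corr(z2, zy)

# Standardized OLS coefficients for two predictors (closed form).
b1 = (r1y - r12 * r2y) / (1 - r12 ** 2)
b2 = (r2y - r12 * r1y) / (1 - r12 ** 2)

# R2 as the sum of the per-predictor contributions b_j * r_jy.
r2_from_parts = b1 * r1y + b2 * r2y

# Cross-check: R2 as 1 minus the residual variance of the standardized outcome.
resid = [y - b1 * u - b2 * v for u, v, y in zip(z1, z2, zy)]
r2_direct = 1 - sum(e * e for e in resid) / n

print(f"contribution of LV1: {b1 * r1y:.3f}")
print(f"contribution of LV2: {b2 * r2y:.3f}")
print(f"R2 from parts: {r2_from_parts:.3f}, R2 direct: {r2_direct:.3f}")
```

The two R2 values agree, and the individual products b_j * r_jy add up to the total. A single contribution can be negative when a coefficient and its correlation have opposite signs.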

Best regards,

Bido
Bjerg54
PLS User
Posts: 18
Joined: Fri Oct 22, 2010 10:53 am

Post by Bjerg54 »

Hi Bido,
Thank you very much for the fast reply. I now have 3 different follow-up questions :)

Do you have a reference to a research paper or similar that explains the different weighting schemes in SmartPLS (factor, path, centroid)?

Do you know if there is any advantage in "cleaning" your LVs in SPSS before entering them into SmartPLS?

When cleaning the LVs, do you know which method is most compatible with PLS: cleaning them in SPSS by looking at Cronbach's alpha, or cleaning them in SmartPLS by looking at the t-test results?

Thank you very much, your replies are very much appreciated!

Post by Diogenes »

Hi,

1) See:
TENENHAUS, M.; ESPOSITO VINZI, V.; CHATELIN, Y.-M.; LAURO, C. PLS path modeling. Computational Statistics & Data Analysis, v. 48, n. 1, p. 159-205, 2005.
https://studies2.hec.fr/jahia/webdav/si ... S_PM_5.pdf

2) Usually the cleaning (adjusting the measurement model to the data = exploratory approach) is faster in SmartPLS than in SPSS.
If you think that the indicators could measure more than one LV, a principal component analysis in SPSS could help.

3) Principal component analysis.
See:
CHIN, W. W. Partial least squares is to LISREL as principal components analysis is to common factor analysis. Technology Studies, v. 2, n. 2, p. 315–319, 1995.
http://disc-nt.cba.uh.edu/chin/technologystudies.pdf


Best regards,

Bido

Post by Bjerg54 »

Hi Bido,
I have an LV containing 5 manifest variables. When I run a principal component factor analysis in SPSS, all variables load above 0.5, so I conclude that no variables need to be excluded. However, when I enter the model in SmartPLS, the t-values for 2 of the 5 manifest variables are not significant.

Which approach would you consider most reliable?