load_mash_df.Rd
Read in the random and the strong datasets
load_mash_df(path, numSNPs, suffix)
File path to the rds files saved from bigsnpr, a character string. Defaults to the working directory.
The number of most significant SNPs selected from each GWAS. Ideally this will give 1 million or fewer total cells in the resultant mash dataframes. Defaults to 1000.
Character. Optional. Should be the unique suffix used to
save cdbn_bigsnp2mashr
output as RDS files, if it was used.
A list containing five data frames: the SNPs selected, the B_hat and S_hat matrices for the strong SNP set and for a random SNP set that is twice the size.
Make sure you've removed doubles and NA's from your dataset - or set Shat NA's to a very large number so that that condition will be discounted by mashr.