The training datasets used by trRosetta:
  • The FASTA sequences for the training set (15015 proteins): download
  • The PDB structures for the training set (15015 proteins): download (size: 761M)

    The MSAs in a3m format for the two test sets can be downloaded below:
  • CASP13 (Note: the native structures can be downloaded from the casp13 website: http://predictioncenter.org/download_area/CASP13/targets/)
  • CAMEO (Both MSAs and native structure files are available)