Search results

  1. D

    Taxonomy using proteins from TSA

    Thank you! I was not going to find taxonomy.cpp:83 easily! I have been using the precompiled binary, so I might be about to run into a lot of compilation errors ... Thanks! D PS - the current distribution of accession lengths in nucl_wgs.accession2taxid is: 15186413 10 380856633...
  2. D

    Taxonomy using proteins from TSA

    And for "Also note that multiple entries for the same accession will not be handled correctly", do you mean multiple entries in the taxid mapping file, or multiple entries in the input sequence list, or both?
  3. D

    Taxonomy using proteins from TSA

    Hi! Thank you - it looks like lengths of 15 and 17 are common, and occasionally 16. Can this be fixed simply by editing line 45 of taxonomy.h to read enum { max_accesion_len = 17 }; or is there something more subtle elsewhere? Thanks! D
  4. D

    Taxonomy using proteins from TSA

    Hi! I would like diamond to report on taxonomy from the TSA. These are nucleotide sequences, so I have generated my own putative translations. My input sequences for the database have headers/names like: >GAAA01000001.1 1.Latimeria chalumnae To provide the taxids for the database, I am using...
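
The accession-length question raised in these posts can be checked with a short script. The following is a minimal sketch, not part of diamond: it assumes a tab-separated accession2taxid-style file with a header row, and the function name, the column choice, and the sample rows (including the placeholder taxid and gi values) are all made up for illustration.

```python
from collections import Counter


def accession_length_counts(lines, column=0, has_header=True):
    """Count how many accessions have each length.

    `lines` is any iterable of tab-separated text lines (for example an
    open file). `column` selects which field holds the accession; which
    column a given tool actually reads is an assumption to verify.
    """
    counts = Counter()
    rows = iter(lines)
    if has_header:
        next(rows, None)  # skip the header row if present
    for line in rows:
        fields = line.rstrip("\n").split("\t")
        if len(fields) <= column or not fields[column]:
            continue  # skip blank or short lines
        counts[len(fields[column])] += 1
    return counts


# Hypothetical sample rows; taxid and gi values are placeholders.
sample = [
    "accession\taccession.version\ttaxid\tgi\n",
    "GAAA01000001\tGAAA01000001.1\t0\t0\n",
    "GAAA01000002\tGAAA01000002.1\t0\t0\n",
    "ABCDEF010000001\tABCDEF010000001.1\t0\t0\n",
]

for length, n in sorted(accession_length_counts(sample, column=1).items()):
    print(length, n)
```

For a real file, the list can be swapped for an open file handle, for example `with open(path) as fh: accession_length_counts(fh)`. Any length above the compiled-in limit would then show up directly in the counts.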