Scan genomes for internally repeated sequences, elements which are repetitive in another species, or high-identity HGT candidate regions between species.
Take an XML formatted multiblast result, extract genbank IDs for best x hits to each query and download a non-redundant list of matching seqs from ncbi.
For a set of reference transposons, collect instances from one or more genomes. Use aligned collections to build models capturing TE family diversity or produce deRIP'd reference sequence.
This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is
solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.