Migrate annotation from one sequence to UCSC latest freeze sequence

 

* entries are required

1. Please give the name of a originally annotated sequence in Genbank format (preferably from Genbank or created by DNannotator, currently, some features, like CD, with joined multiple elements would not be transferred.  mRNA annotation will be converted into exons.  Annotation of repeats would not be transferred either.  If repeat information is needed, please use "locate repeat elements function")

Annotated Genbank data file *

2. Please select the chromosome the fragments are mapping to:

  * Chromosome of latest version of UCSC freeze (be sure to select the correct one)

3. Adjustable parameters.

Annotation fashion:

Additional parameters for BLAT-based annotation:

Additional parameters for BLAST-based annotation:

For transfer the annotation,

5. Choice of outputs:

    Show BLAT annotation evidence of match identity percentage in Genbank annotation and features table

    Annotation Genbank format data or Genbank format features (gbheader)

             viewable in Vector NTI  or Artemis

    Feature table ready for database in tab-delimited format 

    Annotations failed to be transferred

    Duplicated features (possibly false positive due to homologs)

    Original BLAT output

    Data for the "custom track" of Genome Browser
         Name the track as
    Annotation evidence (compact parsed BLAT data)

    Original full-length Genbank format data (with sequence body)

    Original gbheader

   

6. Please specify the email address to which the outputs will be delivered:

 Email address *

choice of output in gzip compressed format:

Your email box accepts message size limit of  per message

If your input data file is from a Mac machine, please be sure to check this box

 

Attention: 

  1. Repeat_region will be excluded from annotation migration by default.
  2. STS", "gene" will be annotated in ePCR fashion. In other words, only the end sequences of them are used for annotation.
  3. "/label" entries in Genbank format features will be used as "feature name", and they must be unique for each features.

  4. The source Genbank data is better to be gDNA sequence to ensure high conversion rate (more features can be migrated onto new sequence platform).  cDNA sequence based source data could fail more migration especially for SNPs due to the interrupted intron-exon gDNA sequence structures.

 

BLAT created by Jim Kent, 2002 is used