cDNA alignment to gDNA for exon annotation

Alert: Please make sure that your sequence is clean DNA sequence without symbols such as ":", which many sequence assembly programs produce. A utility is supplied to clean your FASTA-format sequence.
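If you prefer to clean a sequence locally before pasting it in, the idea can be sketched in a few lines of Python. This is only an illustration, not the utility supplied by DNannotator: the function name `clean_fasta` is hypothetical, and it assumes that anything other than an IUPAC nucleotide letter on a sequence line should be dropped.

```python
import re

# Assumption: keep only IUPAC nucleotide letters on sequence lines.
# This is an illustrative sketch, not DNannotator's own cleaning utility.
NON_NUCLEOTIDE = re.compile(r"[^ACGTUNRYKMSWBDHVacgtunrykmswbdhv]")

def clean_fasta(text):
    """Return FASTA text with symbols, digits and spaces removed from sequence lines."""
    cleaned = []
    for line in text.splitlines():
        if line.startswith(">"):
            # Header lines are kept as they are (trailing whitespace trimmed).
            cleaned.append(line.rstrip())
        else:
            seq = NON_NUCLEOTIDE.sub("", line)
            if seq:
                cleaned.append(seq)
    return "\n".join(cleaned) + "\n"
```

For example, a sequence line such as `ACG:T 12` would come out as `ACGT`.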

 Entries marked * are required.

NOTE: Alignments are computed with Sim4, designed by Florea L et al.

1. * Please copy and paste in the cDNA sequence data (may contain multiple cDNA sequences in FASTA format):

or upload data file

 

2. * Please give the gDNA sequence data file (only one gDNA sequence in FASTA format is allowed).
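Since only one gDNA sequence is accepted, it may help to confirm this before uploading. A minimal sketch, assuming that each FASTA record starts with a `>` header line; the function name `count_fasta_records` is hypothetical and is not part of DNannotator.

```python
def count_fasta_records(text):
    """Count FASTA records by counting header lines that start with '>'."""
    return sum(1 for line in text.splitlines() if line.startswith(">"))
```

A gDNA file suitable for this form would give a count of exactly 1.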

3. * Please give a description of the un-annotated target genomic DNA sequence, or a Genbank data file or Genbank header file (.gbheader, produced by a previous DNannotator run).

 

    Check this box if this is the first time you are running annotation for this sequence on DNannotator. In that case, please fill in the following information; no Genbank file is needed. A Genbank header file will be created for you from the FASTA sequence and the information you provide here.

       Un-annotated Genbank file (should be the one generated by DNannotator in a previous analysis of the same target sequence): 

3. Adjustable parameters.

H value: (default 500, for shorter introns and EST-or-above quality cDNA analysis; 1000 can be used for high-quality analysis, such as full-length cDNA vs. high-quality gDNA sequence) 

Remove poly-A tail? (A poly-A tail can cause misassignment of an extra exon)
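What poly-A removal does can be illustrated with a short sketch. This is not DNannotator's implementation: the function name `trim_poly_a` and the `min_run` threshold of 10 are assumptions chosen only for the example.

```python
import re

def trim_poly_a(seq, min_run=10):
    """Remove a trailing run of at least min_run A's from a cDNA sequence.

    min_run=10 is an arbitrary illustrative threshold, not a DNannotator setting.
    """
    match = re.search(r"[Aa]{%d,}$" % min_run, seq)
    return seq[:match.start()] if match else seq
```

A sequence ending in a shorter run of A's is left unchanged, so genuine terminal adenines are not stripped by accident.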

4. Choice of outputs:

    Annotation Genbank format data or Genbank format features

             viewable in Vector NTI  or Artemis

    Feature table ready for database in tab-delimited format 

    Sim4 original output

    Original full-length Genbank format data (with sequence body)

    Original gbheader

 

    Give WARNING: alignments with identity percentage < will be labeled as "low quality"; exons shorter than bp will be labeled as "unreliable"
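The labeling rule described above can be sketched as follows. The function name `warning_labels` and its parameter names are hypothetical; the two thresholds are whatever values you enter in the form, so the sketch takes them as required arguments and does not assume any defaults.

```python
def warning_labels(identity_pct, exon_length_bp, min_identity, min_exon_bp):
    """Return the warning labels that apply to one aligned exon.

    min_identity and min_exon_bp stand for the two user-supplied thresholds.
    """
    labels = []
    if identity_pct < min_identity:
        labels.append("low quality")
    if exon_length_bp < min_exon_bp:
        labels.append("unreliable")
    return labels
```

An exon that is both poorly aligned and very short would carry both labels.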

 

5. Please specify the email address where the final annotated sequence can be delivered:

Email address *

Choice of output in gzip-compressed format:

Your email box accepts message size limit of  per message

If your input data file is from a Mac machine, please be sure to check this box
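The Mac checkbox presumably exists because classic Mac OS text files end lines with a carriage return rather than a line feed. If you would rather convert the file yourself, a minimal sketch is shown below; the function name `normalize_newlines` is hypothetical and this is not DNannotator's own conversion.

```python
def normalize_newlines(text):
    """Convert Windows (CRLF) and classic Mac (CR) line endings to LF."""
    # Replace CRLF first so that each pair becomes a single newline.
    return text.replace("\r\n", "\n").replace("\r", "\n")
```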