HPA home

GETORF : Finds and extracts open reading frames (ORFs) (EMBOSS)



your e-mail
( = required, = conditionally required)


input Section


advanced Section


output Section


input Section


sequence -- DNA [sequences] (-sequence) : please enter
either :
  1. the name of a file:
  2. or the actual data here:

(sequence format)


[Return to the main part with your favorite browser's Back function]


advanced Section


Code to use -- Genetic codes (-table)
Minimum nucleotide size of ORF to report (-minsize)

Type of output -- Type of sequence to output (-find)
Change initial START codons to Methionine (-methionine)
Is the sequence circular (-circular)
Find ORFs in the reverse sequence (-reverse)
Number of flanking nucleotides to report (-flanking)

[Return to the main part with your favorite browser's Back function]


output Section

outseq (-outseq)

Output format for: outseq

[Return to the main part with your favorite browser's Back function]


your e-mail

Some explanations about the options


input Section
enter either the name of a file or the actual data
if you are using Netscape 2.x or later, you can select a file by typing its name, or better, by selecting it with the Netscape file browser (Browse button)
OR you can type your data in the next area, or cut and paste it from another application.
(but not both)

advanced Section
Type of output -- Type of sequence to output (-find)
This is a small menu of possible output options. The first four options are to select either the protein translation or the original nucleic acid sequence of the open reading frame. There are two possible definitions of an open reading frame: it can either be a region that is free of STOP codons or a region that begins with a START codon and ends with a STOP codon. The last three options are probably only of interest to people who wish to investigate the statistical properties of the regions around potential START or STOP codons. The last option assumes that ORF lengths are calculated between two STOP codons.
Change initial START codons to Methionine (-methionine)
START codons at the beginning of protein products will usually code for Methionine, despite what the codon will code for when it is internal to a protein. This qualifier sets all such START codons to code for Methionine by default.
Find ORFs in the reverse sequence (-reverse)
Set this to be false if you do not wish to find ORFs in the reverse complement of the sequence.
Number of flanking nucleotides to report (-flanking)
If you have chosen one of the options of the type of sequence to find that gives the flanking sequence around a STOP or START codon, this allows you to set the number of nucleotides either side of that codon to output. If the region of flanking nucleotides crosses the start or end of the sequence, no output is given for this codon.
Sequence format
The sequence will be automatically converted in the format needed for the program
providing you enter a sequence either:
in plain (raw) sequence format or in one of the following known formats:
IG,GenBank,NBRF,EMBL,GCG,DNAStrider,Fitch,fasta,Phylip,PIR,MSF,ASN,PAUP,CLUSTALW
You may enter in the text area a database entry code, or an accession number, in this form:
database:entry_name
or:
database:accession.

Pise form generator version: 5.a (16 Dec 2002 11:54)