HPA home

TRIMSEQ : Trim ambiguous bits off the ends of sequences (EMBOSS)



your e-mail
( = required, = conditionally required)


input Section


advanced Section


output Section


input Section


sequence -- gapany [sequences] (-sequence) : please enter
either :
  1. the name of a file:
  2. or the actual data here:

(sequence format)


[Return to the main part with your favorite browser's Back function]


advanced Section

Window size (-window)
Percent threshold of ambiguity in window (-percent)
Trim off all ambiguity codes, not just N or X (-strict)
Trim off asterisks (-star)
Trim at the start (-left)
Trim at the end (-right)

[Return to the main part with your favorite browser's Back function]


output Section

outseq (-outseq)

Output format for: outseq

[Return to the main part with your favorite browser's Back function]


your e-mail

Some explanations about the options


input Section
enter either the name of a file or the actual data
if you are using Netscape 2.x or later, you can select a file by typing its name, or better, by selecting it with the Netscape file browser (Browse button)
OR you can type your data in the next area, or cut and paste it from another application.
(but not both)

advanced Section
Window size (-window)
This determines the size of the region that is considered when deciding whether the percentage of ambiguity is greater than the threshold. A value of 5 means that a region of 5 letters in the sequence is shifted along the sequence from the ends and trimming is done only if there is a greater or equal percentage of ambiguity than the threshold percentage.
Percent threshold of ambiguity in window (-percent)
This is the threshold of the percentage ambiguity in the window required in order to trim a sequence.
Trim off all ambiguity codes, not just N or X (-strict)
In nucleic sequences, trim off not only N's and X's, but also the nucleotide IUPAC ambiguity codes M, R, W, S, Y, K, V, H, D and B. In protein sequences, trim off not only X's but also B and Z.
Trim off asterisks (-star)
In protein sequences, trim off not only X's, but also the *'s
Sequence format
The sequence will be automatically converted in the format needed for the program
providing you enter a sequence either:
in plain (raw) sequence format or in one of the following known formats:
IG,GenBank,NBRF,EMBL,GCG,DNAStrider,Fitch,fasta,Phylip,PIR,MSF,ASN,PAUP,CLUSTALW
You may enter in the text area a database entry code, or an accession number, in this form:
database:entry_name
or:
database:accession.

Pise form generator version: 5.a (16 Dec 2002 11:55)