needlerunner

 

Function

Needle all sequence pairs in a directory

Description

**************** EDIT HERE ****************

Algorithm

**************** EDIT HERE ****************

Usage

Under development, no final example yet

Command line arguments

   Standard (Mandatory) qualifiers:
  [-seqpairsdir]       dirlist    Input directory of files containing sequence
                                  pairs for alignment
  [-commandpath]       directory  Directory where alignment executable lives
  [-commandname]       string     Name of alignment executable
  [-scoringmatrixdir]  directory  Directory where scoring matrix lives
  [-scoringmatrixname] string     Name of alignment scoring matrix
   -gapopen            float      The gap opening penalty is subtracted from
                                  the total score every time a gap is inserted
                                  in the alignment. The default value is
                                  chosen for the EBLOSUM62 scoring matrix when
                                  you are aligning protein sequences, and the
                                  EDNAFULL matrix when you are aligning
                                  nucleotide sequences. If you sequence
                                  contains many small errors you should set a
                                  low gap opening penalty (near zero) to avoid
                                  over-penalizing those errors.
   -gapextend          float      The gap extension penalty is the score
                                  substracted when more spaces are added to an
                                  existing alignment gap. Often a single
                                  insertion or deletion affects multiple
                                  residues/nucleotides; separate opening and
                                  extension penalties reduce the excessive
                                  penalty such events would otherwise carry;
                                  generally the gap extension penalty should
                                  be significantly lower than the gap opening
                                  penalty.
  [-alignedseqsoutdir] outdir     Name of directory containing sequences
                                  aligned by needle
  [-outfilesuffix]     string     Appended to end of all output alignment file
                                  names
  [-outfileformat]     string     Format of output alignments

   Additional (Optional) qualifiers: (none)
   Advanced (Unprompted) qualifiers: (none)
   Associated qualifiers: (none)
   General qualifiers:
   -auto               boolean    Turn off prompts
   -stdout             boolean    Write standard output
   -filter             boolean    Read standard input, write standard output
   -options            boolean    Prompt for standard and additional values
   -debug              boolean    Write debug output to program.dbg
   -verbose            boolean    Report some/full command line options
   -help               boolean    Report command line options. More
                                  information on associated and general
                                  qualifiers can be found with -help -verbose
   -warning            boolean    Report warnings
   -error              boolean    Report errors
   -fatal              boolean    Report fatal errors
   -die                boolean    Report deaths


Standard (Mandatory) qualifiers Allowed values Default
[-seqpairsdir]
(Parameter 1)
Input directory of files containing sequence pairs for alignment Directory with files .
[-commandpath]
(Parameter 2)
Directory where alignment executable lives Directory  
[-commandname]
(Parameter 3)
Name of alignment executable Any string is accepted needle
[-scoringmatrixdir]
(Parameter 4)
Directory where scoring matrix lives Directory  
[-scoringmatrixname]
(Parameter 5)
Name of alignment scoring matrix Any string is accepted EBLOSUM62
-gapopen The gap opening penalty is subtracted from the total score every time a gap is inserted in the alignment. The default value is chosen for the EBLOSUM62 scoring matrix when you are aligning protein sequences, and the EDNAFULL matrix when you are aligning nucleotide sequences. If you sequence contains many small errors you should set a low gap opening penalty (near zero) to avoid over-penalizing those errors. Floating point number from 1.0 to 100.0 10.0 for any sequence
-gapextend The gap extension penalty is the score substracted when more spaces are added to an existing alignment gap. Often a single insertion or deletion affects multiple residues/nucleotides; separate opening and extension penalties reduce the excessive penalty such events would otherwise carry; generally the gap extension penalty should be significantly lower than the gap opening penalty. Floating point number from 0.0 to 10.0 0.5 for any sequence
[-alignedseqsoutdir]
(Parameter 6)
Name of directory containing sequences aligned by needle Output directory  
[-outfilesuffix]
(Parameter 7)
Appended to end of all output alignment file names Any string is accepted .needle
[-outfileformat]
(Parameter 8)
Format of output alignments Any string is accepted fasta
Additional (Optional) qualifiers Allowed values Default
(none)
Advanced (Unprompted) qualifiers Allowed values Default
(none)

Input file format

needlerunner reads any normal sequence USAs.

Under development, no example input file(s) yet

Output file format

needlerunner outputs a graph to the specified graphics device. outputs a report format file. The default format is ...

Under development, no example output file(s) yet

Data files

**************** EDIT HERE ****************

Notes

None.

References

None.

Warnings

None.

Diagnostic Error Messages

None.

Exit status

It always exits with status 0.

Known bugs

None.

See also

Program nameDescription
alignrunnerAlign sequence pairs in a directory
comparatorCompare contact scores of two sequence alignments
contactalignDamian Counsell's experimental 2.5-D alignment algorithm
esim4Align an mRNA to a genomic DNA sequence
est2genomeAlign EST and genomic DNA sequences
nawalignDamian Counsell's NW implementation
nawalignrunnerNawalign all sequence pairs in a directory
needleNeedleman-Wunsch global alignment
scorerScore alignments using structural alignments
scorerrunnerSCORER for ordered pairs of substituted seqs
stretcherFinds the best global alignment between two sequences
substituteSubstitute matches into a template
substituterunnerRun SUBSTITUTE on a directory of traces

Author(s)

Damian Counsell (d.counsell © rfcgr.mrc.ac.uk)
MRC Rosalind Franklin Centre for Genomics Research Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SB, UK

History

Target users

This program is intended to be used by everyone and everything, from naive users to embedded scripts.

Comments

None