![]() |
meme |
% meme -protein Motif detection Input sequence(s): memepep.dat Motif Distribution oops : Oops Distribution zoops : Zoops Distribution tcm : TCM Distribution Model to use [zoops]: Maximum number of motifs to find [1]: 3 Output file [carp_rhich.meme]: |
Go to the input files for this example
Go to the output files for this example
Standard (Mandatory) qualifiers: [-sequence] seqall Sequence database USA -model menu Model to use -nmotifs integer Maximum number of motifs to find [-outfile] outfile Output file name Additional (Optional) qualifiers: -ntype menu Method to use -protein boolean Assume sequences are proteins -nucleic boolean Assume sequences are DNA -palindromes boolean Allow palindromes -ponly boolean Force palindromes -[no]shorten boolean Allow motifs shorter than MINW -nsites float Expected number of sites for each motif -minsites float Minimum number of sites for each motif -maxsites float Maximum number of sites for each motif -w integer Starting motif width to try -minw integer Minimum starting motif width to try -maxw integer Maximum starting motif width to try -prior menu Prior to use -[no]brief boolean Don't print documemtation -b float Strength of the prior -spmap menu Mapping start -spfuzz float Fuzziness of sequence to theta mapping -maxiter integer Maximum EM iterations to run -distance float EM convergence criterion -cons string Consensus sequence to start EM from -chi float Cutoff for p-value -adj menu Type -maxsize integer Maximum dataset size in characters -page integer Width of page -status boolean Print progress reports -v boolean Verbose mode -cfive boolean Use 5' to 3' complementary strand as well -cthree boolean Use 3' to 5' complementary strand as well -wthree boolean Use 3' to 5' main strand as well -prob float Starting point confidence level -seed integer Seed for random numbers in sampling -seqfrac float Fraction of sequences to use -[no]align boolean Print aligned motif occurrences -trace boolean Trace starting points -allprint boolean Print all debugging information -wprint boolean Print erasure matrix -zprint boolean Print missing information matrix -llprint boolean Print log likelihood during EM -startsprint boolean Print starting points -fastaprint boolean Print sites in FASTA format -timer integer Timer type Advanced (Unprompted) qualifiers: (none) Associated qualifiers: "-sequence" associated qualifiers -sbegin1 integer Start of each sequence to be used -send1 integer End of each sequence to be used -sreverse1 boolean Reverse (if DNA) -sask1 boolean Ask for begin/end/reverse -snucleotide1 boolean Sequence is nucleotide -sprotein1 boolean Sequence is protein -slower1 boolean Make lower case -supper1 boolean Make upper case -sformat1 string Input sequence format -sdbname1 string Database name -sid1 string Entryname -ufo1 string UFO features -fformat1 string Features format -fopenfile1 string Features file name "-outfile" associated qualifiers -odirectory2 string Output directory General qualifiers: -auto boolean Turn off prompts -stdout boolean Write standard output -filter boolean Read standard input, write standard output -options boolean Prompt for standard and additional values -debug boolean Write debug output to program.dbg -verbose boolean Report some/full command line options -help boolean Report command line options. More information on associated and general qualifiers can be found with -help -verbose -warning boolean Report warnings -error boolean Report errors -fatal boolean Report fatal errors -die boolean Report deaths |
Standard (Mandatory) qualifiers | Allowed values | Default | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
[-sequence] (Parameter 1) |
Sequence database USA | Readable sequence(s) | Required | ||||||||||
-model | Model to use |
|
zoops | ||||||||||
-nmotifs | Maximum number of motifs to find | Any integer value | 1 | ||||||||||
[-outfile] (Parameter 2) |
Output file name | Output file | <sequence>.meme | ||||||||||
Additional (Optional) qualifiers | Allowed values | Default | |||||||||||
-ntype | Method to use |
|
pair | ||||||||||
-protein | Assume sequences are proteins | Boolean value Yes/No | No | ||||||||||
-nucleic | Assume sequences are DNA | Boolean value Yes/No | No | ||||||||||
-palindromes | Allow palindromes | Boolean value Yes/No | No | ||||||||||
-ponly | Force palindromes | Boolean value Yes/No | No | ||||||||||
-[no]shorten | Allow motifs shorter than MINW | Boolean value Yes/No | Yes | ||||||||||
-nsites | Expected number of sites for each motif | Any numeric value | 0. | ||||||||||
-minsites | Minimum number of sites for each motif | Any numeric value | 0. | ||||||||||
-maxsites | Maximum number of sites for each motif | Any numeric value | 0. | ||||||||||
-w | Starting motif width to try | Any integer value | 0 | ||||||||||
-minw | Minimum starting motif width to try | Any integer value | 8 | ||||||||||
-maxw | Maximum starting motif width to try | Any integer value | 57 | ||||||||||
-prior | Prior to use |
|
dirichlet | ||||||||||
-[no]brief | Don't print documemtation | Boolean value Yes/No | Yes | ||||||||||
-b | Strength of the prior | Any numeric value | -1.0 | ||||||||||
-spmap | Mapping start |
|
uni | ||||||||||
-spfuzz | Fuzziness of sequence to theta mapping | Any numeric value | -1.0 | ||||||||||
-maxiter | Maximum EM iterations to run | Any integer value | 50 | ||||||||||
-distance | EM convergence criterion | Any numeric value | 1e-3 | ||||||||||
-cons | Consensus sequence to start EM from | Any string is accepted | An empty string is accepted | ||||||||||
-chi | Cutoff for p-value | Any numeric value | 1.0 | ||||||||||
-adj | Type |
|
root | ||||||||||
-maxsize | Maximum dataset size in characters | Any integer value | 100000 | ||||||||||
-page | Width of page | Any integer value | 80 | ||||||||||
-status | Print progress reports | Boolean value Yes/No | No | ||||||||||
-v | Verbose mode | Boolean value Yes/No | No | ||||||||||
-cfive | Use 5' to 3' complementary strand as well | Boolean value Yes/No | No | ||||||||||
-cthree | Use 3' to 5' complementary strand as well | Boolean value Yes/No | No | ||||||||||
-wthree | Use 3' to 5' main strand as well | Boolean value Yes/No | No | ||||||||||
-prob | Starting point confidence level | Any numeric value | 1.0 | ||||||||||
-seed | Seed for random numbers in sampling | Any integer value | 0 | ||||||||||
-seqfrac | Fraction of sequences to use | Any numeric value | 1.0 | ||||||||||
-[no]align | Print aligned motif occurrences | Boolean value Yes/No | Yes | ||||||||||
-trace | Trace starting points | Boolean value Yes/No | No | ||||||||||
-allprint | Print all debugging information | Boolean value Yes/No | No | ||||||||||
-wprint | Print erasure matrix | Boolean value Yes/No | No | ||||||||||
-zprint | Print missing information matrix | Boolean value Yes/No | No | ||||||||||
-llprint | Print log likelihood during EM | Boolean value Yes/No | No | ||||||||||
-startsprint | Print starting points | Boolean value Yes/No | No | ||||||||||
-fastaprint | Print sites in FASTA format | Boolean value Yes/No | No | ||||||||||
-timer | Timer type | Any integer value | 0 | ||||||||||
Advanced (Unprompted) qualifiers | Allowed values | Default | |||||||||||
(none) |
>CARP_RHICH P06026 RHIZOPUSPEPSIN PRECURSOR (EC 3.4.23.21) MKFTLISSCIAIAALAVAVDAAPGEKKISIPLAKNPNYKPSAKNAIQKAIAKYNKHKINT STGGIVPDAGVGTVPMTDYGNDVEYYGQVTIGTPGKKFNLDFDTGSSDLWIASTLCTNCG SRQTKYDPKQSSTYQADGRTWSISYGDGSSASGILAKDNVNLGGLLIKGQTIELAKREAA SFANGPNDGLLGLGFDTITTVRGVKTPMDNLISQGLISRPIFGVYLGKASNGGGGEYIFG GYDSTKFKGSLTTVPIDNSRGWWGITVDRATVGTSTVASSFDGILDTGTTLLILPNNVAA SVARAYGASDNGDGTYTISCDTSRFKPLVFSINGASFQVSPDSLVFEEYQGQCIAGFGYG NFDFAIIGDTFLKNNYVVFNQGVPEVQIAPVAQ >CARP_YEAST P07267 SACCHAROPEPSIN PRECURSOR (EC 3.4.23.25) (ASPARTATE PROTEASE) MFSLKALLPLALLLVSANQVAAKVHKAKIYKHELSDEMKEVTFEQHLAHLGQKYLTQFEK ANPEVVFSREHPFFTEGGHDVPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSN ECGSLACFLHSKYDHEASSSYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAE ATSEPGLTFAFGKFDGILGLGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTE NGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDEYAELESHGAAIDTGTSLI TLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLEVSGS CISAITPMDFPEPVGPLAIVGDAFLRKYYSIYDLGNNAVGLAKAI >CATD_HUMAN P07339 CATHEPSIN D PRECURSOR (EC 3.4.23.5) MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVEDLIAKGPVSKYSQAVP AVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIH HKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFG EATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQ PGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIVDTGTSL MVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQ AGKTLCLSGFMGMDIPPPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAARL >CHYM_BOVIN P00794 PROCHYMOSIN A AND B PRECURSORS (EC 3.4.23.4) (PREPRORENNIN) MRCLVVLLAVFALSQGTEITRIPLYKGKSLRKALKEHGLLEDFLQKQQYGISSKYSGFGE VASVPLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCKSNACKNHQRFDPRK SSTFQNLGKPLSIHYGTGSMQGILGYDTVTVSNIVDIQQTVGLSTQEPGDVFTYAEFDGI LGMAYPSLASEYSIPVFDNMMNRHLVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHW VPVTVQQYWQFTVDSVTISGVVVACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQ YGEFDIDCDNLSYMPTVVFEINGKMYPLTPSAYTSQDQGFCTSGFQSENHSQKWILGDVF IREYYSVFDRANNLVGLAKAI >PEPA_ASPAW P17946 ASPERGILLOPEPSIN A PRECURSOR (EC 3.4.23.18) MVVFSKTAALVLGLSSAVSAAPAPTRKGFTINQIARPANKTRTINLPGMYARSLAKFGGT VPQSVKEAASKGSAVTTPQNNDEEYLTPVTVGKSTLHLDFDTGSADLWVFSDELPSSEQT GHDLYTPSSSATKLSGYTWDISYGDGSSASGDVYRDTVTVGGVTTNKQAVEAASKISSEF VQNTANDGLLGLAFSSINTVQPKAQTTFFDTVKSQLDSPLFAVQLKHDAPGVYDFGYIDD SKYTGSITYTDADSSQGYWGFSTDGYSIGDGSSSSSGFSAIADTGTTLILLDDEIVSAYY EQVSGASGETEAGGYVFSCSTNPPDFTVVIGDYKAVVPGKYINYAPISTGSSTCFGGIQS NSGLGLSILGDVFLKSQYVVFNSEGPKLGFAAQA |
******************************************************************************** MEME - Motif discovery tool ******************************************************************************** MEME version 2.3.1 (Release date: 2000/11/05 21:47:56) For further information on how to interpret these results or to get a copy of the MEME software please access http://www.sdsc.edu/MEME. This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://www.sdsc.edu/MEME. ******************************************************************************** ******************************************************************************** REFERENCE ******************************************************************************** If you use this program in your research, please cite: Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994. ******************************************************************************** ******************************************************************************** TRAINING SET ******************************************************************************** DATAFILE= ../../data/memepep.dat (deleted by web version of MEME) ALPHABET= ACDEFGHIKLMNPQRSTVWY Sequence name Weight Length Sequence name Weight Length ------------- ------ ------ ------------- ------ ------ CARP_RHICH 1.0000 393 CARP_YEAST 1.0000 405 CATD_HUMAN 1.0000 412 CHYM_BOVIN 1.0000 381 PEPA_ASPAW 1.0000 394 ******************************************************************************** ******************************************************************************** MOTIF 1 width = 8 sites = 5.0 ******************************************************************************** Simplified A 55:2::2: motif letter- C :22::::: probability D :::::::: matrix E :::::::: F :::::::: G :::::::: H ::::332: I 2::2:::: [Part of this file has been deleted for brevity] letter-probability matrix: alength= 20 w= 16 n= 1910 0.345527 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.178967 0.009902 0.010727 0.335557 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.842714 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.176047 0.009749 0.515285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.177394 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.844888 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.005447 0.012194 0.003027 0.841956 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.005447 0.178861 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.176568 0.510727 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.840047 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.342714 0.009749 0.515285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.178221 0.003738 0.009380 0.343082 0.015285 0.003844 0.007680 0.008446 0.006793 0.341983 0.012300 0.009902 0.010727 0.002224 0.005447 0.012194 0.003027 0.008623 0.177057 0.006714 0.011555 0.003738 0.009380 0.176415 0.015285 0.003844 0.174346 0.008446 0.006793 0.175316 0.178967 0.009902 0.010727 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.174346 0.008446 0.173459 0.008650 0.012300 0.009902 0.010727 0.002224 0.505447 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.838780 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.345634 0.176568 0.344060 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.006714 0.011555 0.003738 0.176047 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.677394 0.002224 0.005447 0.012194 0.003027 0.008623 0.010390 0.673381 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.007680 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.172114 0.012194 0.003027 0.508623 0.010390 0.006714 0.011555 0.003738 0.009380 0.009749 0.015285 0.003844 0.341013 0.008446 0.006793 0.008650 0.012300 0.009902 0.010727 0.002224 0.005447 Stopped because nmotifs = 3 reached. ******************************************************************************** DEBUG INFORMATION ******************************************************************************** This information can also be useful in the event you wish to report a problem with the MEME software. model: mod= zoops nmotifs= 3 chi= 1 width: minw= 8 maxw= 57 shorten= yes lambda: minsites= 0 maxsites= 5 theta: prob= 1 spmap= uni spfuzz= 0.5 em: prior= dirichlet b= 1 maxiter= 50 distance= 0.001 data: n= 1985 N= 5 strands: w53 sample: seed= 0 seqfrac= 1 LRT: adj= root Letter frequencies: A 0.074 C 0.014 D 0.061 E 0.039 F 0.048 G 0.100 H 0.013 I 0.058 K 0.052 L 0.084 M 0.014 N 0.040 P 0.048 Q 0.042 R 0.021 S 0.094 T 0.072 V 0.069 W 0.010 Y 0.049 Non-redundant database letter frequencies: A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058 L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064 W 0.013 Y 0.033 Effective length of alphabet = 20 Entropy of dataset (bits) = -4.1 meme -protein ******************************************************************************** |
Program name | Description |
---|---|
antigenic | Finds antigenic sites in proteins |
digest | Protein proteolytic enzyme or reagent cleavage digest |
epestfind | Finds PEST motifs as potential proteolytic cleavage sites |
fuzzpro | Protein pattern search |
fuzztran | Protein pattern search after translation |
helixturnhelix | Report nucleic acid binding motifs |
oddcomp | Find protein sequence regions with a biased composition |
patmatdb | Search a protein sequence with a motif |
patmatmotifs | Search a PROSITE motif database with a protein sequence |
pepcoil | Predicts coiled coil regions |
preg | Regular expression search of a protein sequence |
pscan | Scans proteins using PRINTS |
sigcleave | Reports protein signal cleavage sites |
Although we take every care to ensure that the results of the EMBOSS version are identical to those from the original package, we recommend that you check your inputs give the same results in both versions before publication.
Please report all bugs in the EMBOSS version to the EMBOSS bug team, not to the original author.