Standard (Mandatory) qualifiers: [-protpath] dirlist This option specifies the location of the protein CCF files (clean coordinate files) (input). A 'clean cordinate file' contains protein coordinate and derived data for a single PDB file ('protein clean coordinate file') or a single domain from SCOP or CATH ('domain clean coordinate file'), in CCF format (EMBL-like). The files, generated by using PDBPARSE (PDB files) or DOMAINER (domains), contain 'cleaned-up' data that is self-consistent and error-corrected. Records for residue solvent accessibility and secondary structure are added to the file by using PDBPLUS. [-domaindir] directory This option specifies the location of the domain CCF files (clean coordinate files) (input). A 'clean cordinate file' contains protein coordinate and derived data for a single PDB file ('protein clean coordinate file') or a single domain from SCOP or CATH ('domain clean coordinate file'), in CCF format (EMBL-like). The files, generated by using PDBPARSE (PDB files) or DOMAINER (domains), contain 'cleaned-up' data that is self-consistent and error-corrected. Records for residue solvent accessibility and secondary structure are added to the file by using PDBPLUS. -dcffile infile This option specifies the name of the DCF file (domain classification file) (input). A 'domain classification file' contains classification and other data for domains from SCOP or CATH, in DCF format (EMBL-like). The files are generated by using SCOPPARSE and CATHPARSE. Domain sequence information can be added to the file by using DOMAINSEQS. -threshold float This option specifies the threshold contact distance. [-outfile] outfile This option specifies the name of the output file. -logfile outfile This option specifies the name of the log file. Additional (Optional) qualifiers: (none) Advanced (Unprompted) qualifiers: -dicfile datafile This option specifies the dictionary of heterogen groups in PDB. This file is generated by using HETPARSE and is part of the EMBOSS distribution. -vdwfile datafile This option specifies the name of the data file with van der Waals radii for atoms in amino acid residues. This file is part of the EMBOSS distribution. Associated qualifiers: "-outfile" associated qualifiers -odirectory3 string Output directory "-logfile" associated qualifiers -odirectory string Output directory General qualifiers: -auto boolean Turn off prompts -stdout boolean Write standard output -filter boolean Read standard input, write standard output -options boolean Prompt for standard and additional values -debug boolean Write debug output to program.dbg -verbose boolean Report some/full command line options -help boolean Report command line options. More information on associated and general qualifiers can be found with -help -verbose -warning boolean Report warnings -error boolean Report errors -fatal boolean Report fatal errors -die boolean Report deaths
Standard (Mandatory) qualifiers | Allowed values | Default | |
---|---|---|---|
[-protpath] (Parameter 1) |
This option specifies the location of the protein CCF files (clean coordinate files) (input). A 'clean cordinate file' contains protein coordinate and derived data for a single PDB file ('protein clean coordinate file') or a single domain from SCOP or CATH ('domain clean coordinate file'), in CCF format (EMBL-like). The files, generated by using PDBPARSE (PDB files) or DOMAINER (domains), contain 'cleaned-up' data that is self-consistent and error-corrected. Records for residue solvent accessibility and secondary structure are added to the file by using PDBPLUS. | Directory with files | ./ |
[-domaindir] (Parameter 2) |
This option specifies the location of the domain CCF files (clean coordinate files) (input). A 'clean cordinate file' contains protein coordinate and derived data for a single PDB file ('protein clean coordinate file') or a single domain from SCOP or CATH ('domain clean coordinate file'), in CCF format (EMBL-like). The files, generated by using PDBPARSE (PDB files) or DOMAINER (domains), contain 'cleaned-up' data that is self-consistent and error-corrected. Records for residue solvent accessibility and secondary structure are added to the file by using PDBPLUS. | Directory | ./ |
-dcffile | This option specifies the name of the DCF file (domain classification file) (input). A 'domain classification file' contains classification and other data for domains from SCOP or CATH, in DCF format (EMBL-like). The files are generated by using SCOPPARSE and CATHPARSE. Domain sequence information can be added to the file by using DOMAINSEQS. | Input file | Required |
-threshold | This option specifies the threshold contact distance. | Any numeric value | 1.0 |
[-outfile] (Parameter 3) |
This option specifies the name of the output file. | Output file | SITES.con |
-logfile | This option specifies the name of the log file. | Output file | SITES.log |
Additional (Optional) qualifiers | Allowed values | Default | |
(none) | |||
Advanced (Unprompted) qualifiers | Allowed values | Default | |
-dicfile | This option specifies the dictionary of heterogen groups in PDB. This file is generated by using HETPARSE and is part of the EMBOSS distribution. | Data file | Ehet.dat |
-vdwfile | This option specifies the name of the data file with van der Waals radii for atoms in amino acid residues. This file is part of the EMBOSS distribution. | Data file | Evdw.dat |
FILE TYPE | FORMAT | DESCRIPTION | CREATED BY | SEE ALSO |
Clean coordinate file (for protein) | CCF format (EMBL-like). | Protein coordinate and derived data for a single PDB file. The data are 'cleaned-up': self-consistent and error-corrected. | PDBPARSE | Records for residue solvent accessibility and secondary structure are added to the file by using PDBPLUS. |
Clean coordinate file (for domain) | CCF format (EMBL-like). | Protein coordinate and derived data for a single domain from SCOP or CATH. The data are 'cleaned-up': self-consistent and error-corrected. | DOMAINER | Records for residue solvent accessibility and secondary structure are added to the file by using PDBPLUS. |
Contact file (intra-chain residue-residue contacts) | CON format (EMBL-like.) | Intra-chain residue-residue contact data for a protein or a domain from SCOP or CATH. | CONTACTS | N.A. |
Contact file (inter-chain residue-residue contacts) | CON format (EMBL-like.) | Inter-chain residue-residue contact data for a protein or a domain from SCOP or CATH. | INTERFACE | N.A. |
Contact file (residue-ligand contacts) | CON format (EMBL-like.) | Residue-ligand contact data for a protein or a domain from SCOP or CATH. | SITES | N.A. |
van der Waals radii | A file of van der Waals radii for atoms in amino acid residues. Part of the emboss distribution. | N.A. | N.A. | |
Dictionary of heterogen groups | A file of the dictionary of heterogen groups in PDB. | HETPARSE | N.A. |
Program name | Description |
---|---|
aaindexextract | Extract data from AAINDEX |
allversusall | Does an all-versus-all global alignment for each set of sequences in an input directory and writes files of sequence similarity values |
cathparse | Reads raw CATH classification files and writes DCF file (domain classification file) |
cutgextract | Extract data from CUTG |
domainer | Reads CCF files (clean coordinate files) for proteins and writes CCF files for domains, taken from a DCF file (domain classification file) |
domainnr | Removes redundant domains from a DCF file (domain classification file). The file must contain domain sequence information, which can be added by using DOMAINSEQS |
domainseqs | Adds sequence records to a DCF file (domain classification file) |
domainsse | Adds secondary structure records to a DCF file (domain classification file) |
hetparse | Converts raw dictionary of heterogen groups to a file in EMBL-like format |
pdbparse | Parses PDB files and writes CCF files (clean coordinate files) for proteins |
pdbplus | Add residue solvent accessibility and secondary structure data to a CCF file (clean coordinate file) for a protein or domain |
pdbtosp | Convert raw swissprot:PDB equivalence file to EMBL-like format |
printsextract | Extract data from PRINTS |
prosextract | Builds the PROSITE motif database for patmatmotifs to search |
rebaseextract | Extract data from REBASE |
scopparse | Reads raw SCOP classification files and writes a DCF file (domain classification file) |
seqnr | Removes redundancy from DHF files (domain hits files) or other files of sequences |
ssematch | Searches a DCF file (domain classification file) for secondary structure matches |
tfextract | Extract data from TRANSFAC |
Excerpt of log file CCF: 000_testdata_new/sites/in/1cs4.ccf HETS:YES NHETS:7 SCOP:YES NDOMS: 1 CCF: 000_testdata_new/sites/in/1ii7.ccf HETS:YES NHETS:5 SCOP:YES NDOMS: 1 |