Pattern Discovery Syntax
Syntax is
splash InputFile [options] 

Example:

running 

splash histones.fasta 

produces the file histones.fasta_0_209_0_3_8.txt

Description: This process the InputFile to extract all the patterns that satisfy the discovery constraints set by the individual command line options. 

Input: InputFile must be the fully qualified path of a file containing protein or DNA sequences in FASTA format.

Output: Discovered Patterns are stored in in a file whose name, InputFile_A_B_C_D_E.txt, is determined by the discovery parameters in the following way 

A This is the thread Id number. It is always 0, unless the -t option is used. In that case, this value is used to separate the results retuned by the individual processes
B It is the minimum support, in number of occurrences or sequences, for this run. See options -j, -%, and -u
C It is the minimum length of the pattern in tokens. For instance, pattern A..BC.D.E has length 5. If C is 0, then the minimum length is set by the value of k0 for the density constraint. The latter is reported in the D value. See options -l, -k, and -w
D It is the k0 of the density constraint. See options -k and -w
E It is the w0 of the density constraint. See options -k and -w

Options: Several options are available. These control the following aspects of the pattern discovery process:

Constraints
Statistical Constraints
Similarity Matrix 
Parallel Execution
Format
Other

0. SPLASH
1. Algorithm
2. Performance
3. Pattern Discovery

Syntax
DNA/Protein Seq.
Constraints
Statistical Constr.
Similarity Matrix
Parallel Execution
Output Format
Other

4. Exhaustive Discovery

Syntax

5. Hierarchical Discovery

Syntax

6. Search

Syntax

7. References