A novel promoter prediction method based on multiple sigma factors model for bacteria genomes
 

INPUT & PARAMETER
  • Standard inputs from Refseq of NCBI:
        Ptt file of NC_000913      NC_000913.ptt
        Fna file of NC_000913      NC_000913.fna
  • Standard files of defaut parameters for PWM training:
        Training set sigma promoter sequence file of sigma70      Sigma70.dna
        Sigma promoter feature file of sigma70      Sigma70.dat
  • Standard files of defaut parameters for promoter scanning:
        Position weight matrix file for sigma70      Sigma70.def.tss.par.dat
        Scoring scheme file for sigma70      Sigma70.thr.dat


  • OUTPUT
  • Standard outputs of training step:
        Trained PWM file for Sigma70 promoter at condition 0      Sigma70.0.tss.par.dat.
  • Standard outputs of scanning step:
        Promoter prediction result for NC_000913 genome      NC_000913.sca.dat

  • COMMAND LINE OPTIONS
  • Functional options:
        -I: integrate .ptt and .rnt file into .ano file;
             Necessary inputs: -p, -r and -o;
        -T: scoring schemes calculated by EM algorithm:
             Necessary inputs: -n, -o, -t and -s.
        -S: scanning for sigma factor binding sites
             Necessary inputs: -n, -o, -f, -p (or -a), -w and -h
             Optional inputs: -l, -g, -u and -c.
  • Parameters for inputs:
        Inputs with no default value:
             -n NC number (eg: NC_000913)
             -o output directory
             -f .fna file directory
             -a .ano file directory
             -p .ptt file directory
             -r .rnt file directory
             -h threshold input file directory
             -t Training set sequences directory
             -w parameters of scoring scheme file directory
             -s file that include sigma factor parameters for position weight matrix
             Attention: the file name of these inputs are not required!
        Inputs with default value:
             -l threshold of intergenic distance (default value is 100)
             -g region length for promoter scanning (default value is 250)
             -u the chosen number of PWM calculated by user
             if nothing is inputted, the program will utilize the default PWMs,
             otherwise four inputted numbers should be seperated by blanks (eg: 1 2 5 26)
             -c the sigma promoters excluded from scanning"
             if nothing is inputted, the program will scan for all four sigma promoters,
             otherwise inputted sigma promoters should be seperated by blanks (eg: Sigma38 Sigma32 Sigma24).

  • COMMAND LINE EXAMPLES
  • Standard command for training step:
        For Linux OS:      SigmaPromoter -n NC_000913 -o full length output directory -t ./parameters/training_set/sequences/ -s ./parameters/training_set/parameters/ -T
        For Windows OS:      SigmaPromoter.exe -n NC_000913 -o full length output directory -t ./parameters/training_set/sequences/ -s ./parameters/training_set/parameters/ -T
  • Standard command for scanning step:
        For Linux OS:      SigmaPromoter -n NC_002950 -o full length output directory -f ./RefSeq62/all.fna/ -p ./RefSeq62/all.ptt/ -w ./parameters/WM/ -h ./parameters/threshold/ -c Sigma38 Sigma32 -S
        For Windows OS:      SigmaPromoter.exe -n NC_002950 -o full length output directory -f ./RefSeq62/all.fna/ -p ./RefSeq62/all.ptt/ -w ./parameters/WM/ -h ./parameters/threshold/ -c Sigma38 Sigma32 -S

  • RETURN
    Homepage of SigmaPromoter