Commit 0e854e1e1dab832696362b8210d202475bd32de7

Authored by Erickson Silva
1 parent e1cf2b2f
Exists in master and in 1 other branch devel

Remove arquivos binários (dependências) desnecessarios

aelius-install.tar.gz
No preview for this file type
aelius_install.sh
@@ -1,36 +0,0 @@ @@ -1,36 +0,0 @@
1 -#!/bin/bash  
2 -  
3 -#Instalador do Tradutor Aelius  
4 -  
5 -cd ~/vlibras-core  
6 -  
7 -echo "\n# Extraindo...\n"  
8 -tar -xf aelius-install.tar.gz -C ~/.  
9 -  
10 -echo "# Instalando dependências...\n"  
11 -sudo apt-get install python-dev python-yaml python-numpy python-matplotlib  
12 -  
13 -cd ~/nltk-2.0.1rc1/  
14 -  
15 -sudo python setup.py install  
16 -  
17 -echo "\n# Finalizando...\n"  
18 -  
19 -# path to HunPos binaries  
20 -echo "\nPATH=\"${PATH}:$HOME/Applications/bin\"" >> ~/.bashrc  
21 -echo "export PATH\n" >> ~/.bashrc  
22 -  
23 -# path to Aelius and Translate package  
24 -echo "PYTHONPATH=\"${PYTHONPATH}:$HOME/Applications:$HOME/vlibras-core/tradutor/src/py\"" >> ~/.bashrc  
25 -echo "export PYTHONPATH\n" >> ~/.bashrc  
26 -  
27 -cd ..  
28 -  
29 -sudo mv usr-local-bin.tar.gz /usr/local/bin/  
30 -  
31 -cd /usr/local/bin/  
32 -  
33 -sudo tar -xf usr-local-bin.tar.gz  
34 -sudo rm usr-local-bin.tar.gz  
35 -  
36 -echo "### Instalação finalizada! \n## Execute o seguinte comando para concluir:\n\n$ source ~/.bashrc\n"  
recognize/src/julius/doc/Juliusbook-4.1.5.pdf
No preview for this file type
recognize/src/julius/doc/manuals/accept_check.txt
@@ -1,80 +0,0 @@ @@ -1,80 +0,0 @@
1 - accept_check  
2 -  
3 -ACCEPT_CHECK(1) ACCEPT_CHECK(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - accept_check  
9 - - Check whether a grammar accept / reject given word sequences  
10 -  
11 -SYNOPSIS  
12 - accept_check [-t] [-s spname] [-v] {prefix}  
13 -  
14 -DESCRIPTION  
15 - accept_check is a tool to check whether a sentence can be accepted or  
16 - rejected on a grammar (prefix.dfa and prefix.dict). The sentence should  
17 - be given from standard input. You can do a batch check by preparing all  
18 - test sentence at each line of a text file, and give it as standard  
19 - input of accept_check.  
20 -  
21 - This tool needs .dfa, .dict and .term files. You should convert a  
22 - written grammar file to generate them by mkdfa.pl.  
23 -  
24 - A sentence should be given as space-separated word sequence. It may be  
25 - required to add head / tail silence word like sil, depending on your  
26 - grammar. And should not contain a short-pause word.  
27 -  
28 - When a word belongs to various category in a grammar, accept_check will  
29 - check all the possible sentence patterns, and accept it if any of those  
30 - is acceptable.  
31 -  
32 -OPTIONS  
33 - -t  
34 - Use category name as input instead of word.  
35 -  
36 - -s spname  
37 - Short-pause word name to be skipped. (default: "sp")  
38 -  
39 - -v  
40 - Debug output.  
41 -  
42 -EXAMPLES  
43 - An output for "date" grammar:  
44 -  
45 - % echo '<s> NEXT SUNDAY </s>' | accept_check date  
46 - Reading in dictionary...  
47 - 143 words...done  
48 - Reading in DFA grammar...done  
49 - Mapping dict item <-> DFA terminal (category)...done  
50 - Reading in term file (optional)...done  
51 - 27 categories, 143 words  
52 - DFA has 35 nodes and 71 arcs  
53 - -----  
54 - wseq: <s> NEXT SUNDAY </s>  
55 - cate: NS_B (NEXT|NEXT) (DAYOFWEEK|DAYOFWEEK|DAY|DAY) NS_E  
56 - accepted  
57 -  
58 -  
59 -SEE ALSO  
60 - mkdfa.pl ( 1 ) ,  
61 - generate ( 1 ) ,  
62 - nextword ( 1 )  
63 -  
64 -COPYRIGHT  
65 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
66 -  
67 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
68 -  
69 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
70 - Technology  
71 -  
72 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
73 - Technology  
74 -  
75 -LICENSE  
76 - The same as Julius.  
77 -  
78 -  
79 -  
80 - 10/02/2008 ACCEPT_CHECK(1)  
recognize/src/julius/doc/manuals/adinrec.txt
@@ -1,130 +0,0 @@ @@ -1,130 +0,0 @@
1 - adinrec  
2 -  
3 -ADINREC(1) ADINREC(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - adinrec  
9 - - record audio device and save one utterance to a file  
10 -  
11 -SYNOPSIS  
12 - adinrec [options...] {filename}  
13 -  
14 -DESCRIPTION  
15 - adinrec opens an audio stream, detects an utterance input and store it  
16 - to a specified file. The utterance detection is done by level and  
17 - zero-cross thresholds. Default input device is microphone, but other  
18 - audio input source, including Julius A/D-in plugin, can be used by  
19 - using "-input" option.  
20 -  
21 - The audio format is 16 bit, 1 channel, in Microsoft WAV format. If the  
22 - given filename already exists, it will be overridden.  
23 -  
24 - If filename is "-" , the captured data will be streamed into standard  
25 - out, with no header (raw format).  
26 -  
27 -OPTIONS  
28 - adinrec uses JuliusLib and adopts Julius options. Below is a list of  
29 - valid options.  
30 -  
31 - adinrec specific options  
32 - -freq Hz  
33 - Set sampling rate in Hz. (default: 16,000)  
34 -  
35 - -raw  
36 - Output in raw file format.  
37 -  
38 - JuliusLib options  
39 - -input {mic|rawfile|adinnet|stdin|netaudio|esd|alsa|oss}  
40 - Choose speech input source. Specify 'file' or 'rawfile' for waveform  
41 - file. On file input, users will be prompted to enter the file name  
42 - from stdin.  
43 -  
44 - 'mic' is to get audio input from a default live microphone device,  
45 - and 'adinnet' means receiving waveform data via tcpip network from  
46 - an adinnet client. 'netaudio' is from DatLink/NetAudio input, and  
47 - 'stdin' means data input from standard input.  
48 -  
49 - At Linux, you can choose API at run time by specifying alsa, oss and  
50 - esd.  
51 -  
52 - -lv thres  
53 - Level threshold for speech input detection. Values should be in  
54 - range from 0 to 32767. (default: 2000)  
55 -  
56 - -zc thres  
57 - Zero crossing threshold per second. Only input that goes over the  
58 - level threshold (-lv) will be counted. (default: 60)  
59 -  
60 - -headmargin msec  
61 - Silence margin at the start of speech segment in milliseconds.  
62 - (default: 300)  
63 -  
64 - -tailmargin msec  
65 - Silence margin at the end of speech segment in milliseconds.  
66 - (default: 400)  
67 -  
68 - -zmean  
69 - This option enables DC offset removal.  
70 -  
71 - -smpFreq Hz  
72 - Set sampling rate in Hz. (default: 16,000)  
73 -  
74 - -48  
75 - Record input with 48kHz sampling, and down-sample it to 16kHz  
76 - on-the-fly. This option is valid for 16kHz model only. The  
77 - down-sampling routine was ported from sptk. (Rev. 4.0)  
78 -  
79 - -NA devicename  
80 - Host name for DatLink server input (-input netaudio).  
81 -  
82 - -adport port_number  
83 - With -input adinnet, specify adinnet port number to listen.  
84 - (default: 5530)  
85 -  
86 - -nostrip  
87 - Julius by default removes successive zero samples in input speech  
88 - data. This option stop it.  
89 -  
90 - -C jconffile  
91 - Load a jconf file at here. The content of the jconffile will be  
92 - expanded at this point.  
93 -  
94 - -plugindir dirlist  
95 - Specify which directories to load plugin. If several direcotries  
96 - exist, specify them by colon-separated list.  
97 -  
98 -ENVIRONMENT VARIABLES  
99 - ALSADEV  
100 - Device name string for ALSA. (default: "default")  
101 -  
102 - AUDIODEV  
103 - Device name string for OSS. (default: "/dev/dsp")  
104 -  
105 - LATENCY_MSEC  
106 - Input latency of microphone input in milliseconds. Smaller value  
107 - will shorten latency but sometimes make process unstable. Default  
108 - value will depend on the running OS.  
109 -  
110 -SEE ALSO  
111 - julius ( 1 ) ,  
112 - adintool ( 1 )  
113 -  
114 -COPYRIGHT  
115 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
116 -  
117 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
118 -  
119 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
120 - Technology  
121 -  
122 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
123 - Technology  
124 -  
125 -LICENSE  
126 - The same as Julius.  
127 -  
128 -  
129 -  
130 - 10/02/2008 ADINREC(1)  
recognize/src/julius/doc/manuals/adintool.txt
@@ -1,214 +0,0 @@ @@ -1,214 +0,0 @@
1 - adintool  
2 -  
3 -ADINTOOL(1) ADINTOOL(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - adintool  
9 - - a tool to record / split / send / receive audio streams  
10 -  
11 -SYNOPSIS  
12 - adintool {-in inputdev} {-out outputdev} [options...]  
13 -  
14 -DESCRIPTION  
15 - adintool analyzes speech input, finds speech segments skipping silence,  
16 - and records the detected segments in various ways. It performs speech  
17 - detection based on zerocross number and power (level), and records the  
18 - detected parts to files or other output devices sucessively.  
19 -  
20 -  
21 - adintool is a upper version of adinrec with various functions.  
22 - Supported input device are: microphone input, a speech file, standard  
23 - tty input, and network socket (called adin-net server mode). Julius  
24 - plugin can be also used. Detected speech segments will be saved to  
25 - output devices: speech files, standard tty output, and network socket  
26 - (called adin-net client mode). For example, you can split the incoming  
27 - speech to segments and send them to Julius to be recognized.  
28 -  
29 - Output format is WAV, 16bit (signed short), monoral. If the file  
30 - already exist, it will be overridden.  
31 -  
32 -OPTIONS  
33 - All Julius options can be set. Only audio input related options are  
34 - treated and others are silently skipped. Below is a list of options.  
35 -  
36 - adintool specific options  
37 - -freq Hz  
38 - Set sampling rate in Hz. (default: 16,000)  
39 -  
40 - -in inputdev  
41 - Audio input device. "mic" to capture via microphone input, "file"  
42 - for audio file input, and "stdin" to read raw data from  
43 - standard-input. For file input, file name prompt will appear after  
44 - startup. Use "adinnet" to make adintool as "adinnet server",  
45 - receiving data from client via network socket. Default port number  
46 - is 5530, which can be altered by option "-inport".  
47 -  
48 - Alternatively, input device can be set by "-input" option, in which  
49 - case you can use plugin input.  
50 -  
51 - -out outputdev  
52 - Audio output device store the data. Specify "file" to save to file,  
53 - in which the output filename should be given by "-filename". Use  
54 - "stdout" to standard out. "adinnet" will make adintool to be an  
55 - adinnet client, sending speech data to a server via tcp/ip socket.  
56 - When using "adinnet" output, the server name to send data should be  
57 - specified by "-server". The default port number is 5530, which can  
58 - be changed by "-port" option.  
59 -  
60 - -inport num  
61 - When adintool becomes adinnet server to receive data (-in adinnet),  
62 - set the port number to listen. (default: 5530)  
63 -  
64 - -server [host] [,host...]  
65 - When output to adinnet server (-out adinnet), set the hostname. You  
66 - can send to multiple hosts by specifying their hostnames as  
67 - comma-delimited list like "host1,host2,host3".  
68 -  
69 - -port [num] [,num...]  
70 - When adintool send a data to adinnet server (-out adinnet), set the  
71 - port number to connect. (default: 5530) For multiple servers,  
72 - specify port numbers for all servers like "5530,5530,5531".  
73 -  
74 - -filename file  
75 - When output to file (-out file), set the output filename. The actual  
76 - file name will be as "file.0000.wav" , "file.0001.wav" and so on,  
77 - where the four digit number increases as speech segment detected.  
78 - The initial number will be set to 0 by default, which can be changed  
79 - by "-startid" option. When using "-oneshot" option to save only the  
80 - first segment, the input will be saved as "file".  
81 -  
82 - -startid number  
83 - At file output, set the initial file number. (default: 0)  
84 -  
85 - -oneshot  
86 - Exit after the end of first speech segment.  
87 -  
88 - -nosegment  
89 - Do not perform speech detection for input, just treat all the input  
90 - as a single valid segment.  
91 -  
92 - -raw  
93 - Output as RAW file (no header).  
94 -  
95 - -autopause  
96 - When output to adinnet server, adintool enter pause state at every  
97 - end of speech segment. It will restart when the destination adinnet  
98 - server sends it a resume signal.  
99 -  
100 - -loosesync  
101 - When output to multiple adinnet server, not to do strict  
102 - synchronization for restart. By default, when adintool has entered  
103 - pause state, it will not restart until resume commands are received  
104 - from all servers. This option will allow restart at least one  
105 - restart command has arrived.  
106 -  
107 - -rewind msec  
108 - When input is a live microphone device, and there has been some  
109 - continuing input at the moment adintool resumes, it start recording  
110 - backtracking by the specified milliseconds.  
111 -  
112 - Concerning Julius options  
113 - -input {mic|rawfile|adinnet|stdin|netaudio|esd|alsa|oss}  
114 - Choose speech input source. Specify 'file' or 'rawfile' for waveform  
115 - file. On file input, users will be prompted to enter the file name  
116 - from stdin.  
117 -  
118 - 'mic' is to get audio input from a default live microphone device,  
119 - and 'adinnet' means receiving waveform data via tcpip network from  
120 - an adinnet client. 'netaudio' is from DatLink/NetAudio input, and  
121 - 'stdin' means data input from standard input.  
122 -  
123 - At Linux, you can choose API at run time by specifying alsa, oss and  
124 - esd.  
125 -  
126 - -lv thres  
127 - Level threshold for speech input detection. Values should be in  
128 - range from 0 to 32767. (default: 2000)  
129 -  
130 - -zc thres  
131 - Zero crossing threshold per second. Only input that goes over the  
132 - level threshold (-lv) will be counted. (default: 60)  
133 -  
134 - -headmargin msec  
135 - Silence margin at the start of speech segment in milliseconds.  
136 - (default: 300)  
137 -  
138 - -tailmargin msec  
139 - Silence margin at the end of speech segment in milliseconds.  
140 - (default: 400)  
141 -  
142 - -zmean  
143 - This option enables DC offset removal.  
144 -  
145 - -smpFreq Hz  
146 - Set sampling rate in Hz. (default: 16,000)  
147 -  
148 - -48  
149 - Record input with 48kHz sampling, and down-sample it to 16kHz  
150 - on-the-fly. This option is valid for 16kHz model only. The  
151 - down-sampling routine was ported from sptk. (Rev. 4.0)  
152 -  
153 - -NA devicename  
154 - Host name for DatLink server input (-input netaudio).  
155 -  
156 - -adport port_number  
157 - With -input adinnet, specify adinnet port number to listen.  
158 - (default: 5530)  
159 -  
160 - -nostrip  
161 - Julius by default removes successive zero samples in input speech  
162 - data. This option stop it.  
163 -  
164 - -C jconffile  
165 - Load a jconf file at here. The content of the jconffile will be  
166 - expanded at this point.  
167 -  
168 - -plugindir dirlist  
169 - Specify which directories to load plugin. If several direcotries  
170 - exist, specify them by colon-separated list.  
171 -  
172 -ENVIRONMENT VARIABLES  
173 - ALSADEV  
174 - (using mic input with alsa device) specify a capture device name. If  
175 - not specified, "default" will be used.  
176 -  
177 - AUDIODEV  
178 - (using mic input with oss device) specify a capture device path. If  
179 - not specified, "/dev/dsp" will be used.  
180 -  
181 - LATENCY_MSEC  
182 - Try to set input latency of microphone input in milliseconds.  
183 - Smaller value will shorten latency but sometimes make process  
184 - unstable. Default value will depend on the running OS.  
185 -  
186 -EXAMPLES  
187 - Record microphone input to files: "data.0000.wav", "data.0001.wav" and  
188 - so on:  
189 - Split a long speech file "foobar.raw" into "foobar.1500.wav",  
190 - "foobar.1501.wav" ...:  
191 - Copy an entire audio file via network socket.  
192 - Detect speech segment, send to Julius via network and recognize it:  
193 -  
194 -SEE ALSO  
195 - julius ( 1 ) ,  
196 - adinrec ( 1 )  
197 -  
198 -COPYRIGHT  
199 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
200 -  
201 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
202 -  
203 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
204 - Technology  
205 -  
206 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
207 - Technology  
208 -  
209 -LICENSE  
210 - The same as Julius.  
211 -  
212 -  
213 -  
214 - 10/02/2008 ADINTOOL(1)  
recognize/src/julius/doc/manuals/dfa_determinize.txt
@@ -1,51 +0,0 @@ @@ -1,51 +0,0 @@
1 - dfa_determinize  
2 -  
3 -DFA_DETERMINIZE(1) DFA_DETERMINIZE(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - dfa_determinize  
9 - - Determinize NFA grammar network.  
10 -  
11 -SYNOPSIS  
12 - dfa_determinize [-o outfile] {dfafile}  
13 -  
14 -DESCRIPTION  
15 - dfa_determinize converts a non-deterministic .dfa file into  
16 - deterministic DFA. Output to standard output, or file specified by "-o"  
17 - option.  
18 -  
19 - This additional tool is not necessary on a grammar building procedure  
20 - in Julius, since the grammar network generated by mkdfa.pl is always  
21 - determinized.  
22 -  
23 -OPTIONS  
24 - -o outfile  
25 - Outout file. If not specified, output to stdout.  
26 -  
27 -EXAMPLES  
28 - Determinize foo.dfa to bar.dfa:  
29 - Another way:  
30 -  
31 -SEE ALSO  
32 - mkdfa.pl ( 1 ) ,  
33 - dfa_minimize ( 1 )  
34 -  
35 -COPYRIGHT  
36 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
37 -  
38 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
39 -  
40 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
41 - Technology  
42 -  
43 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
44 - Technology  
45 -  
46 -LICENSE  
47 - The same as Julius.  
48 -  
49 -  
50 -  
51 - 10/02/2008 DFA_DETERMINIZE(1)  
recognize/src/julius/doc/manuals/dfa_minimize.txt
@@ -1,49 +0,0 @@ @@ -1,49 +0,0 @@
1 - dfa_minimize  
2 -  
3 -DFA_MINIMIZE(1) DFA_MINIMIZE(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - dfa_minimize  
9 - - Minimize a DFA grammar network  
10 -  
11 -SYNOPSIS  
12 - dfa_minimize [-o outfile] {dfafile}  
13 -  
14 -DESCRIPTION  
15 - dfa_minimize will convert an .dfa file to an equivalent minimal form.  
16 - Output to standard output, or to a file specified by "-o" option.  
17 -  
18 - On version 3.5.3 and later, mkdfa.pl invokes this tool inside, and the  
19 - output .dfa file will be always minimized, so you do not need to use  
20 - this manually.  
21 -  
22 -OPTIONS  
23 - -o outfile  
24 - Output file. If not specified output to standard output.  
25 -  
26 -EXAMPLES  
27 - Minimize foo.dfa to bar.dfa:  
28 - Another way:  
29 -  
30 -SEE ALSO  
31 - mkdfa.pl ( 1 )  
32 -  
33 -COPYRIGHT  
34 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
35 -  
36 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
37 -  
38 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
39 - Technology  
40 -  
41 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
42 - Technology  
43 -  
44 -LICENSE  
45 - The same as Julius.  
46 -  
47 -  
48 -  
49 - 10/02/2008 DFA_MINIMIZE(1)  
recognize/src/julius/doc/manuals/generate-ngram.txt
@@ -1,62 +0,0 @@ @@ -1,62 +0,0 @@
1 - generate-ngram  
2 -  
3 -GENERATE-NGRAM(1) GENERATE-NGRAM(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - generate-ngram  
9 - - random sentence generator from N-gram  
10 -  
11 -SYNOPSIS  
12 - generate-ngram [options...] {binary_ngram}  
13 -  
14 -DESCRIPTION  
15 - generate-ngram is a tool to generate sentences randomly according to  
16 - the given N-gram language model. The N-gram model file binary_ngram  
17 - should be an binary format.  
18 -  
19 -OPTIONS  
20 - -n num  
21 - Number of sentences to generate (default: 10)  
22 -  
23 - -N  
24 - Specify which length of N-gram to use (default: available max in the  
25 - given model)  
26 -  
27 - -bos  
28 - Beginning-of-sentence word (default: "<s>")  
29 -  
30 - -eos  
31 - End-of-sentence word (default: "</s>")  
32 -  
33 - -ignore  
34 - Specify a word to be supressed from output (default: "<UNK")  
35 -  
36 - -v  
37 - Verbose output.  
38 -  
39 - -debug  
40 - Debug output.  
41 -  
42 -SEE ALSO  
43 - julius ( 1 ) ,  
44 - mkbingram ( 1 )  
45 -  
46 -COPYRIGHT  
47 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
48 -  
49 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
50 -  
51 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
52 - Technology  
53 -  
54 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
55 - Technology  
56 -  
57 -LICENSE  
58 - The same as Julius.  
59 -  
60 -  
61 -  
62 - 10/02/2008 GENERATE-NGRAM(1)  
recognize/src/julius/doc/manuals/generate.txt
@@ -1,76 +0,0 @@ @@ -1,76 +0,0 @@
1 - generate  
2 -  
3 -GENERATE(1) GENERATE(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - generate  
9 - - random sentence generator from a grammar  
10 -  
11 -SYNOPSIS  
12 - generate [-v] [-t] [-n num] [-s spname] {prefix}  
13 -  
14 -DESCRIPTION  
15 - This small program randomly generates sentences that are acceptable by  
16 - the given grammar.  
17 -  
18 -  
19 - .dfa, .dict and .term files are needed to execute. They can be  
20 - generated from .grammar and .voca file by mkdfa.pl.  
21 -  
22 -OPTIONS  
23 - -t  
24 - Output in word's category name.  
25 -  
26 - -n num  
27 - Set number of sentences to be generated (default: 10)  
28 -  
29 - -s spname  
30 - the name string of short-pause word to be supressed (default: "sp")  
31 -  
32 - -v  
33 - Debug output mode.  
34 -  
35 -EXAMPLES  
36 - Exmple output of a sample grammar "fruit":  
37 -  
38 - % generate fruit  
39 - Stat: init_voca: read 36 words  
40 - Reading in term file (optional)...done  
41 - 15 categories, 36 words  
42 - DFA has 26 nodes and 42 arcs  
43 - -----  
44 - <s> I WANT ONE APPLE </s>  
45 - <s> I WANT TEN PEARS </s>  
46 - <s> CAN I HAVE A PINEAPPLE </s>  
47 - <s> I WANT ONE PEAR </s>  
48 - <s> COULD I HAVE A BANANA </s>  
49 - <s> I WANT ONE APPLE PLEASE </s>  
50 - <s> I WANT NINE APPLES </s>  
51 - <s> NINE APPLES </s>  
52 - <s> I WANT ONE PINEAPPLE </s>  
53 - <s> I WANT A PEAR </s>  
54 -  
55 -  
56 -SEE ALSO  
57 - mkdfa.pl ( 1 ) ,  
58 - generate-ngram ( 1 )  
59 -  
60 -COPYRIGHT  
61 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
62 -  
63 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
64 -  
65 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
66 - Technology  
67 -  
68 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
69 - Technology  
70 -  
71 -LICENSE  
72 - The same as Julius.  
73 -  
74 -  
75 -  
76 - 10/02/2008 GENERATE(1)  
recognize/src/julius/doc/manuals/gram2sapixml.pl.txt
@@ -1,47 +0,0 @@ @@ -1,47 +0,0 @@
1 - gram2sapixml.pl  
2 -  
3 -GRAM2SAPIXML.PL(1) GRAM2SAPIXML.PL(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - gram2sapixml.pl  
9 - - convert Julius grammar to SAPI XML grammar format  
10 -  
11 -SYNOPSIS  
12 - gram2sapixml.pl [prefix...]  
13 -  
14 -DESCRIPTION  
15 - gram2sapixml.pl converts a recognition grammar file of Julius  
16 - (.grammar, .voca) to Microsoft SAPI XML grammar format. prefix should  
17 - be a file name of target grammar, excluding suffixes. If multiple  
18 - argument is given, each will be process sequencialy in turn.  
19 -  
20 - The internal character set should be in UTF-8 format. By default  
21 - gram2sapixml.pl assume input in EUC-JP encoding and tries to convert it  
22 - to UTF-8 using iconv. You may want to disable this feature within the  
23 - script.  
24 -  
25 - It will fail to convert a left recursive rule in the grammar. When  
26 - fails, it will leave the source rules in the target .xml file, so you  
27 - should modify the output manually to solve it.  
28 -  
29 -SEE ALSO  
30 - mkdfa.pl ( 1 )  
31 -  
32 -DIAGNOSTICS  
33 - The conversion procedure is somewhat dumb one, only converting the  
34 - non-terminal symbols and terminal symbols (=word category name) into  
35 - corresponding rules one by one. This is only a help tool, and you will  
36 - need a manual inspection and editing to use it on a real SAPI  
37 - application.  
38 -  
39 -COPYRIGHT  
40 - Copyright (c) 2002 Takashi Sumiyoshi  
41 -  
42 -LICENSE  
43 - The same as Julius.  
44 -  
45 -  
46 -  
47 - 10/02/2008 GRAM2SAPIXML.PL(1)  
recognize/src/julius/doc/manuals/jclient.pl.txt
@@ -1,46 +0,0 @@ @@ -1,46 +0,0 @@
1 - jclient.pl  
2 -  
3 -JCLIENT.PL(1) JCLIENT.PL(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - jclient.pl  
9 - - sample client for module mode (perl version)  
10 -  
11 -SYNOPSIS  
12 - jclient.pl  
13 -  
14 -DESCRIPTION  
15 - This is yet another sample client written in perl. It will connect to  
16 - Julius running in module mode, receive recognition results from Julius,  
17 - and cna send commands to control Julius.  
18 -  
19 - This is a tiny program with only 57 lines. You can use it for free.  
20 -  
21 -EXAMPLES  
22 - Invoke Julius with module mode by specifying "-module" option:  
23 - Then, at other terminal or other host, invoke jclient.pl like below.  
24 - The default hostname is "localhost", and port number is 10500. You can  
25 - change them by editing the top part of the script.  
26 - It will then receive the outputs of Julius and output the raw message  
27 - to standard out. Also, by inputting a raw module command to the  
28 - standard input of jclient.pl, it will be sent to Julius. See manuals  
29 - for the specification of module mode.  
30 -  
31 -SEE ALSO  
32 - julius ( 1 ) ,  
33 - jcontrol ( 1 )  
34 -  
35 -COPYRIGHT  
36 - "jclient.pl" has been developed by Dr. Ryuichi Nisimura  
37 - (nisimura@sys.wakayama-u.ac.jp). Use at your own risk.  
38 -  
39 - If you have any feedback, comment or request, please contact the E-mail  
40 - address above, or look at the Web page below.  
41 -  
42 - http://w3voice.jp/  
43 -  
44 -  
45 -  
46 - 10/02/2008 JCLIENT.PL(1)  
recognize/src/julius/doc/manuals/jcontrol.txt
@@ -1,173 +0,0 @@ @@ -1,173 +0,0 @@
1 - jcontrol  
2 -  
3 -JCONTROL(1) JCONTROL(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - jcontrol  
9 - - a sample module client written in C  
10 -  
11 -SYNOPSIS  
12 - jcontrol {hostname} [portnum]  
13 -  
14 -DESCRIPTION  
15 - jcontrol is a simple console program to control julius running on other  
16 - host via network API. It can send command to Julius, and receive  
17 - messages from Julius.  
18 -  
19 - When invoked, jcontrol tries to connect to Julius running in "module  
20 - mode" on specified hostname. After connection established, jcontrol  
21 - waits for user commands from standard input.  
22 -  
23 - When user types a command to jcontrol, it will be interpreted and cor-  
24 - responding API command will be sent to Julius. When a message is  
25 - received from Julius, its content will be output to standard output.  
26 -  
27 - For the details about the API, see the related documents.  
28 -  
29 -OPTIONS  
30 - hostname  
31 - Host name where Julius is runnning in module mode.  
32 -  
33 - portnum  
34 - port number (default: 10500)  
35 -  
36 -COMMANDS  
37 - jcontrol interprets commands from standard input. Below is a list of  
38 - all commands.  
39 -  
40 - Engine control  
41 - pause  
42 - Stop Julius and enter into paused status. In paused status, Julius  
43 - will not run recognition even if speech input occurs. When this  
44 - command is issued while recognition is running, Julius will stop  
45 - after the recognition has been finished.  
46 -  
47 - terminate  
48 - Same as pause, but discard the current speech input when received  
49 - command in the middle of recognition process.  
50 -  
51 - resume  
52 - Restart Julius that has been paused or terminated.  
53 -  
54 - inputparam arg  
55 - Tell Julius how to deal with speech input in case grammar is changed  
56 - just when recognition is running. Specify one: "TERMINATE", "PAUSE"  
57 - or "WAIT".  
58 -  
59 - version  
60 - Tell Julius to send version description string.  
61 -  
62 - status  
63 - Tell Julius to send the system status (active / sleep)  
64 -  
65 - Grammar handling  
66 - changegram prefix  
67 - Send a new grammar "prefix.dfa" and "prefix.dict", and tell julius  
68 - to use it as a new grammar. All the current grammars used in the  
69 - current process of Julius will be deleted and replaced to the  
70 - specifed grammar.  
71 -  
72 - addgram prefix  
73 - Send a new grammar "prefix.dfa" and "prefix.dict" and add it to the  
74 - current grammar.  
75 -  
76 - deletegram gramlist  
77 - Tell Julius to delete existing grammar. The grammar can be specified  
78 - by either prefix name or number ID. The number ID can be determined  
79 - from the message sent from Julius at each time grammar information  
80 - has changed. When want to delete more than one grammar, specify all  
81 - of them as comma-sparated.  
82 -  
83 - deactivategram gramlist  
84 - Tell Julius to de-activate a specified grammar. The specified  
85 - grammar will still be kept but will not be used for recognition.  
86 -  
87 - The target grammar can be specified by either prefix name or number  
88 - ID. The number ID can be determined from the message sent from  
89 - Julius at each time grammar information has changed. When want to  
90 - delete more than one grammar, specify all of them as comma-sparated.  
91 -  
92 - activategram gramlist  
93 - Tell Julius to activate previously de-activated grammar. The target  
94 - grammar can be specified by either prefix name or number ID. The  
95 - number ID can be determined from the message sent from Julius at  
96 - each time grammar information has changed. When want to delete more  
97 - than one grammar, specify all of them as comma-sparated.  
98 -  
99 - addword grammar_name_or_id dictfile  
100 - Add the recognition word entries in the specified dictfile to the  
101 - specified grammar on current process.  
102 -  
103 - syncgram  
104 - Force synchronize grammar status, like unix command "sync".  
105 -  
106 - Process management  
107 - Julius-4 supports multi-model recognition nad multi decoding. In this  
108 - case it is possible to control each recognition process, as defined by  
109 - "-SR" option, from module client.  
110 -  
111 - In multi decoding mode, the module client holds "current process", and  
112 - the process commands and grammar related commands will be issued toward  
113 - the current process.  
114 -  
115 - listprocess  
116 - Tell Julius to send the list of existing recognition process.  
117 -  
118 - currentprocess procname  
119 - Switch the current process to the process specified by the name.  
120 -  
121 - shiftprocess  
122 - Rotate the current process. At each call the current process will be  
123 - changed to the next one.  
124 -  
125 - addprocess jconffile  
126 - Tell Julisu to load a new recognition process into engine. The  
127 - argument jconffile should be a jconf file that contains only one set  
128 - of LM options and one SR definition. Note that the file should be  
129 - visible on the running Julius, since jcontrol only send the path  
130 - name and Julius actually read the jconf file.  
131 -  
132 - The new LM and SR process will have the name of the jconffile.  
133 -  
134 - delprocess procname  
135 - Delete the specified recognition process from the engine.  
136 -  
137 - deactivateprocess procname  
138 - Tell Julius to temporary stop the specified recognition process. The  
139 - stopped process will not be executed for the input until activated  
140 - again.  
141 -  
142 - activateprocess procname  
143 - Tell Julius to activate the temporarily stopped process.  
144 -  
145 -EXAMPLES  
146 - The dump messages from Julius are output to tty with prefix ">"  
147 - appended to each line. Julius can be started in module mode like this:  
148 - jcontrolcan be launched with the host name:  
149 - It will then receive the outputs of Julius and output the raw message  
150 - to standard out. Also, by inputting the commands above to the standard  
151 - input of jcontrol, it will be sent to Julius. See manuals for the  
152 - specification of module mode.  
153 -  
154 -SEE ALSO  
155 - julius ( 1 )  
156 -  
157 -COPYRIGHT  
158 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
159 -  
160 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
161 -  
162 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
163 - Technology  
164 -  
165 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
166 - Technology  
167 -  
168 -LICENSE  
169 - The same as Julius.  
170 -  
171 -  
172 -  
173 - 10/02/2008 JCONTROL(1)  
recognize/src/julius/doc/manuals/julius.txt
@@ -1,1287 +0,0 @@ @@ -1,1287 +0,0 @@
1 - julius  
2 -  
3 -JULIUS(1) JULIUS(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - julius  
9 - - open source multi-purpose LVCSR engine  
10 -  
11 -SYNOPSIS  
12 - julius [-C jconffile] [options...]  
13 -  
14 -DESCRIPTION  
15 - julius is a high-performance, multi-purpose, open-source speech  
16 - recognition engine for researchers and developers. It is capable of  
17 - performing almost real-time recognition of continuous speech with over  
18 - 60k-word 3-gram language model and triphone HMM model, on most current  
19 - PCs. julius can perform recognition on audio files, live microphone  
20 - input, network input and feature parameter files.  
21 -  
22 - The core recognition module is implemented as C library called  
23 - "JuliusLib". It can also be extended by plug-in facility.  
24 -  
25 - Supported Models  
26 - julius needs a language model and an acoustic model to run as a speech  
27 - recognizer. julius supports the following models.  
28 -  
29 - Acoustic model  
30 - Sub-word HMM (Hidden Markov Model) in HTK ascii format are  
31 - supported. Phoneme models (monophone), context dependent phoneme  
32 - models (triphone), tied-mixture and phonetic tied-mixture models  
33 - of any unit can be used. When using context dependent models,  
34 - inter-word context dependency is also handled. Multi-stream  
35 - feature and MSD-HMM is also supported. You can further use a  
36 - tool mkbinhmm to convert the ascii HMM file to a compact binary  
37 - format for faster loading.  
38 -  
39 - Note that julius itself can only extract MFCC features from  
40 - speech data. If you use acoustic HMM trained for other feature,  
41 - you should give the input in HTK parameter file of the same  
42 - feature type.  
43 -  
44 - Language model: word N-gram  
45 - Word N-gram language model, up to 10-gram, is supported. Julius  
46 - uses different N-gram for each pass: left-to-right 2-gram on 1st  
47 - pass, and right-to-left N-gram on 2nd pass. It is recommended to  
48 - use both LR 2-gram and RL N-gram for Julius. However, you can  
49 - use only single LR N-gram or RL N-gram. In such case,  
50 - approximated LR 2-gram computed from the given N-gram will be  
51 - applied at the first pass.  
52 -  
53 - The Standard ARPA format is supported. In addition, a binary  
54 - format is also supported for efficiency. The tool mkbingram(1)  
55 - can convert ARPA format N-gram to binary format.  
56 -  
57 - Language model: grammar  
58 - The grammar format is an original one, and tools to create a  
59 - recognirion grammar are included in the distribution. A grammar  
60 - consists of two files: one is a 'grammar' file that describes  
61 - sentence structures in a BNF style, using word 'category' name  
62 - as terminate symbols. Another is a 'voca' file that defines  
63 - words with its pronunciations (i.e. phoneme sequences) for each  
64 - category. They should be converted by mkdfa.pl(1) to a  
65 - deterministic finite automaton file (.dfa) and a dictionary file  
66 - (.dict), respectively. You can also use multiple grammars.  
67 -  
68 - Language model: isolated word  
69 - You can perform isolated word recognition using only word  
70 - dictionary. With this model type, Julius will perform rapid one  
71 - pass recognition with static context handling. Silence models  
72 - will be added at both head and tail of each word. You can also  
73 - use multiple dictionaries in a process.  
74 -  
75 - Search Algorithm  
76 - Recognition algorithm of julius is based on a two-pass strategy. Word  
77 - 2-gram and reverse word 3-gram is used on the respective passes. The  
78 - entire input is processed on the first pass, and again the final  
79 - searching process is performed again for the input, using the result of  
80 - the first pass to narrow the search space. Specifically, the  
81 - recognition algorithm is based on a tree-trellis heuristic search  
82 - combined with left-to-right frame-synchronous beam search and  
83 - right-to-left stack decoding search.  
84 -  
85 - When using context dependent phones (triphones), interword contexts are  
86 - taken into consideration. For tied-mixture and phonetic tied-mixture  
87 - models, high-speed acoustic likelihood calculation is possible using  
88 - gaussian pruning.  
89 -  
90 - For more details, see the related documents.  
91 -  
92 -OPTIONS  
93 - These options specify the models, system behaviors and various search  
94 - parameters to Julius. These option can be set at the command line, but  
95 - it is recommended that you write them in a text file as a "jconf file",  
96 - and specify it by "-C" option.  
97 -  
98 - Applications incorporating JuliusLib also use these options to set the  
99 - parameters of core recognition engine. For example, a jconf file can be  
100 - loaded to the enine by calling j_config_load_file_new() with the jconf  
101 - file name as argument.  
102 -  
103 - Please note that relative paths in a jconf file should be relative to  
104 - the jconf file itself, not the current working directory.  
105 -  
106 - Below are the details of all options, gathered by group.  
107 -  
108 - Julius application option  
109 - These are application options of Julius, outside of JuliusLib. It  
110 - contains parameters and switches for result output, character set  
111 - conversion, log level, and module mode options. These option are  
112 - specific to Julius, and cannot be used at applications using JuliusLib  
113 - other than Julius.  
114 -  
115 - -outfile  
116 - On file input, this option write the recognition result of each file  
117 - to a separate file. The output file of an input file will be the  
118 - same name but the suffix will be changed to ".out". (rev.4.0)  
119 -  
120 - -separatescore  
121 - Output the language and acoustic scores separately.  
122 -  
123 - -callbackdebug  
124 - Print the callback names at each call for debug. (rev.4.0)  
125 -  
126 - -charconv from to  
127 - Print with character set conversion. from is the source character  
128 - set used in the language model, and to is the target character set  
129 - you want to get.  
130 -  
131 - On Linux, the arguments should be a code name. You can obtain the  
132 - list of available code names by invoking the command "iconv --list".  
133 - On Windows, the arguments should be a code name or codepage number.  
134 - Code name should be one of "ansi", "mac", "oem", "utf-7", "utf-8",  
135 - "sjis", "euc". Or you can specify any codepage number supported at  
136 - your environment.  
137 -  
138 - -nocharconv  
139 - Disable character conversion.  
140 -  
141 - -module [port]  
142 - Run Julius on "Server Module Mode". After startup, Julius waits for  
143 - tcp/ip connection from client. Once connection is established,  
144 - Julius start communication with the client to process incoming  
145 - commands from the client, or to output recognition results, input  
146 - trigger information and other system status to the client. The  
147 - default port number is 10500.  
148 -  
149 - -record dir  
150 - Auto-save all input speech data into the specified directory. Each  
151 - segmented inputs are recorded each by one. The file name of the  
152 - recorded data is generated from system time when the input ends, in  
153 - a style of YYYY.MMDD.HHMMSS.wav. File format is 16bit monoral WAV.  
154 - Invalid for mfcfile input.  
155 -  
156 - With input rejection by -rejectshort, the rejected input will also  
157 - be recorded even if they are rejected.  
158 -  
159 - -logfile file  
160 - Save all log output to a file instead of standard output. (Rev.4.0)  
161 -  
162 - -nolog  
163 - Disable all log output. (Rev.4.0)  
164 -  
165 - -help  
166 - Output help message and exit.  
167 -  
168 - Global options  
169 - These are model-/search-dependent options relating audio input, sound  
170 - detection, GMM, decoding algorithm, plugin facility, and others. Global  
171 - options should be placed before any instance declaration (-AM, -LM, or  
172 - -SR), or just after "-GLOBAL" option.  
173 -  
174 - Audio input  
175 - -input  
176 - {mic|rawfile|mfcfile|adinnet|stdin|netaudio|alsa|oss|esd}  
177 - Choose speech input source. Specify 'file' or 'rawfile' for  
178 - waveform file, 'htkparam' or 'mfcfile' for HTK parameter  
179 - file. On file input, users will be prompted to enter the file  
180 - name from stdin, or you can use -filelist option to specify  
181 - list of files to process.  
182 -  
183 - 'mic' is to get audio input from a default live microphone  
184 - device, and 'adinnet' means receiving waveform data via tcpip  
185 - network from an adinnet client. 'netaudio' is from  
186 - DatLink/NetAudio input, and 'stdin' means data input from  
187 - standard input.  
188 -  
189 - For waveform file input, only WAV (no compression) and RAW  
190 - (noheader, 16bit, big endian) are supported by default. Other  
191 - format can be read when compiled with libsnd library. To see  
192 - what format is actually supported, see the help message using  
193 - option -help. For stdin input, only WAV and RAW is supported.  
194 - (default: mfcfile)  
195 -  
196 - At Linux, you can choose API at run time by specifying alsa,  
197 - oss and esd.  
198 -  
199 - -filelist filename  
200 - (With -input rawfile|mfcfile) perform recognition on all  
201 - files listed in the file. The file should contain input file  
202 - per line. Engine will end when all of the files are  
203 - processed.  
204 -  
205 - -notypecheck  
206 - By default, Julius checks the input parameter type whether it  
207 - matches the AM or not. This option will disable the check and  
208 - force engine to use the input vector as is.  
209 -  
210 - -48  
211 - Record input with 48kHz sampling, and down-sample it to 16kHz  
212 - on-the-fly. This option is valid for 16kHz model only. The  
213 - down-sampling routine was ported from sptk. (Rev. 4.0)  
214 -  
215 - -NA devicename  
216 - Host name for DatLink server input (-input netaudio).  
217 -  
218 - -adport port_number  
219 - With -input adinnet, specify adinnet port number to listen.  
220 - (default: 5530)  
221 -  
222 - -nostrip  
223 - Julius by default removes successive zero samples in input  
224 - speech data. This option inhibits the removal.  
225 -  
226 - -zmean , -nozmean  
227 - This option enables/disables DC offset removal of input  
228 - waveform. Offset will be estimated from the whole input. For  
229 - microphone / network input, zero mean of the first 48000  
230 - samples (3 seconds in 16kHz sampling) will be used for the  
231 - estimation. (default: disabled)  
232 -  
233 - This option uses static offset for the channel. See also  
234 - -zmeansource for frame-wise offset removal.  
235 -  
236 - Speech detection by level and zero-cross  
237 - -cutsilence , -nocutsilence  
238 - Turn on / off the speech detection by level and zero-cross.  
239 - Default is on for mic / adinnet input, and off for files.  
240 -  
241 - -lv thres  
242 - Level threshold for speech input detection. Values should be  
243 - in range from 0 to 32767. (default: 2000)  
244 -  
245 - -zc thres  
246 - Zero crossing threshold per second. Only input that goes over  
247 - the level threshold (-lv) will be counted. (default: 60)  
248 -  
249 - -headmargin msec  
250 - Silence margin at the start of speech segment in  
251 - milliseconds. (default: 300)  
252 -  
253 - -tailmargin msec  
254 - Silence margin at the end of speech segment in milliseconds.  
255 - (default: 400)  
256 -  
257 - Input rejection  
258 - Two simple front-end input rejection methods are implemented,  
259 - based on input length and average power of detected segment. The  
260 - rejection by average power is experimental, and can be enabled  
261 - by --enable-power-reject on compilation. Valid for MFCC feature  
262 - with power coefficient and real-time input only.  
263 -  
264 - For GMM-based input rejection see the GMM section below.  
265 -  
266 - -rejectshort msec  
267 - Reject input shorter than specified milliseconds. Search will  
268 - be terminated and no result will be output.  
269 -  
270 - -powerthres thres  
271 - Reject the inputted segment by its average energy. If the  
272 - average energy of the last recognized input is below the  
273 - threshold, Julius will reject the input. (Rev.4.0)  
274 -  
275 - This option is valid when --enable-power-reject is specified  
276 - at compilation time.  
277 -  
278 - Gaussian mixture model / GMM-VAD  
279 - GMM will be used for input rejection by accumulated score, or  
280 - for front-end GMM-based VAD when --enable-gmm-vad is specified.  
281 -  
282 - NOTE: You should also set the proper MFCC parameters required  
283 - for the GMM, specifying the acoustic parameters described in AM  
284 - section -AM_GMM.  
285 -  
286 - When GMM-based VAD is enabled, the voice activity score will be  
287 - calculated at each frame as front-end processing. The value will  
288 - be computed as \[ \max_{m \in M_v} p(x|m) - \max_{m \in M_n}  
289 - p(x|m) \] where $M_v$ is a set of voice GMM, and $M_n$ is a set  
290 - of noise GMM whose names should be specified by -gmmreject. The  
291 - activity score will be then averaged for the last N frames,  
292 - where N is specified by -gmmmargin. Julius updates the averaged  
293 - activity score at each frame, and detect speech up-trigger when  
294 - the value gets higher than a value specified by -gmmup, and  
295 - detecgt down-trigger when it gets lower than a value of  
296 - -gmmdown.  
297 -  
298 - -gmm hmmdefs_file  
299 - GMM definition file in HTK format. If specified, GMM-based  
300 - input verification will be performed concurrently with the  
301 - 1st pass, and you can reject the input according to the  
302 - result as specified by -gmmreject. The GMM should be defined  
303 - as one-state HMMs.  
304 -  
305 - -gmmnum number  
306 - Number of Gaussian components to be computed per frame on GMM  
307 - calculation. Only the N-best Gaussians will be computed for  
308 - rapid calculation. The default is 10 and specifying smaller  
309 - value will speed up GMM calculation, but too small value (1  
310 - or 2) may cause degradation of identification performance.  
311 -  
312 - -gmmreject string  
313 - Comma-separated list of GMM names to be rejected as invalid  
314 - input. When recognition, the log likelihoods of GMMs  
315 - accumulated for the entire input will be computed  
316 - concurrently with the 1st pass. If the GMM name of the  
317 - maximum score is within this string, the 2nd pass will not be  
318 - executed and the input will be rejected.  
319 -  
320 - -gmmmargin frames  
321 - (GMM_VAD) Head margin in frames. When a speech trigger  
322 - detected by GMM, recognition will start from current frame  
323 - minus this value. (Rev.4.0)  
324 -  
325 - This option will be valid only if compiled with  
326 - --enable-gmm-vad.  
327 -  
328 - -gmmup value  
329 - (GMM_VAD) Up trigger threshold of voice activity score.  
330 - (Rev.4.1)  
331 -  
332 - This option will be valid only if compiled with  
333 - --enable-gmm-vad.  
334 -  
335 - -gmmdown value  
336 - (GMM_VAD) Down trigger threshold of voice activity score.  
337 - (Rev.4.1)  
338 -  
339 - This option will be valid only if compiled with  
340 - --enable-gmm-vad.  
341 -  
342 - Decoding option  
343 - Real-time processing means concurrent processing of MFCC  
344 - computation 1st pass decoding. By default, real-time processing  
345 - on the pass is on for microphone / adinnet / netaudio input, and  
346 - for others.  
347 -  
348 - -realtime , -norealtime  
349 - Explicitly switch on / off real-time (pipe-line) processing  
350 - on the first pass. The default is off for file input, and on  
351 - for microphone, adinnet and NetAudio input. This option  
352 - relates to the way CMN and energy normalization is performed:  
353 - if off, they will be done using average features of whole  
354 - input. If on, MAP-CMN and energy normalization to do  
355 - real-time processing.  
356 -  
357 - Misc. options  
358 - -C jconffile  
359 - Load a jconf file at here. The content of the jconffile will  
360 - be expanded at this point.  
361 -  
362 - -version  
363 - Print version information to standard error, and exit.  
364 -  
365 - -setting  
366 - Print engine setting information to standard error, and exit.  
367 -  
368 - -quiet  
369 - Output less log. For result, only the best word sequence will  
370 - be printed.  
371 -  
372 - -debug  
373 - (For debug) output enormous internal message and debug  
374 - information to log.  
375 -  
376 - -check {wchmm|trellis|triphone}  
377 - For debug, enter interactive check mode.  
378 -  
379 - -plugindir dirlist  
380 - Specify directory to load plugin. If several direcotries  
381 - exist, specify them by colon-separated list.  
382 -  
383 - Instance declaration for multi decoding  
384 - The following arguments will create a new configuration set with  
385 - default parameters, and switch current set to it. Jconf parameters  
386 - specified after the option will be set into the current set.  
387 -  
388 - To do multi-model decoding, these argument should be specified at the  
389 - first of each model / search instances with different names. Any  
390 - options before the first instance definition will be IGNORED.  
391 -  
392 - When no instance definition is found (as older version of Julius), all  
393 - the options are assigned to a default instance named _default.  
394 -  
395 - Please note that decoding with a single LM and multiple AMs is not  
396 - fully supported. For example, you may want to construct the jconf file  
397 - as following.  
398 - This type of model sharing is not supported yet, since some part of LM  
399 - processing depends on the assigned AM. Instead, you can get the same  
400 - result by defining the same LMs for each AM, like this:  
401 -  
402 - -AM name  
403 - Create a new AM configuration set, and switch current to the new  
404 - one. You should give a unique name. (Rev.4.0)  
405 -  
406 - -LM name  
407 - Create a new LM configuration set, and switch current to the new  
408 - one. You should give a unique name. (Rev.4.0)  
409 -  
410 - -SR name am_name lm_name  
411 - Create a new search configuration set, and switch current to the new  
412 - one. The specified AM and LM will be assigned to it. The am_name and  
413 - lm_name can be either name or ID number. You should give a unique  
414 - name. (Rev.4.0)  
415 -  
416 - -AM_GMM  
417 - When using GMM for front-end processing, you can specify  
418 - GMM-specific acoustic parameters after this option. If you does not  
419 - specify -AM_GMM with GMM, the GMM will share the same parameter  
420 - vector as the last AM. The current AM will be switched to the GMM  
421 - one, so be careful not to confuse with normal AM configurations.  
422 - (Rev.4.0)  
423 -  
424 - -GLOBAL  
425 - Start a global section. The global options should be placed before  
426 - any instance declaration, or after this option on multiple model  
427 - recognition. This can be used multiple times. (Rev.4.1)  
428 -  
429 - -nosectioncheck , -sectioncheck  
430 - Disable / enable option location check in multi-model decoding. When  
431 - enabled, the options between instance declaration is treated as  
432 - "sections" and only the belonging option types can be written. For  
433 - example, when an option -AM is specified, only the AM related option  
434 - can be placed after the option until other declaration is found.  
435 - Also, global options should be placed at top, before any instance  
436 - declarataion. This is enabled by default. (Rev.4.1)  
437 -  
438 - Language model (-LM)  
439 - This group contains options for model definition of each language model  
440 - type. When using multiple LM, one instance can have only one LM.  
441 -  
442 - Only one type of LM can be specified for a LM configuration. If you  
443 - want to use multi model, you should define them one as a new LM.  
444 -  
445 - N-gram  
446 - -d bingram_file  
447 - Use binary format N-gram. An ARPA N-gram file can be  
448 - converted to Julius binary format by mkbingram.  
449 -  
450 - -nlr arpa_ngram_file  
451 - A forward, left-to-right N-gram language model in standard  
452 - ARPA format. When both a forward N-gram and backward N-gram  
453 - are specified, Julius uses this forward 2-gram for the 1st  
454 - pass, and the backward N-gram for the 2nd pass.  
455 -  
456 - Since ARPA file often gets huge and requires a lot of time to  
457 - load, it may be better to convert the ARPA file to Julius  
458 - binary format by mkbingram. Note that if both forward and  
459 - backward N-gram is used for recognition, they together will  
460 - be converted to a single binary.  
461 -  
462 - When only a forward N-gram is specified by this option and no  
463 - backward N-gram specified by -nrl, Julius performs  
464 - recognition with only the forward N-gram. The 1st pass will  
465 - use the 2-gram entry in the given N-gram, and The 2nd pass  
466 - will use the given N-gram, with converting forward  
467 - probabilities to backward probabilities by Bayes rule.  
468 - (Rev.4.0)  
469 -  
470 - -nrl arpa_ngram_file  
471 - A backward, right-to-left N-gram language model in standard  
472 - ARPA format. When both a forward N-gram and backward N-gram  
473 - are specified, Julius uses the forward 2-gram for the 1st  
474 - pass, and this backward N-gram for the 2nd pass.  
475 -  
476 - Since ARPA file often gets huge and requires a lot of time to  
477 - load, it may be better to convert the ARPA file to Julius  
478 - binary format by mkbingram. Note that if both forward and  
479 - backward N-gram is used for recognition, they together will  
480 - be converted to a single binary.  
481 -  
482 - When only a backward N-gram is specified by this option and  
483 - no forward N-gram specified by -nlr, Julius performs  
484 - recognition with only the backward N-gram. The 1st pass will  
485 - use the forward 2-gram probability computed from the backward  
486 - 2-gram using Bayes rule. The 2nd pass fully use the given  
487 - backward N-gram. (Rev.4.0)  
488 -  
489 - -v dict_file  
490 - Word dictionary file.  
491 -  
492 - -silhead word_string -siltail word_string  
493 - Silence word defined in the dictionary, for silences at the  
494 - beginning of sentence and end of sentence. (default: "<s>",  
495 - "</s>")  
496 -  
497 - -mapunk word_string  
498 - Specify unknown word. Default is "<unk>" or "<UNK>". This  
499 - will be used to assign word probability on unknown words,  
500 - i.e. words in dictionary that are not in N-gram vocabulary.  
501 -  
502 - -iwspword  
503 - Add a word entry to the dictionary that should correspond to  
504 - inter-word pauses. This may improve recognition accuracy in  
505 - some language model that has no explicit inter-word pause  
506 - modeling. The word entry to be added can be changed by  
507 - -iwspentry.  
508 -  
509 - -iwspentry word_entry_string  
510 - Specify the word entry that will be added by -iwspword.  
511 - (default: "<UNK> [sp] sp sp")  
512 -  
513 - -sepnum number  
514 - Number of high frequency words to be isolated from the  
515 - lexicon tree, to ease approximation error that may be caused  
516 - by the one-best approximation on 1st pass. (default: 150)  
517 -  
518 - Grammar  
519 - Multiple grammars can be specified by repeating -gram and  
520 - -gramlist. Note that this is unusual behavior from other options  
521 - (in normal Julius option, last one will override previous ones).  
522 - You can use -nogram to reset the grammars already specified  
523 - before the point.  
524 -  
525 - -gram gramprefix1[,gramprefix2[,gramprefix3,...]]  
526 - Comma-separated list of grammars to be used. the argument  
527 - should be a prefix of a grammar, i.e. if you have foo.dfa and  
528 - foo.dict, you should specify them with a single argument foo.  
529 - Multiple grammars can be specified at a time as a  
530 - comma-separated list.  
531 -  
532 - -gramlist list_file  
533 - Specify a grammar list file that contains list of grammars to  
534 - be used. The list file should contain the prefixes of  
535 - grammars, each per line. A relative path in the list file  
536 - will be treated as relative to the file, not the current path  
537 - or configuration file.  
538 -  
539 - -dfa dfa_file -v dict_file  
540 - An old way of specifying grammar files separately. This is  
541 - bogus, and should not be used any more.  
542 -  
543 - -nogram  
544 - Remove the current list of grammars already specified by  
545 - -gram, -gramlist, -dfa and -v.  
546 -  
547 - Isolated word  
548 - Dictionary can be specified by using -w and -wlist. When you  
549 - specify multiple times, all of them will be read at startup. You  
550 - can use -nogram to reset the already specified dictionaries at  
551 - that point.  
552 -  
553 - -w dict_file  
554 - Word dictionary for isolated word recognition. File format is  
555 - the same as other LM. (Rev.4.0)  
556 -  
557 - -wlist list_file  
558 - Specify a dictionary list file that contains list of  
559 - dictionaries to be used. The list file should contain the  
560 - file name of dictionaries, each per line. A relative path in  
561 - the list file will be treated as relative to the list file,  
562 - not the current path or configuration file. (Rev.4.0)  
563 -  
564 - -nogram  
565 - Remove the current list of dictionaries already specified by  
566 - -w and -wlist.  
567 -  
568 - -wsil head_sil_model_name tail_sil_model_name sil_context_name  
569 - On isolated word recognition, silence models will be appended  
570 - to the head and tail of each word at recognition. This option  
571 - specifies the silence models to be appended.  
572 - sil_context_name is the name of the head sil model and tail  
573 - sil model as a context of word head phone and tail phone. For  
574 - example, if you specify -wsil silB silE sp, a word with phone  
575 - sequence b eh t will be translated as silB sp-b+eh b-eh+t  
576 - eh-t+sp silE. (Rev.4.0)  
577 -  
578 - User-defined LM  
579 - -userlm  
580 - Declare to use user LM functions in the program. This option  
581 - should be specified if you use user-defined LM functions.  
582 - (Rev.4.0)  
583 -  
584 - Misc. LM options  
585 - -forcedict  
586 - Skip error words in dictionary and force running.  
587 -  
588 - Acoustic model and feature analysis (-AM) (-AM_GMM)  
589 - This section is about options for acoustic model, feature extraction,  
590 - feature normalizations and spectral subtraction.  
591 -  
592 - After -AM name, an acoustic model and related specification should be  
593 - written. You can use multiple AMs trained with different MFCC types.  
594 - For GMM, the required parameter condition should be specified just as  
595 - same as AMs after -AM_GMM.  
596 -  
597 - When using multiple AMs, the values of -smpPeriod, -smpFreq, -fsize and  
598 - -fshift should be the same among all AMs.  
599 -  
600 - Acoustic HMM  
601 - -h hmmdef_file  
602 - Acoustic HMM definition file. It should be in HTK ascii  
603 - format, or Julius binary format. You can convert HTK ascii  
604 - format to Julius binary format using mkbinhmm.  
605 -  
606 - -hlist hmmlist_file  
607 - HMMList file for phone mapping. This file provides mapping  
608 - between logical triphone names generated in the dictionary  
609 - and the defined HMM names in hmmdefs. This option should be  
610 - specified for context-dependent model.  
611 -  
612 - -tmix number  
613 - Specify the number of top Gaussians to be calculated in a  
614 - mixture codebook. Small number will speed up the acoustic  
615 - computation, but AM accuracy may get worse with too small  
616 - value. See also -gprune. (default: 2)  
617 -  
618 - -spmodel name  
619 - Specify HMM model name that corresponds to short-pause in an  
620 - utterance. The short-pause model name will be used in  
621 - recognition: short-pause skipping on grammar recognition,  
622 - word-end short-pause model insertion with -iwsp on N-gram, or  
623 - short-pause segmentation (-spsegment). (default: "sp")  
624 -  
625 - -multipath  
626 - Enable multi-path mode. To make decoding faster, Julius by  
627 - default impose a limit on HMM transitions that each model  
628 - should have only one transition from initial state and to end  
629 - state. On multi-path mode, Julius does extra handling on  
630 - inter-model transition to allows model-skipping transition  
631 - and multiple output/input transitions. Note that specifying  
632 - this option will make Julius a bit slower, and the larger  
633 - beam width may be required.  
634 -  
635 - This function was a compilation-time option on Julius 3.x,  
636 - and now becomes a run-time option. By default (without this  
637 - option), Julius checks the transition type of specified HMMs,  
638 - and enable the multi-path mode if required. You can force  
639 - multi-path mode with this option. (rev.4.0)  
640 -  
641 - -gprune {safe|heuristic|beam|none|default}  
642 - Set Gaussian pruning algorithm to use. For tied-mixture  
643 - model, Julius performs Gaussian pruning to reduce acoustic  
644 - computation, by calculating only the top N Gaussians in each  
645 - codebook at each frame. The default setting will be set  
646 - according to the model type and engine setting. default will  
647 - force accepting the default setting. Set this to none to  
648 - disable pruning and perform full computation. safe  
649 - guarantees the top N Gaussians to be computed. heuristic and  
650 - beam do more aggressive computational cost reduction, but may  
651 - result in small loss of accuracy model (default: safe  
652 - (standard), beam (fast) for tied mixture model, none for non  
653 - tied-mixture model).  
654 -  
655 - -iwcd1 {max|avg|best number}  
656 - Select method to approximate inter-word triphone on the head  
657 - and tail of a word in the first pass.  
658 -  
659 -  
660 - max will apply the maximum likelihood of the same context  
661 - triphones. avg will apply the average likelihood of the same  
662 - context triphones. best number will apply the average of top  
663 - N-best likelihoods of the same context triphone.  
664 -  
665 - Default is best 3 for use with N-gram, and avg for grammar  
666 - and word. When this AM is shared by LMs of both type, latter  
667 - one will be chosen.  
668 -  
669 - -iwsppenalty float  
670 - Insertion penalty for word-end short pauses appended by  
671 - -iwsp.  
672 -  
673 - -gshmm hmmdef_file  
674 - If this option is specified, Julius performs Gaussian Mixture  
675 - Selection for efficient decoding. The hmmdefs should be a  
676 - monophone model generated from an ordinary monophone HMM  
677 - model, using mkgshmm.  
678 -  
679 - -gsnum number  
680 - On GMS, specify number of monophone states to compute  
681 - corresponding triphones in detail. (default: 24)  
682 -  
683 - Speech analysis  
684 - Only MFCC feature extraction is supported in current Julius.  
685 - Thus when recognizing a waveform input from file or microphone,  
686 - AM must be trained by MFCC. The parameter condition should also  
687 - be set as exactly the same as the training condition by the  
688 - options below.  
689 -  
690 - When you give an input in HTK Parameter file, you can use any  
691 - parameter type for AM. In this case Julius does not care about  
692 - the type of input feature and AM, just read them as vector  
693 - sequence and match them to the given AM. Julius only checks  
694 - whether the parameter types are the same. If it does not work  
695 - well, you can disable this checking by -notypecheck.  
696 -  
697 - In Julius, the parameter kind and qualifiers (as TARGETKIND in  
698 - HTK) and the number of cepstral parameters (NUMCEPS) will be set  
699 - automatically from the content of the AM header, so you need not  
700 - specify them by options.  
701 -  
702 - Other parameters should be set exactly the same as training  
703 - condition. You can also give a HTK Config file which you used to  
704 - train AM to Julius by -htkconf. When this option is applied,  
705 - Julius will parse the Config file and set appropriate parameter.  
706 -  
707 - You can further embed those analysis parameter settings to a  
708 - binary HMM file using mkbinhmm.  
709 -  
710 - If options specified in several ways, they will be evaluated in  
711 - the order below. The AM embedded parameter will be loaded first  
712 - if any. Then, the HTK config file given by -htkconf will be  
713 - parsed. If a value already set by AM embedded value, HTK config  
714 - will override them. At last, the direct options will be loaded,  
715 - which will override settings loaded before. Note that, when the  
716 - same options are specified several times, later will override  
717 - previous, except that -htkconf will be evaluated first as  
718 - described above.  
719 -  
720 - -smpPeriod period  
721 - Sampling period of input speech, in unit of 100 nanoseconds.  
722 - Sampling rate can also be specified by -smpFreq. Please note  
723 - that the input frequency should be set equal to the training  
724 - conditions of AM. (default: 625, corresponds to 16,000Hz)  
725 -  
726 - This option corresponds to the HTK Option SOURCERATE. The  
727 - same value can be given to this option.  
728 -  
729 - When using multiple AM, this value should be the same among  
730 - all AMs.  
731 -  
732 - -smpFreq Hz  
733 - Set sampling frequency of input speech in Hz. Sampling rate  
734 - can also be specified using -smpPeriod. Please note that this  
735 - frequency should be set equal to the training conditions of  
736 - AM. (default: 16,000)  
737 -  
738 - When using multiple AM, this value should be the same among  
739 - all AMs.  
740 -  
741 - -fsize sample_num  
742 - Window size in number of samples. (default: 400)  
743 -  
744 - This option corresponds to the HTK Option WINDOWSIZE, but  
745 - value should be in samples (HTK value / smpPeriod).  
746 -  
747 - When using multiple AM, this value should be the same among  
748 - all AMs.  
749 -  
750 - -fshift sample_num  
751 - Frame shift in number of samples. (default: 160)  
752 -  
753 - This option corresponds to the HTK Option TARGETRATE, but  
754 - value should be in samples (HTK value / smpPeriod).  
755 -  
756 - When using multiple AM, this value should be the same among  
757 - all AMs.  
758 -  
759 - -preemph float  
760 - Pre-emphasis coefficient. (default: 0.97)  
761 -  
762 - This option corresponds to the HTK Option PREEMCOEF. The same  
763 - value can be given to this option.  
764 -  
765 - -fbank num  
766 - Number of filterbank channels. (default: 24)  
767 -  
768 - This option corresponds to the HTK Option NUMCHANS. The same  
769 - value can be given to this option. Be aware that the default  
770 - value not the same as in HTK (22).  
771 -  
772 - -ceplif num  
773 - Cepstral liftering coefficient. (default: 22)  
774 -  
775 - This option corresponds to the HTK Option CEPLIFTER. The same  
776 - value can be given to this option.  
777 -  
778 - -rawe , -norawe  
779 - Enable/disable using raw energy before pre-emphasis (default:  
780 - disabled)  
781 -  
782 - This option corresponds to the HTK Option RAWENERGY. Be aware  
783 - that the default value differs from HTK (enabled at HTK,  
784 - disabled at Julius).  
785 -  
786 - -enormal , -noenormal  
787 - Enable/disable normalizing log energy. On live input, this  
788 - normalization will be approximated from the average of last  
789 - input. (default: disabled)  
790 -  
791 - This option corresponds to the HTK Option ENORMALISE. Be  
792 - aware that the default value differs from HTK (enabled at  
793 - HTK, disabled at Julius).  
794 -  
795 - -escale float_scale  
796 - Scaling factor of log energy when normalizing log energy.  
797 - (default: 1.0)  
798 -  
799 - This option corresponds to the HTK Option ESCALE. Be aware  
800 - that the default value differs from HTK (0.1).  
801 -  
802 - -silfloor float  
803 - Energy silence floor in dB when normalizing log energy.  
804 - (default: 50.0)  
805 -  
806 - This option corresponds to the HTK Option SILFLOOR.  
807 -  
808 - -delwin frame  
809 - Delta window size in number of frames. (default: 2)  
810 -  
811 - This option corresponds to the HTK Option DELTAWINDOW. The  
812 - same value can be given to this option.  
813 -  
814 - -accwin frame  
815 - Acceleration window size in number of frames. (default: 2)  
816 -  
817 - This option corresponds to the HTK Option ACCWINDOW. The same  
818 - value can be given to this option.  
819 -  
820 - -hifreq Hz  
821 - Enable band-limiting for MFCC filterbank computation: set  
822 - upper frequency cut-off. Value of -1 will disable it.  
823 - (default: -1)  
824 -  
825 - This option corresponds to the HTK Option HIFREQ. The same  
826 - value can be given to this option.  
827 -  
828 - -lofreq Hz  
829 - Enable band-limiting for MFCC filterbank computation: set  
830 - lower frequency cut-off. Value of -1 will disable it.  
831 - (default: -1)  
832 -  
833 - This option corresponds to the HTK Option LOFREQ. The same  
834 - value can be given to this option.  
835 -  
836 - -zmeanframe , -nozmeanframe  
837 - With speech input, this option enables/disables frame-wise DC  
838 - offset removal. This corresponds to HTK configuration  
839 - ZMEANSOURCE. This cannot be used together with -zmean.  
840 - (default: disabled)  
841 -  
842 - -usepower  
843 - Use power instead of magnitude on filterbank analysis.  
844 - (default: disabled)  
845 -  
846 - Normalization  
847 - Julius can perform cepstral mean normalization (CMN) for inputs.  
848 - CMN will be activated when the given AM was trained with CMN  
849 - (i.e. has "_Z" qualifier in the header).  
850 -  
851 - The cepstral mean will be estimated in different way according  
852 - to the input type. On file input, the mean will be computed from  
853 - the whole input. On live input such as microphone and network  
854 - input, the ceptral mean of the input is unknown at the start. So  
855 - MAP-CMN will be used. On MAP-CMN, an initial mean vector will be  
856 - applied at the beginning, and the mean vector will be smeared to  
857 - the mean of the incrementing input vector as input goes. Options  
858 - below can control the behavior of MAP-CMN.  
859 -  
860 - -cvn  
861 - Enable cepstral variance normalization. At file input, the  
862 - variance of whole input will be calculated and then applied.  
863 - At live microphone input, variance of the last input will be  
864 - applied. CVN is only supported for an audio input.  
865 -  
866 - -vtln alpha lowcut hicut  
867 - Do frequency warping, typically for a vocal tract length  
868 - normalization (VTLN). Arguments are warping factor, high  
869 - frequency cut-off and low freq. cut-off. They correspond to  
870 - HTK Config values, WARPFREQ, WARPHCUTOFF and WARPLCUTOFF.  
871 -  
872 - -cmnload file  
873 - Load initial cepstral mean vector from file on startup. The  
874 - file should be one saved by -cmnsave. Loading an initial  
875 - cepstral mean enables Julius to better recognize the first  
876 - utterance on a real-time input. When used together with  
877 - -cmnnoupdate, this initial value will be used for all input.  
878 -  
879 - -cmnsave file  
880 - Save the calculated cepstral mean vector into file. The  
881 - parameters will be saved at each input end. If the output  
882 - file already exists, it will be overridden.  
883 -  
884 - -cmnupdate -cmnnoupdate  
885 - Control whether to update the cepstral mean at each input on  
886 - real-time input. Disabling this and specifying -cmnload will  
887 - make engine to always use the loaded static initial cepstral  
888 - mean.  
889 -  
890 - -cmnmapweight float  
891 - Specify the weight of initial cepstral mean for MAP-CMN.  
892 - Specify larger value to retain the initial cepstral mean for  
893 - a longer period, and smaller value to make the cepstral mean  
894 - rely more on the current input. (default: 100.0)  
895 -  
896 - Front-end processing  
897 - Julius can perform spectral subtraction to reduce some  
898 - stationary noise from audio input. Though it is not a powerful  
899 - method, but it may work on some situation. Julius has two ways  
900 - to estimate noise spectrum. One way is to assume that the first  
901 - short segment of an speech input is noise segment, and estimate  
902 - the noise spectrum as the average of the segment. Another way is  
903 - to calculate average spectrum from noise-only input using other  
904 - tool mkss, and load it in Julius. The former one is popular for  
905 - speech file input, and latter should be used in live input. The  
906 - options below will switch / control the behavior.  
907 -  
908 - -sscalc  
909 - Perform spectral subtraction using head part of each file as  
910 - silence part. The head part length should be specified by  
911 - -sscalclen. Valid only for file input. Conflict with -ssload.  
912 -  
913 - -sscalclen msec  
914 - With -sscalc, specify the length of head silence for noise  
915 - spectrum estimation in milliseconds. (default: 300)  
916 -  
917 - -ssload file  
918 - Perform spectral subtraction for speech input using  
919 - pre-estimated noise spectrum loaded from file. The noise  
920 - spectrum file can be made by mkss. Valid for all speech  
921 - input. Conflict with -sscalc.  
922 -  
923 - -ssalpha float  
924 - Alpha coefficient of spectral subtraction for -sscalc and  
925 - -ssload. Noise will be subtracted stronger as this value gets  
926 - larger, but distortion of the resulting signal also becomes  
927 - remarkable. (default: 2.0)  
928 -  
929 - -ssfloor float  
930 - Flooring coefficient of spectral subtraction. The spectral  
931 - power that goes below zero after subtraction will be  
932 - substituted by the source signal with this coefficient  
933 - multiplied. (default: 0.5)  
934 -  
935 - Misc. AM options  
936 - -htkconf file  
937 - Parse the given HTK Config file, and set corresponding  
938 - parameters to Julius. When using this option, the default  
939 - parameter values are switched from Julius defaults to HTK  
940 - defaults.  
941 -  
942 - Recognition process and search (-SR)  
943 - This section contains options for search parameters on the 1st / 2nd  
944 - pass such as beam width and LM weights, configurations for short-pause  
945 - segmentation, switches for word lattice output and confusion network  
946 - output, forced alignments, and other options relating recognition  
947 - process and result output.  
948 -  
949 - Default values for beam width and LM weights will change according to  
950 - compile-time setup of JuliusLib , AM model type, and LM size. Please  
951 - see the startup log for the actual values.  
952 -  
953 - 1st pass parameters  
954 - -lmp weight penalty  
955 - (N-gram) Language model weights and word insertion penalties  
956 - for the first pass.  
957 -  
958 - -penalty1 penalty  
959 - (Grammar) word insertion penalty for the first pass.  
960 - (default: 0.0)  
961 -  
962 - -b width  
963 - Beam width in number of HMM nodes for rank beaming on the  
964 - first pass. This value defines search width on the 1st pass,  
965 - and has dominant effect on the total processing time. Smaller  
966 - width will speed up the decoding, but too small value will  
967 - result in a substantial increase of recognition errors due to  
968 - search failure. Larger value will make the search stable and  
969 - will lead to failure-free search, but processing time will  
970 - grow in proportion to the width.  
971 -  
972 - The default value is dependent on acoustic model type: 400  
973 - (monophone), 800 (triphone), or 1000 (triphone, setup=v2.1)  
974 -  
975 - -nlimit num  
976 - Upper limit of token per node. This option is valid when  
977 - --enable-wpair and --enable-wpair-nlimit are enabled at  
978 - compilation time.  
979 -  
980 - -progout  
981 - Enable progressive output of the partial results on the first  
982 - pass.  
983 -  
984 - -proginterval msec  
985 - Set the time interval for -progout in milliseconds. (default:  
986 - 300)  
987 -  
988 - 2nd pass parameters  
989 - -lmp2 weight penalty  
990 - (N-gram) Language model weights and word insertion penalties  
991 - for the second pass.  
992 -  
993 - -penalty2 penalty  
994 - (Grammar) word insertion penalty for the second pass.  
995 - (default: 0.0)  
996 -  
997 - -b2 width  
998 - Envelope beam width (number of hypothesis) at the second  
999 - pass. If the count of word expansion at a certain hypothesis  
1000 - length reaches this limit while search, shorter hypotheses  
1001 - are not expanded further. This prevents search to fall in  
1002 - breadth-first-like situation stacking on the same position,  
1003 - and improve search failure mostly for large vocabulary  
1004 - condition. (default: 30)  
1005 -  
1006 - -sb float  
1007 - Score envelope width for enveloped scoring. When calculating  
1008 - hypothesis score for each generated hypothesis, its trellis  
1009 - expansion and Viterbi operation will be pruned in the middle  
1010 - of the speech if score on a frame goes under the width.  
1011 - Giving small value makes the second pass faster, but  
1012 - computation error may occur. (default: 80.0)  
1013 -  
1014 - -s num  
1015 - Stack size, i.e. the maximum number of hypothesis that can be  
1016 - stored on the stack during the search. A larger value may  
1017 - give more stable results, but increases the amount of memory  
1018 - required. (default: 500)  
1019 -  
1020 - -m count  
1021 - Number of expanded hypotheses required to discontinue the  
1022 - search. If the number of expanded hypotheses is greater then  
1023 - this threshold then, the search is discontinued at that  
1024 - point. The larger this value is, The longer Julius gets to  
1025 - give up search. (default: 2000)  
1026 -  
1027 - -n num  
1028 - The number of candidates Julius tries to find. The search  
1029 - continues till this number of sentence hypotheses have been  
1030 - found. The obtained sentence hypotheses are sorted by score,  
1031 - and final result is displayed in the order (see also the  
1032 - -output). The possibility that the optimum hypothesis is  
1033 - correctly found increases as this value gets increased, but  
1034 - the processing time also becomes longer. The default value  
1035 - depends on the engine setup on compilation time: 10  
1036 - (standard) or 1 (fast or v2.1)  
1037 -  
1038 - -output num  
1039 - The top N sentence hypothesis to be output at the end of  
1040 - search. Use with -n (default: 1)  
1041 -  
1042 - -lookuprange frame  
1043 - Set the number of frames before and after to look up next  
1044 - word hypotheses in the word trellis on the second pass. This  
1045 - prevents the omission of short words, but with a large value,  
1046 - the number of expanded hypotheses increases and system  
1047 - becomes slow. (default: 5)  
1048 -  
1049 - -looktrellis  
1050 - (Grammar) Expand only the words survived on the first pass  
1051 - instead of expanding all the words predicted by grammar. This  
1052 - option makes second pass decoding faster especially for large  
1053 - vocabulary condition, but may increase deletion error of  
1054 - short words. (default: disabled)  
1055 -  
1056 - Short-pause segmentation / decoder-VAD  
1057 - When compiled with --enable-decoder-vad, the short-pause  
1058 - segmentation will be extended to support decoder-based VAD.  
1059 -  
1060 - -spsegment  
1061 - Enable short-pause segmentation mode. Input will be segmented  
1062 - when a short pause word (word with only silence model in  
1063 - pronunciation) gets the highest likelihood at certain  
1064 - successive frames on the first pass. When detected segment  
1065 - end, Julius stop the 1st pass at the point, perform 2nd pass,  
1066 - and continue with next segment. The word context will be  
1067 - considered among segments. (Rev.4.0)  
1068 -  
1069 - When compiled with --enable-decoder-vad, this option enables  
1070 - decoder-based VAD, to skip long silence.  
1071 -  
1072 - -spdur frame  
1073 - Short pause duration length to detect end of input segment,  
1074 - in number of frames. (default: 10)  
1075 -  
1076 - -pausemodels string  
1077 - A comma-separated list of pause model names to be used at  
1078 - short-pause segmentation. The word whose pronunciation  
1079 - consists of only the pause models will be treated as "pause  
1080 - word" and used for pause detection. If not specified, name of  
1081 - -spmodel, -silhead and -siltail will be used. (Rev.4.0)  
1082 -  
1083 - -spmargin frame  
1084 - Back step margin at trigger up for decoder-based VAD. When  
1085 - speech up-trigger found by decoder-VAD, Julius will rewind  
1086 - the input parameter by this value, and start recognition at  
1087 - the point. (Rev.4.0)  
1088 -  
1089 - This option will be valid only if compiled with  
1090 - --enable-decoder-vad.  
1091 -  
1092 - -spdelay frame  
1093 - Trigger decision delay frame at trigger up for decoder-based  
1094 - VAD. (Rev.4.0)  
1095 -  
1096 - This option will be valid only if compiled with  
1097 - --enable-decoder-vad.  
1098 -  
1099 - Word lattice / confusion network output  
1100 - -lattice , -nolattice  
1101 - Enable / disable generation of word graph. Search algorithm  
1102 - also has changed to optimize for better word graph  
1103 - generation, so the sentence result may not be the same as  
1104 - normal N-best recognition. (Rev.4.0)  
1105 -  
1106 - -confnet , -noconfnet  
1107 - Enable / disable generation of confusion network. Enabling  
1108 - this will also activates -lattice internally. (Rev.4.0)  
1109 -  
1110 - -graphrange frame  
1111 - Merge same words at neighbor position at graph generation. If  
1112 - the beginning time and ending time of two word candidates of  
1113 - the same word is within the specified range, they will be  
1114 - merged. The default is 0 (allow merging same words on exactly  
1115 - the same location) and specifying larger value will result in  
1116 - smaller graph output. Setting this value to -1 will disable  
1117 - merging, in that case same words on the same location of  
1118 - different scores will be left as they are. (default: 0)  
1119 -  
1120 - -graphcut depth  
1121 - Cut the resulting graph by its word depth at post-processing  
1122 - stage. The depth value is the number of words to be allowed  
1123 - at a frame. Setting to -1 disables this feature. (default:  
1124 - 80)  
1125 -  
1126 - -graphboundloop count  
1127 - Limit the number of boundary adjustment loop at  
1128 - post-processing stage. This parameter prevents Julius from  
1129 - blocking by infinite adjustment loop by short word  
1130 - oscillation. (default: 20)  
1131 -  
1132 - -graphsearchdelay , -nographsearchdelay  
1133 - When this option is enabled, Julius modifies its graph  
1134 - generation algorithm on the 2nd pass not to terminate search  
1135 - by graph merging, until the first sentence candidate is  
1136 - found. This option may improve graph accuracy, especially  
1137 - when you are going to generate a huge word graph by setting  
1138 - broad search. Namely, it may result in better graph accuracy  
1139 - when you set wide beams on both 1st pass -b and 2nd pass -b2,  
1140 - and large number for -n. (default: disabled)  
1141 -  
1142 - Multi-gram / multi-dic recognition  
1143 - -multigramout , -nomultigramout  
1144 - On grammar recognition using multiple grammars, Julius will  
1145 - output only the best result among all grammars. Enabling this  
1146 - option will make Julius to output result for each grammar.  
1147 - (default: disabled)  
1148 -  
1149 - Forced alignment  
1150 - -walign  
1151 - Do viterbi alignment per word units for the recognition  
1152 - result. The word boundary frames and the average acoustic  
1153 - scores per frame will be calculated.  
1154 -  
1155 - -palign  
1156 - Do viterbi alignment per phone units for the recognition  
1157 - result. The phone boundary frames and the average acoustic  
1158 - scores per frame will be calculated.  
1159 -  
1160 - -salign  
1161 - Do viterbi alignment per state for the recognition result.  
1162 - The state boundary frames and the average acoustic scores per  
1163 - frame will be calculated.  
1164 -  
1165 - Misc. search options  
1166 - -inactive  
1167 - Start this recognition process instance with inactive state.  
1168 - (Rev.4.0)  
1169 -  
1170 - -1pass  
1171 - Perform only the first pass.  
1172 -  
1173 - -fallback1pass  
1174 - When 2nd pass fails, Julius finish the recognition with no  
1175 - result. This option tell Julius to output the 1st pass result  
1176 - as a final result when the 2nd pass fails. Note that some  
1177 - score output (confidence etc.) may not be useful. This was  
1178 - the default behavior of Julius-3.x.  
1179 -  
1180 - -no_ccd , -force_ccd  
1181 - Explicitly switch phone context handling at search. Normally  
1182 - Julius determines whether the using AM is a context-dependent  
1183 - model or not from the model names, i.e., whether the names  
1184 - contain character + and -. This option will override the  
1185 - automatic detection.  
1186 -  
1187 - -cmalpha float  
1188 - Smoothing parameter for confidence scoring. (default: 0.05)  
1189 -  
1190 - -iwsp  
1191 - (Multi-path mode only) Enable inter-word context-free short  
1192 - pause insertion. This option appends a skippable short pause  
1193 - model for every word end. The short-pause model can be  
1194 - specified by -spmodel.  
1195 -  
1196 - -transp float  
1197 - Additional insertion penalty for transparent words. (default:  
1198 - 0.0)  
1199 -  
1200 - -demo  
1201 - Equivalent to -progout -quiet.  
1202 -  
1203 -ENVIRONMENT VARIABLES  
1204 - ALSADEV  
1205 - (using mic input with alsa device) specify a capture device name. If  
1206 - not specified, "default" will be used.  
1207 -  
1208 - AUDIODEV  
1209 - (using mic input with oss device) specify a capture device path. If  
1210 - not specified, "/dev/dsp" will be used.  
1211 -  
1212 - LATENCY_MSEC  
1213 - Try to set input latency of microphone input in milliseconds.  
1214 - Smaller value will shorten latency but sometimes make process  
1215 - unstable. Default value will depend on the running OS.  
1216 -  
1217 -EXAMPLES  
1218 - For examples of system usage, refer to the tutorial section in the  
1219 - Julius documents.  
1220 -  
1221 -NOTICE  
1222 - Note about jconf files: relative paths in a jconf file are interpreted  
1223 - as relative to the jconf file itself, not to the current directory.  
1224 -  
1225 -SEE ALSO  
1226 - julian(1), jcontrol(1), adinrec(1), adintool(1), mkbingram(1),  
1227 - mkbinhmm(1), mkgsmm(1), wav2mfcc(1), mkss(1)  
1228 -  
1229 - http://julius.sourceforge.jp/en/  
1230 -  
1231 -DIAGNOSTICS  
1232 - Julius normally will return the exit status 0. If an error occurs,  
1233 - Julius exits abnormally with exit status 1. If an input file cannot be  
1234 - found or cannot be loaded for some reason then Julius will skip  
1235 - processing for that file.  
1236 -  
1237 -BUGS  
1238 - There are some restrictions to the type and size of the models Julius  
1239 - can use. For a detailed explanation refer to the Julius documentation.  
1240 - For bug-reports, inquires and comments please contact julius-info at  
1241 - lists.sourceforge.jp.  
1242 -  
1243 -COPYRIGHT  
1244 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
1245 -  
1246 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
1247 -  
1248 - Copyright (c) 2000-2008 Shikano Lab., Nara Institute of Science and  
1249 - Technology  
1250 -  
1251 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
1252 - Technology  
1253 -  
1254 -AUTHORS  
1255 - Rev.1.0 (1998/02/20)  
1256 - Designed by Tatsuya KAWAHARA and Akinobu LEE (Kyoto University)  
1257 -  
1258 - Development by Akinobu LEE (Kyoto University)  
1259 -  
1260 - Rev.1.1 (1998/04/14), Rev.1.2 (1998/10/31), Rev.2.0 (1999/02/20),  
1261 - Rev.2.1 (1999/04/20), Rev.2.2 (1999/10/04), Rev.3.0 (2000/02/14),  
1262 - Rev.3.1 (2000/05/11)  
1263 - Development of above versions by Akinobu LEE (Kyoto University)  
1264 -  
1265 - Rev.3.2 (2001/08/15), Rev.3.3 (2002/09/11), Rev.3.4 (2003/10/01),  
1266 - Rev.3.4.1 (2004/02/25), Rev.3.4.2 (2004/04/30)  
1267 - Development of above versions by Akinobu LEE (Nara Institute of  
1268 - Science and Technology)  
1269 -  
1270 - Rev.3.5 (2005/11/11), Rev.3.5.1 (2006/03/31), Rev.3.5.2 (2006/07/31),  
1271 - Rev.3.5.3 (2006/12/29), Rev.4.0 (2007/12/19), Rev.4.1 (2008/10/03)  
1272 - Development of above versions by Akinobu LEE (Nagoya Institute of  
1273 - Technology)  
1274 -  
1275 -THANKS TO  
1276 - From rev.3.2, Julius is released by the "Information Processing  
1277 - Society, Continuous Speech Consortium".  
1278 -  
1279 - The Windows DLL version was developed and released by Hideki BANNO  
1280 - (Nagoya University).  
1281 -  
1282 - The Windows Microsoft Speech API compatible version was developed by  
1283 - Takashi SUMIYOSHI (Kyoto University).  
1284 -  
1285 -  
1286 -  
1287 - 02/11/2009 JULIUS(1)  
recognize/src/julius/doc/manuals/mkbingram.txt
@@ -1,97 +0,0 @@ @@ -1,97 +0,0 @@
1 - mkbingram  
2 -  
3 -MKBINGRAM(1) MKBINGRAM(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - mkbingram  
9 - - make binary N-gram from ARPA N-gram file  
10 -  
11 -SYNOPSIS  
12 - mkbingram [-nlr forward_ngram.arpa] [-nrl backward_ngram.arpa]  
13 - [-d old_bingram_file] {output_bingram_file}  
14 -  
15 -DESCRIPTION  
16 - mkbingram is a tool to convert N-gram definition file(s) in ARPA  
17 - standard format to a compact Julius binary format. It will speed up the  
18 - initial loading time of N-gram much faster. It can read gzipped file  
19 - directly.  
20 -  
21 - From rev.4.0, Julius can deal with forward N-gram, backward N-gram and  
22 - their combinations. So, mkbingram now generates binary N-gram file from  
23 - one of them, or combining them two to produce one binary N-gram.  
24 -  
25 - When only a forward N-gram is specified, mkbingram generates binary  
26 - N-gram from only the forward N-gram. When using this binary N-gram at  
27 - Julius, it performs the 1st pass with the 2-gram probabilities in the  
28 - N-gram, and run the 2nd pass with the given N-gram fully, with  
29 - converting forward probabilities to backward probabilities by Bayes  
30 - rule.  
31 -  
32 - When only a backward N-gram is specified, mkbingram generates an binary  
33 - N-gram file that contains only the backward N-gram. The 1st pass will  
34 - use forward 2-gram probabilities that can be computed from the backward  
35 - 2-gram using Bayes rule, and the 2nd pass use the given backward N-gram  
36 - fully.  
37 -  
38 - When both forward and backward N-grams are specified, the 2-gram part  
39 - in the forward N-gram and all backward N-gram will be combined into  
40 - single bingram file. The forward 2-gram will be applied for the 1st  
41 - pass and backward N-gram for the 2nd pass. Note that both N-gram should  
42 - be trained in the same corpus with same parameters (i.e. cut-off  
43 - thresholds), with same vocabulary.  
44 -  
45 - The old binary N-gram produced by mkbingram of version 3.x and earlier  
46 - can be used in Julius-4, but you can convert the old version to the new  
47 - version by specifying it as input of current mkbingram by option "-d".  
48 -  
49 - Please note that binary N-gram file converted by mkbingram of version  
50 - 4.0 and later cannot be read by older Julius 3.x.  
51 -  
52 -OPTIONS  
53 - -nlr forward_ngram.arpa  
54 - Read in a forward (left-to-right) word N-gram file in ARPA standard  
55 - format.  
56 -  
57 - -nrl backward_ngram.arpa  
58 - Read in a backward (right-to-left) word N-gram file in ARPA standard  
59 - format.  
60 -  
61 - -d old_bingram_file  
62 - Read in a binary N-gram file.  
63 -  
64 - -swap  
65 - Swap BOS word <s> and EOS word </s> in N-gram.  
66 -  
67 - output_bingram_file  
68 - binary N-gram file name to output.  
69 -  
70 -EXAMPLES  
71 - Convert a set of forward and backward N-gram in ARPA format into Julius  
72 - binary form:  
73 - Convert a single forward 4-gram in ARPA format into a binary file:  
74 - Convert old binary N-gram file to current format:  
75 -  
76 -SEE ALSO  
77 - julius ( 1 ) ,  
78 - mkbinhmm ( 1 ) ,  
79 - mkbinhmmlist ( 1 )  
80 -  
81 -COPYRIGHT  
82 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
83 -  
84 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
85 -  
86 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
87 - Technology  
88 -  
89 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
90 - Technology  
91 -  
92 -LICENSE  
93 - The same as Julius.  
94 -  
95 -  
96 -  
97 - 02/11/2009 MKBINGRAM(1)  
recognize/src/julius/doc/manuals/mkbinhmm.txt
@@ -1,78 +0,0 @@ @@ -1,78 +0,0 @@
1 - mkbinhmm  
2 -  
3 -MKBINHMM(1) MKBINHMM(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - mkbinhmm  
9 - - convert HMM definition file in HTK ascii format to Julius binary  
10 - format  
11 -  
12 -SYNOPSIS  
13 - mkbinhmm [-htkconf HTKConfigFile] {hmmdefs_file} {binhmm_file}  
14 -  
15 -DESCRIPTION  
16 - mkbinhmm convert an HMM definition file in HTK ascii format into a  
17 - binary HMM file for Julius. It will greatly speed up the launch  
18 - process.  
19 -  
20 - You can also embed acoustic analysis condition parameters needed for  
21 - recognition into the output file. To embed the parameters, specify the  
22 - HTK Config file you have used to extract acoustic features for training  
23 - the HMM by the optione "-htkconf".  
24 -  
25 - The embedded parameters in a binary HMM format will be loaded into  
26 - Julius automatically, so you do not need to specify the acoustic  
27 - feature options at run time. It will be convenient when you deliver an  
28 - acoustic model.  
29 -  
30 - You can also specify binary file as the input. This can be used to  
31 - update the old binary format into new one, or to embed the config  
32 - parameters into the already existing binary files. If the input binhmm  
33 - already has acoustic analysis parameters embedded, they will be  
34 - overridden by the specified values.  
35 -  
36 -  
37 - mkbinhmm can read gzipped file as input.  
38 -  
39 -OPTIONS  
40 - -htkconf HTKConfigFile  
41 - HTK Config file you used at training time. If specified, the values  
42 - are embedded to the output file.  
43 -  
44 - hmmdefs_file  
45 - The source HMm definitino file in HTK ascii format or Julius binary  
46 - format.  
47 -  
48 - hmmdefs_file  
49 - Output file.  
50 -  
51 -EXAMPLES  
52 - Convert HTK ascii format HMM definition file into Julius binary file:  
53 - Furthermore, embed acoustic feature parameters as specified by Config  
54 - file  
55 - Embed the acoustic parameters into an existing binary file  
56 -  
57 -SEE ALSO  
58 - julius ( 1 ) ,  
59 - mkbingram ( 1 ) ,  
60 - mkbinhmmlist ( 1 )  
61 -  
62 -COPYRIGHT  
63 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
64 -  
65 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
66 -  
67 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
68 - Technology  
69 -  
70 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
71 - Technology  
72 -  
73 -LICENSE  
74 - The same as Julius.  
75 -  
76 -  
77 -  
78 - 10/02/2008 MKBINHMM(1)  
recognize/src/julius/doc/manuals/mkbinhmmlist.txt
@@ -1,64 +0,0 @@ @@ -1,64 +0,0 @@
1 - mkbinhmmlist  
2 -  
3 -MKBINHMMLIST(1) MKBINHMMLIST(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - mkbinhmmlist  
9 - - convert HMMList file into binary format  
10 -  
11 -SYNOPSIS  
12 - mkbinhmmlist {hmmdefs_file} {HMMList_file} {output_binhmmlist_file}  
13 -  
14 -DESCRIPTION  
15 - mkbinhmmlist converts a HMMList file to binary format. Since the index  
16 - trees for lookup are also stored in the binary format, it will speed up  
17 - the startup of Julius, namely when using big HMMList file.  
18 -  
19 - For conversion, HMM definition file hmmdefs_file that will be used  
20 - together at Julius needs to be specified. The format of the HMM  
21 - definition file can be either ascii or Julius binary format.  
22 -  
23 - The output binary file can be used in Julius as the same by "-hlist".  
24 - The format wil be auto-detected by Julius.  
25 -  
26 -  
27 - mkbinhmmlist can read gzipped file.  
28 -  
29 -OPTIONS  
30 - hmmdefs_file  
31 - Acoustic HMM definition file, in HMM ascii format or Julius binary  
32 - format.  
33 -  
34 - HMMList_file  
35 - Source HMMList file  
36 -  
37 - output_binhmmlist_file  
38 - Output file, will be overwritten if already exist.  
39 -  
40 -EXAMPLES  
41 - Convert a HMMList file logicalTri into binary format and store to  
42 - logicalTri.bin:  
43 -  
44 -SEE ALSO  
45 - julius ( 1 ) ,  
46 - mkbinhmm ( 1 )  
47 -  
48 -COPYRIGHT  
49 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
50 -  
51 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
52 -  
53 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
54 - Technology  
55 -  
56 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
57 - Technology  
58 -  
59 -LICENSE  
60 - The same as Julius.  
61 -  
62 -  
63 -  
64 - 10/02/2008 MKBINHMMLIST(1)  
recognize/src/julius/doc/manuals/mkdfa.pl.txt
@@ -1,67 +0,0 @@ @@ -1,67 +0,0 @@
1 - mkdfa.pl  
2 -  
3 -MKDFA.PL(1) MKDFA.PL(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - mkdfa.pl  
9 - - grammar compiler  
10 -  
11 -SYNOPSIS  
12 - mkdfa.pl [options...] {prefix}  
13 -  
14 -DESCRIPTION  
15 - mkdfa.pl compiles the Julian format grammar (.grammar and .voca) to  
16 - Julian native formats (.dfa and .dict). In addition, ".term" will be  
17 - also generated that stores correspondence of category ID used in the  
18 - output files to the source category name.  
19 -  
20 -  
21 - prefix should be the common file name prefix of ".grammar" and "voca"  
22 - file. From prefix.grammar and prefix.voca file, prefix.dfa, prefix.dict  
23 - and prefix.term will be output.  
24 -  
25 -OPTIONS  
26 - -n  
27 - Not process dictionary. You can only convert .grammar file to .dfa  
28 - file without .voca file.  
29 -  
30 -ENVIRONMENT VARIABLES  
31 - TMP or TEMP  
32 - Set directory to store temporal file. If not specified, one of them  
33 - on the following list will be used: /tmp, /var/tmp, /WINDOWS/Temp,  
34 - /WINNT/Temp.  
35 -  
36 -EXAMPLES  
37 - Convert a grammar foo.grammar and foo.voca to foo.dfa, foo.voca and  
38 - foo.term.  
39 -  
40 -SEE ALSO  
41 - julius ( 1 ) ,  
42 - generate ( 1 ) ,  
43 - nextword ( 1 ) ,  
44 - accept_check ( 1 ) ,  
45 - dfa_minimize ( 1 )  
46 -  
47 -DIAGNOSTICS  
48 - mkdfa.pl invokes mkfa and dfa_minimize internally. They should be  
49 - placed at the same directory as mkdfa.pl.  
50 -  
51 -COPYRIGHT  
52 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
53 -  
54 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
55 -  
56 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
57 - Technology  
58 -  
59 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
60 - Technology  
61 -  
62 -LICENSE  
63 - The same as Julius.  
64 -  
65 -  
66 -  
67 - 10/02/2008 MKDFA.PL(1)  
recognize/src/julius/doc/manuals/mkgshmm.txt
@@ -1,50 +0,0 @@ @@ -1,50 +0,0 @@
1 - mkgshmm  
2 -  
3 -MKGSHMM(1) MKGSHMM(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - mkgshmm  
9 - - convert monophone HMM to GS HMM for Julius  
10 -  
11 -SYNOPSIS  
12 - mkgshmm {monophone_hmmdefs}  
13 - >  
14 - {outputfile}  
15 -  
16 -DESCRIPTION  
17 - mkgshmm converts monophone HMM definition file in HTK format into a  
18 - special format for Gaussian Mixture Selection (GMS) in Julius.  
19 -  
20 - GMS is an algorithm to reduce the amount of acoustic computation with  
21 - triphone HMM, by pre-selection of promising gaussian mixtures using  
22 - likelihoods of corresponding monophone mixtures.  
23 -  
24 -EXAMPLES  
25 - (1) Prepare a monophone model which was trained by the same corpus as  
26 - target triphone model.  
27 -  
28 - (2) Convert the monophone model using mkgshmm.  
29 - (3) Specify the output file in Julius with option "-gshmm"  
30 -  
31 -SEE ALSO  
32 - julius ( 1 )  
33 -  
34 -COPYRIGHT  
35 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
36 -  
37 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
38 -  
39 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
40 - Technology  
41 -  
42 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
43 - Technology  
44 -  
45 -LICENSE  
46 - The same as Julius.  
47 -  
48 -  
49 -  
50 - 10/02/2008 MKGSHMM(1)  
recognize/src/julius/doc/manuals/mkss.txt
@@ -1,55 +0,0 @@ @@ -1,55 +0,0 @@
1 - mkss  
2 -  
3 -MKSS(1) MKSS(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - mkss  
9 - - calculate average spectrum for spectral subtraction  
10 -  
11 -SYNOPSIS  
12 - mkss [options...] {filename}  
13 -  
14 -DESCRIPTION  
15 - mkss is a tool to estimate noise spectrum for spectral subtraction on  
16 - Julius. It reads a few seconds of sound data from microphone input,  
17 - calculate the average spectrum and save it to a file. The output file  
18 - can be used as a noise spectrum data in Julius (option "-ssload").  
19 -  
20 - The recording will start immediately after startup. Sampling format is  
21 - 16bit, monoral. If outpue file already exist, it will be overridden.  
22 -  
23 -OPTIONS  
24 - -freq Hz  
25 - Sampling frequency in Hz (default: 16,000)  
26 -  
27 - -len msec  
28 - capture length in milliseconds (default: 3000)  
29 -  
30 - -fsize sample_num  
31 - frame size in number of samples (default: 400)  
32 -  
33 - -fshift sample_num  
34 - frame shift in number of samples (default: 160)  
35 -  
36 -SEE ALSO  
37 - julius ( 1 )  
38 -  
39 -COPYRIGHT  
40 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
41 -  
42 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
43 -  
44 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
45 - Technology  
46 -  
47 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
48 - Technology  
49 -  
50 -LICENSE  
51 - The same as Julius.  
52 -  
53 -  
54 -  
55 - 10/02/2008 MKSS(1)  
recognize/src/julius/doc/manuals/nextword.txt
@@ -1,82 +0,0 @@ @@ -1,82 +0,0 @@
1 - nextword  
2 -  
3 -NEXTWORD(1) NEXTWORD(1)  
4 -  
5 -  
6 -  
7 -NAME  
8 - nextword  
9 - - display next predicted words (in reverse order)  
10 -  
11 -SYNOPSIS  
12 - nextword [-t] [-r] [-s spname] [-v] {prefix}  
13 -  
14 -DESCRIPTION  
15 - Given a partial (part of) sentence from the end, it outputs the next  
16 - words allowed in the specified grammar.  
17 -  
18 -  
19 - .dfa, .dict and .term files are needed to execute. They can be  
20 - generated from .grammar and .voca file by mkdfa.pl.  
21 -  
22 - Please note that the latter part of sentence should be given, since the  
23 - main 2nd pass does a right-to-left parsing.  
24 -  
25 -OPTIONS  
26 - -t  
27 - Input / Output in category name. (default: word)  
28 -  
29 - -r  
30 - Enter in reverse order  
31 -  
32 - -s spname  
33 - the name string of short-pause word to be supressed (default: "sp")  
34 -  
35 - -v  
36 - Debug output.  
37 -  
38 -EXAMPLES  
39 - Exmple output of a sample grammar "fruit":  
40 -  
41 - % nextword fruit  
42 - Stat: init_voca: read 36 words  
43 - Reading in term file (optional)...done  
44 - 15 categories, 36 words  
45 - DFA has 26 nodes and 42 arcs  
46 - -----  
47 - command completion is disabled  
48 - -----  
49 - wseq > A BANANA </s>  
50 - [wseq: A BANANA </s>]  
51 - [cate: (NUM_1|NUM_1|A|A) FRUIT_SINGULAR NS_E]  
52 - PREDICTED CATEGORIES/WORDS:  
53 - NS_B (<s> )  
54 - HAVE (HAVE )  
55 - WANT (WANT )  
56 - NS_B (<s> )  
57 - HAVE (HAVE )  
58 - WANT (WANT )  
59 -  
60 -  
61 -SEE ALSO  
62 - mkdfa.pl ( 1 ) ,  
63 - generate ( 1 ) ,  
64 - accept_check ( 1 )  
65 -  
66 -COPYRIGHT  
67 - Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan  
68 -  
69 - Copyright (c) 1991-2008 Kawahara Lab., Kyoto University  
70 -  
71 - Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and  
72 - Technology  
73 -  
74 - Copyright (c) 2005-2008 Julius project team, Nagoya Institute of  
75 - Technology  
76 -  
77 -LICENSE  
78 - The same as Julius.  
79 -  
80 -  
81 -  
82 - 10/02/2008 NEXTWORD(1)  
recognize/src/julius/install/julius-4.2.2.tar.gz
No preview for this file type