davidmezzetti commited on
Commit
62526e7
1 Parent(s): 7aa1d35

Add model files

Browse files
Files changed (3) hide show
  1. README.md +30 -0
  2. config.yaml +97 -0
  3. model.onnx +3 -0
README.md CHANGED
@@ -1,3 +1,33 @@
1
  ---
 
 
 
 
 
 
 
 
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ tags:
3
+ - espnet
4
+ - audio
5
+ - text-to-speech
6
+ - onnx
7
+ language: en
8
+ datasets:
9
+ - ljspeech
10
  license: apache-2.0
11
  ---
12
+
13
+ # ESPnet JETS Text-to-Speech (TTS) Model for ONNX
14
+
15
+ [imdanboy/jets](https://huggingface.co/imdanboy/jets) exported to ONNX. This model is an ONNX export using the [espnet_onnx](https://github.com/espnet/espnet_onnx) library.
16
+
17
+ ## Usage with txtai
18
+
19
+ txtai has a built in Text to Speech (TTS) pipeline that makes using this model easy.
20
+
21
+ ```python
22
+ ```
23
+
24
+ ## Usage with ONNX
25
+
26
+ This model can also be run directly with ONNX provided the input text is tokenized. Tokenization can be done with [ttstokenizer](https://github.com/neuml/ttstokenizer).
27
+
28
+ ```python
29
+ ```
30
+
31
+ ## How to export
32
+
33
+ More information on how to export ESPnet models to ONNX can be [found here](https://github.com/espnet/espnet_onnx#text2speech-inference).
config.yaml ADDED
@@ -0,0 +1,97 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ normalize:
2
+ eps: 1.0e-20
3
+ norm_means: true
4
+ norm_vars: true
5
+ stats_file: imdanboy/ljspeech_tts_train_jets_raw_phn_tacotron_g2p_en_no_space_train.total_count.ave/feats_stats.npz
6
+ type: gmvn
7
+ use_normalize: true
8
+ text_cleaner:
9
+ cleaner_types: tacotron
10
+ token:
11
+ list:
12
+ - <blank>
13
+ - <unk>
14
+ - AH0
15
+ - N
16
+ - T
17
+ - D
18
+ - S
19
+ - R
20
+ - L
21
+ - DH
22
+ - K
23
+ - Z
24
+ - IH1
25
+ - IH0
26
+ - M
27
+ - EH1
28
+ - W
29
+ - P
30
+ - AE1
31
+ - AH1
32
+ - V
33
+ - ER0
34
+ - F
35
+ - ','
36
+ - AA1
37
+ - B
38
+ - HH
39
+ - IY1
40
+ - UW1
41
+ - IY0
42
+ - AO1
43
+ - EY1
44
+ - AY1
45
+ - .
46
+ - OW1
47
+ - SH
48
+ - NG
49
+ - G
50
+ - ER1
51
+ - CH
52
+ - JH
53
+ - Y
54
+ - AW1
55
+ - TH
56
+ - UH1
57
+ - EH2
58
+ - OW0
59
+ - EY2
60
+ - AO0
61
+ - IH2
62
+ - AE2
63
+ - AY2
64
+ - AA2
65
+ - UW0
66
+ - EH0
67
+ - OY1
68
+ - EY0
69
+ - AO2
70
+ - ZH
71
+ - OW2
72
+ - AE0
73
+ - UW2
74
+ - AH2
75
+ - AY0
76
+ - IY2
77
+ - AW2
78
+ - AA0
79
+ - ''''
80
+ - ER2
81
+ - UH2
82
+ - '?'
83
+ - OY2
84
+ - '!'
85
+ - AW0
86
+ - UH0
87
+ - OY0
88
+ - ..
89
+ - <sos/eos>
90
+ tokenizer:
91
+ g2p_type: g2p_en_no_space
92
+ token_type: phn
93
+ tts_model:
94
+ model_path: imdanboy/ljspeech_tts_train_jets_raw_phn_tacotron_g2p_en_no_space_train.total_count.ave/full/jets.onnx
95
+ model_type: JETS
96
+ vocoder:
97
+ vocoder_type: not_used
model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f8222def33152135d49b4a27dff203c24c2c6c68a7fcaf2b649475ac2fe0c871
3
+ size 132537001