@@ -99,20 +129,14 @@ Improved streaming mode is suitable for complex languages such as Chinese, Japan
API Docs can be accessed from [http://localhost:8020/docs](http://localhost:8020/docs)
-
# Voice Samples
+
# How to add speaker
-
You can find a sample in this repository; more details are in the API documentation.
-
-
[erew123](https://github.com/erew123) put together a pack of 40+ voices, which you can download and try out [here](https://filebin.net/t97nd69ac7qm2rsf)
+
By default, a `speakers` folder should appear in the working folder. Put a wav file containing the voice sample there. You can also create a subfolder and put several voice samples in it, which gives more accurate results.
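
The layout described above (a single wav file per speaker, or a subfolder holding several samples of one speaker) can be illustrated with a small sketch. This is not the server's own code: the function name `discover_speakers` and the rule that the speaker name equals the file stem or the subfolder name are assumptions made for illustration.

```python
from pathlib import Path


def discover_speakers(speakers_dir):
    """Map each speaker name to the list of wav samples found for it.

    Hypothetical helper: it only mirrors the folder layout described in the
    text and does not reproduce how the server scans the folder.
    """
    root = Path(speakers_dir)
    speakers = {}
    for entry in sorted(root.iterdir()):
        if entry.is_file() and entry.suffix.lower() == ".wav":
            # A single wav file: the name is the file name without extension.
            speakers[entry.stem] = [entry]
        elif entry.is_dir():
            # A subfolder: every wav inside is treated as a sample of one speaker.
            samples = sorted(
                p for p in entry.iterdir()
                if p.is_file() and p.suffix.lower() == ".wav"
            )
            if samples:
                speakers[entry.name] = samples
    return speakers
```

Calling `discover_speakers("speakers")` on a folder that contains `female.wav` and a subfolder `narrator/` with two wav files would return two entries, one with a single sample and one with two.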
# Selecting Folder
You can change the folders for speakers and the folder for output via the API.
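
As a rough sketch of what such calls might look like, the snippet below builds two JSON POST requests with the standard library. The endpoint paths (`set_speaker_folder`, `set_output`) and the field names (`speaker_folder`, `output_folder`) are assumptions, not confirmed by this document; check the API docs at `http://localhost:8020/docs` for the actual names before relying on them.

```python
import json
from urllib import request

BASE_URL = "http://localhost:8020"  # host and port from the API docs link


def build_folder_request(endpoint, field, folder):
    """Build (but do not send) a JSON POST request that names a folder."""
    body = json.dumps({field: folder}).encode("utf-8")
    return request.Request(
        f"{BASE_URL}/{endpoint}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Assumed endpoint and field names; confirm them in /docs.
speaker_req = build_folder_request(
    "set_speaker_folder", "speaker_folder", "/path/to/speakers"
)
output_req = build_folder_request("set_output", "output_folder", "/path/to/output")

# To send a request once the server is running:
# with request.urlopen(speaker_req) as resp:
#     print(resp.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to inspect the URL and payload before anything reaches the server.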
-
# Get Speakers
-
-
Once you have at least one file in your speakers folder, you can get its name via the API; after that, you only need to specify the file name.
-
# Note on creating samples for quality voice cloning
The following post is a quote by user [Material1276 from reddit](https://www.reddit.com/r/Oobabooga/comments/1807tsl/comment/ka5l8w9/?share_id=_5hh4KJTXrEOSP0hR0hCK&utm_content=2&utm_medium=android_app&utm_name=androidcss&utm_source=share&utm_term=1)
@@ -135,28 +159,8 @@ The following post is a quote by user [Material1276 from reddit](https://www.red
>
> Using AI-generated audio clips may introduce unwanted sounds, as it's already a copy/simulation of a voice, though this would need testing.
-
# Use Docker image with Docker Compose
-
-
A Dockerfile is provided to build a Docker image, and a docker-compose.yml file is provided to run the server with Docker Compose as a service.
-
-
You will need to set up the environment variables by copying the .env.example file to .env and filling in the values.
-
If you want to use your own speakers, you can put them in the `example` folder before building the image.
-
The example folder will be copied to the container and the server will use it as a speaker folder.
-
-
You can build the image with the following command:
-
-
```bash
-
cd docker
-
docker compose build
-
```
-
-
Then you can run the server with the following command:
-
-
```bash
-
docker compose up # or with -d to run in background
-
```
-
# Credit
1. Thanks to the author **Kolja Beigel** for the repository [RealtimeTTS](https://github.com/KoljaB/RealtimeTTS); I took some of its code for my project.
2. Thanks **[erew123](https://github.com/oobabooga/text-generation-webui/issues/4712#issuecomment-1825593734)** for the note about creating samples and the code to download the models
+
3. Thanks **lendot** for helping to fix the multiprocessing bug and adding code to use multiple samples for speakers