Update Dockerfile for the new voice cloning system. #42
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixing some of the issues created by the new voice cloning system in the Dockerfile. as mentioned in #40 . I've tagged a specific python version as the base image, so if that ever needs to be updated, it should be easy peasy. I changed it to copy the project files from the cloned directory, rather than pulling from the git repo during build, better for reproducibility. I added ffmpeg into the image to handle audio files other than pure .wav.
I cannot test CUDA, as I don't have any hardware to test on. CPU speech generation works perfectly, and feels faster, though I've not run benchmarks on it. However, the actual voice cloning features error on CPU.
During startup it does display forcecpu=True, so maybe something isn't respecting the config?
During the build, I do get this:
INFO | fairseq.tasks.text_to_speech | Please install tensorboardX: pip install tensorboardX
Should this be added as a dependency?
You are free to delay this merge until these are resolved, but other than voice cloning, this image works.