VulumeCode
2037b65f3f
Context prompt ( #128 )
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 05:22:33 -07:00
EliEron
fc0f40981d
Write each sentence as a separate line for the txt output ( #101 )
* Write each sentence as a separate line for the txt output
* Update utils.py
Co-authored-by: EliEron <example@example.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 04:52:28 -07:00
VulumeCode
520796a34c
fix token suppression ( #123 )
2022-09-26 04:35:21 -07:00
fatih
ead77fab97
add srt subtitle export utility ( #102 )
* add srt subtitle export utility
* simplifying
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 03:50:26 -07:00
Ashutosh Tripathi
5485428c81
arch linux ffmpeg install ( #93 )
2022-09-26 03:24:47 -07:00
fatih
9e7e418ff1
add progress bar for transcribe loop ( #100 )
* add progress bar to transcribe loop
* improved warning message for English-only models
* add --condition_on_previous_text
* progressbar renames
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 03:24:13 -07:00
Jong Wook Kim
5d8d3e75a4
add --condition_on_previous_text
2022-09-25 05:16:08 -07:00
Jong Wook Kim
2d3032de01
improved warning message for English-only models
2022-09-25 02:10:36 -07:00
Jong Wook Kim
8cf36f3508
allow hyphens and single quotes between words
2022-09-23 20:11:27 +09:00
Jong Wook Kim
15ab548263
nocaptions -> nospeech to match the paper figure
2022-09-23 15:45:32 +09:00
mj-kh
61989529b7
Fix possible mistake when loading model to device ( #57 )
Before this change, the model is loaded into GPU regardless of the value of "device" argument in CLI.
(e.g. whisper "test.wav" --device cpu loads into GPU anyway)
2022-09-23 15:21:47 +09:00
Niklas K
f296bcd3fa
Avoid keeping redundant copies of model weights in memory during load ( #42 )
* don't keep copies of model weights in host memory
* adding type annotation
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-23 12:57:39 +09:00
Sidney Radcliffe
a4fe05aa71
Add conda environment.yml (and fix requirements.txt) ( #8 )
* fix: more-itertools name in requirements.txt
* feature: minimal environment.yml for conda
* Revert "feature: minimal environment.yml for conda"
This reverts commit 8fd7438b368b0eb5df85f667fea911f293fa5e6d.
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-23 12:30:45 +09:00
Giovanni Lanzani
957ffc77de
Add rust as a dependency ( #30 )
* Add rust as a dependency
* Update README.md
Co-authored-by: Jong Wook Kim <ilikekjw@gmail.com>
2022-09-23 12:26:38 +09:00
Ram Rachum
59f543e218
Fix exception cause in audio.py ( #33 )
2022-09-23 12:12:37 +09:00
hanacchi
c85eaaae29
Use UTF-8 encoding to save the txt and vtt files ( #37 )
Explicitly set the text encoding to UTF-8 in order to avoid UnicodeEncodeErrors
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-23 12:10:55 +09:00
EliEron
759e8d47a8
Fix output_dir argument when audio file is a path ( #45 )
2022-09-23 11:38:37 +09:00
Micheal Taylor
c0607e8d22
Add scoop install for windows ( #48 )
Adding scoop install to setup for windows for ffmpeg
2022-09-23 11:37:57 +09:00
Jong Wook Kim
e90b8fa7e8
Merge pull request #14 from bquast/patch-1
make LICENSE a link instead of code-formatted text
2022-09-22 11:51:05 +09:00
Jong Wook Kim
f83cb83a42
Merge pull request #24 from ldanilov/patch-1
fixes the link to the model paper
2022-09-22 11:48:57 +09:00
Lev Danilov
45fc3d43c1
fixes the link to the model paper
2022-09-21 21:25:17 -04:00
Bastiaan Quast
08a739ad79
make LICENSE a link instead of code-formatted text
2022-09-21 23:17:02 +02:00
Jong Wook Kim
49a3ffc997
add section Available models and languages
2022-09-22 05:36:25 +09:00
Jong Wook Kim
cfd6bdda21
a note on speed-accuracy tradeoffs
2022-09-22 02:58:56 +09:00
Jong Wook Kim
834f00a0ea
making small model the default
2022-09-22 02:45:12 +09:00
Jong Wook Kim
6e3be77e1a
initial commit
2022-09-22 01:09:43 +09:00