* Add GitHub Action to automatically push to PyPI on Release x.y.z commit
* some housekeeping for PyPI upload
* add version.py
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
For a 30s audio file with no silence, `ndimage.median_filter` took 7s, whereas `signal.medfilt` took 30s.
Co-authored-by: Umar Farooqi <umar@paystash.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
* Update Hebrew language code to he per IANA registry
Per the [IANA registry](https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry), `iw` was deprecated as the code for Hebrew in 1989, and the preferred code is `he`.
The correct subtag:
```
%%
Type: language
Subtag: he
Description: Hebrew
Added: 2005-10-16
Suppress-Script: Hebr
%%
```
And the deprecation:
```
%%
Type: language
Subtag: iw
Description: Hebrew
Added: 2005-10-16
Deprecated: 1989-01-01
Preferred-Value: he
Suppress-Script: Hebr
%%
```
* Update Hebrew ISO code to he
Per discussion, it's ok to make this change without backwards compatibility
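Since the change drops `iw` without a compatibility alias, callers that still hold the old code would have to map it themselves. A minimal sketch of such a mapping, assuming a hypothetical helper named `normalize_language_code` (not part of this change):

```python
# Hypothetical helper, not part of the project: maps deprecated language
# subtags to their preferred values before they are used as language codes.
DEPRECATED_SUBTAGS = {
    "iw": "he",  # Hebrew; Preferred-Value taken from the registry record above
}


def normalize_language_code(code: str) -> str:
    """Return the preferred subtag for `code`, or `code` itself if it is not deprecated."""
    code = code.strip().lower()
    return DEPRECATED_SUBTAGS.get(code, code)
```

Codes that are not in the table pass through unchanged, so the helper is safe to apply to every incoming language code.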
- The "large-v2" model is trained for more epochs with regularization and shows improved performance compared to the previous large model.
- It has the same architecture as the original large model.
- When `load_model("large")` is called, the "large-v2" model will be loaded.
- We will soon update the paper regarding this new model.
Fix bug: RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper__index_select)
* Add --threads option to transcribe
By default, Torch on CPU uses number_of_cores/2 threads. This option allows
overriding that default.
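A rough sketch of how such an option could be wired up. `torch.set_num_threads` is a real PyTorch call; the function names `build_parser` and `apply_threads`, and the convention that `0` means "keep Torch's default", are assumptions made for illustration and may not match transcribe.py:

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # Assumption: 0 means "do not override Torch's own default".
    parser.add_argument(
        "--threads",
        type=int,
        default=0,
        help="number of threads used by torch for CPU inference",
    )
    return parser


def apply_threads(threads: int, set_num_threads) -> bool:
    """Call `set_num_threads(threads)` only when an override was requested."""
    if threads > 0:
        set_num_threads(threads)
        return True
    return False


# Typical use, assuming torch is installed:
#   import torch
#   args = build_parser().parse_args()
#   apply_threads(args.threads, torch.set_num_threads)
```

Passing the setter in as an argument keeps the sketch importable without torch.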
* Update transcribe.py
Co-authored-by: Jong Wook Kim <ilikekjw@gmail.com>
* Use two-digit hours in WebVTT timestamps
Per the WebVTT specification [0]:
> A WebVTT timestamp consists of the following components, in the given
> order:
>
> 1. Optionally (required if hours is non-zero):
>    1. Two or more ASCII digits, representing the hours as a base ten
>       integer.
>    2. A U+003A COLON character (:)
YouTube won’t accept timestamps containing single-digit hours.
[0] https://www.w3.org/TR/webvtt1/#webvtt-timestamp
* Strip segment text in WebVTT output
We already do this for plain text and SubRip output, so we should do it
for WebVTT too.
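Both WebVTT changes above can be sketched together. This is an illustration, not the project's actual code: the names `format_timestamp` and `write_vtt`, and the `start`/`end`/`text` segment keys, are assumptions. The hours field is always emitted here, which the specification permits, and is padded to at least two digits.

```python
import io


def format_timestamp(seconds: float) -> str:
    """Format seconds as a WebVTT timestamp with at least two hour digits."""
    if seconds < 0:
        raise ValueError("timestamp must be non-negative")
    milliseconds = round(seconds * 1000.0)
    hours, milliseconds = divmod(milliseconds, 3_600_000)
    minutes, milliseconds = divmod(milliseconds, 60_000)
    secs, milliseconds = divmod(milliseconds, 1_000)
    # {hours:02d} pads to two digits but does not truncate longer values.
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{milliseconds:03d}"


def write_vtt(segments, file) -> None:
    """Write segments as WebVTT cues, stripping whitespace around each text."""
    print("WEBVTT\n", file=file)
    for segment in segments:
        start = format_timestamp(segment["start"])
        end = format_timestamp(segment["end"])
        text = segment["text"].strip()
        print(f"{start} --> {end}\n{text}\n", file=file)


# Example: one cue that crosses the one-hour mark.
vtt_buffer = io.StringIO()
write_vtt([{"start": 0.0, "end": 3661.5, "text": " Hello "}], vtt_buffer)
```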
* Adds missing command for install (mac)
Required for users who didn't previously have Rust installed.
* minor wording change
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
* Write each sentence as a separate line for the txt output
* Update utils.py
Co-authored-by: EliEron <example@example.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
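The txt-output change above can be sketched roughly as follows, assuming each transcribed segment is a dict with a `text` key. The function name `write_txt` and the segment shape are assumptions for illustration, not confirmed by these commit messages.

```python
import io


def write_txt(segments, file) -> None:
    """Write the stripped text of each segment on its own line."""
    for segment in segments:
        print(segment["text"].strip(), file=file, flush=True)


# Example: two segments produce two lines.
txt_buffer = io.StringIO()
write_txt([{"text": " Hello there."}, {"text": " General Kenobi. "}], txt_buffer)
```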
* add progress bar to transcribe loop
* improved warning message for English-only models
* add --condition_on_previous_text
* progressbar renames
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>