4 Commits

Author SHA1 Message Date
Jong Wook Kim
839639a223
Use tiktoken (#1044)
* use tiktoken==0.3.0

* formatting

* tuple should be safer

* Update whisper/tokenizer.py

Co-authored-by: Ruhollah Majdoddin <r.majdodin@gmail.com>

* use tiktoken 0.3.1

* reflecting suggestions

* cleanup

* bypassing load_tiktoken_bpe to avoid blobfile dep

---------

Co-authored-by: Ruhollah Majdoddin <r.majdodin@gmail.com>
2023-03-13 02:34:16 -07:00
Jong Wook Kim
500d0fe966
word-level timestamps in transcribe() (#869)
* word-level timestamps in `transcribe()`

* moving to `timing.py`

* numba implementation for dtw, replacing dtw-python

* triton implementation for dtw

* add test for dtw implementations

* triton implementation of median_filter

* a simple word-level timestamps test

* add scipy as dev dependency

* installs an older version of Triton if CUDA < 11.4

* fix broken merge

* loosen nvcc version match regex

* find_alignment() function

* miscellaneous improvements

* skip median filtering when the input is too small

* Expose punctuation options in cli and transcribe() (#973)

* fix merge error

* fix merge error 2

* annotating that word_timestamps is experimental

---------

Co-authored-by: ryanheise <ryan@ryanheise.com>
2023-03-06 14:00:49 -08:00
Sidney Radcliffe
a4fe05aa71
Add conda environment.yml (and fix requirements.txt) (#8)
* fix: more-itertools name in requirements.txt

* feature: minimal environment.yml for conda

* Revert "feature: minimal environment.yml for conda"

This reverts commit 8fd7438b368b0eb5df85f667fea911f293fa5e6d.

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-23 12:30:45 +09:00
Jong Wook Kim
6e3be77e1a initial commit 2022-09-22 01:09:43 +09:00