whisper

mirror of https://github.com/openai/whisper.git synced 2025-11-28 00:03:40 +00:00

Author	SHA1	Message	Date
Nathan Harmon	679ae1d141	Fix: Ensure DTW cost tensor is on the same device as input tensor (#2561) Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2025-06-25 17:42:09 -07:00
Jong Wook Kim	27f971320a	using sdpa if available (#2359) * using sdpa if available * Update model.py	2024-09-30 10:27:14 -07:00
ryanheise	ba3f3cd54b	Skip silence around hallucinations (#1838) * Add clip_timestamps option * Add hallucination_silence_threshold option * Fix typing for python < 3.9 --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-12-18 12:11:16 -08:00
Arthur Kim	8b330df096	Add .pre-commit-config.yaml (#1528) * Add .pre-commit-config.yaml Co-authored-by: arthur <arthur@rtzr.ai> * flake8 E741 --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-09-18 16:15:33 -07:00
taylorchu	e8622f9afc	word timing tweaks (#1559) * word timing tweaks * comment on eot * clearer comments	2023-08-08 06:48:56 +09:00
ryanheise	f572f2161b	Improve timestamp heuristics. (#1461) * Improve timestamp heuristics. * Track pauses with last_speech_timestamp	2023-06-29 16:51:24 -07:00
Paul Willot	7ca9fbea86	Fix numba depreceation notice (#1233) From numba 0.57 raise a warning if `nopython` is not supplied: https://numba.readthedocs.io/en/stable/reference/deprecation.html#deprecation-of-object-mode-fall-back-behaviour-when-using-jit	2023-05-04 23:48:06 -07:00
ryanheise	255887f219	Squash long words at window and sentence boundaries. (#1114) * Squash long words at window and sentence boundaries. * Formatting requirements. * Fix squashing logic to point to correct words. --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-04-10 17:23:53 -07:00
Jong Wook Kim	79c43e4859	abort find_alignment on empty input (#1090)	2023-03-14 12:47:58 -07:00
Guillaume Klein	671ac5a4ce	Fix alignment between the segments and the list of words (#1087) * Fix alignment between the segments and the list of words * Ensure the word index does not overflow	2023-03-13 16:34:09 -07:00
Jong Wook Kim	38f2f4d99d	fix all_tokens handling that caused more repetitions and discrepancy in JSON (#1060)	2023-03-08 15:34:07 -08:00
Jong Wook Kim	b80bcf610d	apply formatting with `black` (#1038) * applying black (with the default 88-column limit) * add flake8 * add isort * fix isort	2023-03-06 15:50:37 -08:00
Jong Wook Kim	500d0fe966	word-level timestamps in `transcribe()` (#869) * word-level timestamps in `transcribe()` * moving to `timing.py` * numba implementation for dtw, replacing dtw-python * triton implementation for dtw * add test for dtw implementations * triton implementation of median_filter * a simple word-level timestamps test * add scipy as dev dependency * installs an older version of Triton if CUDA < 11.4 * fix broken merge * loosen nvcc version match regex * find_alignment() function * miscellaneous improvements * skip median filtering when the input is too small * Expose punctuation options in cli and transcribe() (#973) * fix merge error * fix merge error 2 * annotating that word_timestamps is experimental --------- Co-authored-by: ryanheise <ryan@ryanheise.com>	2023-03-06 14:00:49 -08:00

13 Commits