whisper

mirror of https://github.com/openai/whisper.git synced 2025-11-26 23:46:09 +00:00

Author	SHA1	Message	Date
PiotrBLL	adaec5a2e1	Add requirements_hpu.txt	2024-11-12 14:53:08 +01:00
PiotrBLL	e1545f4776	Fix: compute_device name in Whisper model	2024-11-11 22:22:08 +01:00
PiotrBLL	b0cf21b9b5	Add HPU checking in decoding and transcribe handle	2024-11-11 22:21:12 +01:00
PiotrBLL	d36696f808	Add util checking hpu tensor	2024-11-11 22:20:50 +01:00
PiotrBLL	e1d4b7b4d7	Add checking sparse tensors in Whisper model	2024-11-11 22:20:24 +01:00
PiotrBLL	32ec355dbf	Upgrade the Dockerfile.hpu ubuntu image	2024-11-11 22:19:54 +01:00
PiotrBLL	f6ccd01567	Wrap model to HPU + update Dockerfile packages	2024-11-11 16:52:01 +01:00
PiotrBLL	50adb7654b	Fix warnings	2024-11-09 01:20:02 +01:00
PiotrBLL	2d5a552596	Fix hpu device_name typo	2024-11-08 19:02:33 +01:00
PiotrBLL	82a5380ad7	Fix Dockerfile.hpu `ffmpeg` package install, update README.md whisper container usage	2024-11-08 18:17:37 +01:00
PiotrBLL	9e3936581a	Add optimum[habana]==1.13.1 in Dockerfile.hpu	2024-11-07 18:46:19 +01:00
PiotrBLL	3825dd0c5a	Rename and Fix Dockerfile.hpu, add .dockerignore, update in README.md	2024-11-07 17:23:30 +01:00
PiotrBLL	a49838ceb3	Add Pytorch Docker images info in README.md	2024-11-06 23:44:42 +01:00
PiotrBLL	1be8041ede	Add hpu usage info in jupyter notebook files	2024-11-06 00:47:12 +01:00
PiotrBLL	6e66088bd0	Add Dockerfile_hpu + README.md section of hpu usage	2024-11-06 00:35:03 +01:00
PiotrBLL	c479ff3614	Add hpu transcribe test	2024-11-05 01:27:48 +01:00
PiotrBLL	bee28658b9	Fix importing habana-frameworks library conditionally	2024-11-05 01:27:31 +01:00
PiotrBLL	52062dd798	Add hpu_utils	2024-10-31 13:56:55 +01:00
PiotrBLL	9269b2ac35	Add HPU support in transcribe and timing + tests	2024-10-31 13:56:46 +01:00
PiotrBLL	23651574df	Update requirements	2024-10-31 13:55:24 +01:00
PiotrBLL	c7fe328c3f	Add testing hpu file	2024-10-31 13:55:05 +01:00
PiotrBLL	8c3680cb02	Add requires_hpu marker in tests	2024-10-31 13:54:53 +01:00
PiotrBLL	b7069e579d	Add initial imports and usage of `wrap_in_hpu_graph`	2024-10-31 13:54:29 +01:00
Jong Wook Kim	25639fc17d	Release 20240930 v20240930	2024-09-30 11:20:53 -07:00
Jong Wook Kim	260bbcfcb3	allowing numpy 2 in tests (#2362) * allowing numpy 2 in tests * allowing numpy 2 in tests	2024-09-30 11:18:17 -07:00
Jong Wook Kim	25e5c364e0	large-v3-turbo model (#2361)	2024-09-30 10:59:51 -07:00
Jong Wook Kim	b66b46f32d	test on python/pytorch versions up to 3.12 and 2.4.1 (#2360)	2024-09-30 10:33:56 -07:00
Jong Wook Kim	27f971320a	using sdpa if available (#2359) * using sdpa if available * Update model.py	2024-09-30 10:27:14 -07:00
Jong Wook Kim	423492dda7	Release 20240927 v20240927	2024-09-27 16:43:58 -07:00
Jong Wook Kim	279133e310	pinning numpy<2 in tests (#2332) * pinning numpy<2 in tests * pip install together * pip install together	2024-09-10 10:43:21 -07:00
Jianan Xing	32d55d5d76	Relax triton requirements for compatibility with pytorch 2.4 and newer (#2307) * Relax triton requirements for compatibility with pytorch 2.4 and newer Similar to https://github.com/openai/whisper/pull/1802, but now when pytorch upgrades to 2.4, it requires triton==3.0.0. I am not sure if it makes sense to remove the upper bound version constraints * Update requirements.txt	2024-09-10 09:53:08 -07:00
ryanheise	ba3f3cd54b	Skip silence around hallucinations (#1838) * Add clip_timestamps option * Add hallucination_silence_threshold option * Fix typing for python < 3.9 --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-12-18 12:11:16 -08:00
Bob Lin	8bc8860694	Fix triton env marker (#1887)	2023-12-11 10:39:08 -05:00
Jong Wook Kim	e58f288045	Release 20231117 v20231117	2023-11-17 11:59:28 -08:00
Eugene Indenbom	1cea435768	Relax triton requirements for compatibility with pytorch 2.1 and newer (#1802)	2023-11-13 09:43:42 -08:00
Jong Wook Kim	fcfeaf1b61	Release 20231106 v20231106	2023-11-06 10:14:04 -08:00
Jong Wook Kim	c5d4256076	large-v3 (#1761) * mel_filters() loads 128 mel bins * can load 100-language models * large-v3 checkpoint and evals * add mandarin alias * remove unused path * flake8 fix * formatting fix	2023-11-06 10:10:30 -08:00
Jong Wook Kim	f6f01c561c	Release 20231105 v20231105	2023-11-06 03:08:56 -08:00
Jong Wook Kim	746aaaeafa	remove tiktoken pin (#1759)	2023-11-06 03:05:21 -08:00
Philippe Hebert	b9f17e1f2d	docs: Disambiguation of the term "relative speed" in the README (#1751) * docs: defines relative speed in README * combined paragraphs --------- Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>	2023-11-06 02:43:07 -08:00
Mohamad Zamini	7dfcd56304	allow_pickle=False while loading of mel matrix IN audio.py (#1511) * Update audio.py The `mel_filters` function is using a `np.load` function to load a pre-computed mel filterbank matrix. This function is not thread-safe, which means that if it is called from multiple threads at the same time, it may corrupt the data. To fix this, you can use the `torch.load` function instead. This function is thread-safe, so it will not corrupt the data if it is called from multiple threads at the same time. * Update audio.py updated the docstring * allow_pickle=False * newline --------- Co-authored-by: Jong Wook Kim <jongwook@nyu.edu> Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-11-06 02:28:51 -08:00
Marco Zucconelli	b7d277acd5	handling transcribe exceptions. (#1682) * handling transcribe() exceptions. * printing stacktrace --------- Co-authored-by: invalid <invalid@email.com> Co-authored-by: Jong Wook Kim <jongwook@nyu.edu> Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-11-06 02:06:19 -08:00
amosal	6ed314fe41	Add new option to generate subtitles by a specific number of words (#1729) * ADD parser for new argument --max_words_count * ADD max_words_count in words_options ADD warning for max_line_width compatibility * ADD logic for max_words_count * rename to max_words_per_line * make them kwargs * allow specifying file path by --model * black formatting --------- Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>	2023-11-06 01:49:33 -08:00
Jordi Mas	b38a1f20f4	Fix exception when an audio file with no speech is provided (#1396) Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-10-10 10:01:01 -07:00
Jong Wook Kim	0a60fcaa9b	Release 20230918 v20230918	2023-09-18 17:13:19 -07:00
Jong Wook Kim	5f957da5ca	Update test.yml	2023-09-18 16:38:17 -07:00
Arthur Kim	8b330df096	Add .pre-commit-config.yaml (#1528) * Add .pre-commit-config.yaml Co-authored-by: arthur <arthur@rtzr.ai> * flake8 E741 --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>	2023-09-18 16:15:33 -07:00
sqhao	21010ef454	fix doc of TextDecoder (#1526) Signed-off-by: haoshengqiang <haoshengqiang@xiaohongshu.com> Co-authored-by: haoshengqiang <haoshengqiang@xiaohongshu.com>	2023-09-18 16:09:59 -07:00
Nino Risteski	29b7df6231	Update model-card.md (#1643) fixed a few typos	2023-09-18 15:59:49 -07:00
taylorchu	e8622f9afc	word timing tweaks (#1559) * word timing tweaks * comment on eot * clearer comments	2023-08-08 06:48:56 +09:00

167 Commits