167 Commits

Author SHA1 Message Date
PiotrBLL
adaec5a2e1 Add requirements_hpu.txt 2024-11-12 14:53:08 +01:00
PiotrBLL
e1545f4776 Fix: compute_device name in Whisper model 2024-11-11 22:22:08 +01:00
PiotrBLL
b0cf21b9b5 Add HPU checking in decoding and transcribe handle 2024-11-11 22:21:12 +01:00
PiotrBLL
d36696f808 Add util checking hpu tensor 2024-11-11 22:20:50 +01:00
PiotrBLL
e1d4b7b4d7 Add checking sparse tensors in Whisper model 2024-11-11 22:20:24 +01:00
PiotrBLL
32ec355dbf Upgrade the Dockerfile.hpu ubuntu image 2024-11-11 22:19:54 +01:00
PiotrBLL
f6ccd01567 Wrap model to HPU + update Dockerfile packages 2024-11-11 16:52:01 +01:00
PiotrBLL
50adb7654b Fix warnings 2024-11-09 01:20:02 +01:00
PiotrBLL
2d5a552596 Fix hpu device_name typo 2024-11-08 19:02:33 +01:00
PiotrBLL
82a5380ad7 Fix Dockerfile.hpu ffmpeg package install, update README.md whisper container usage 2024-11-08 18:17:37 +01:00
PiotrBLL
9e3936581a Add optimum[habana]==1.13.1 in Dockerfile.hpu 2024-11-07 18:46:19 +01:00
PiotrBLL
3825dd0c5a Rename and Fix Dockerfile.hpu, add .dockerignore, update in README.md 2024-11-07 17:23:30 +01:00
PiotrBLL
a49838ceb3 Add Pytorch Docker images info in README.md 2024-11-06 23:44:42 +01:00
PiotrBLL
1be8041ede Add hpu usage info in jupyter notebook files 2024-11-06 00:47:12 +01:00
PiotrBLL
6e66088bd0 Add Dockerfile_hpu + README.md section of hpu usage 2024-11-06 00:35:03 +01:00
PiotrBLL
c479ff3614 Add hpu transcribe test 2024-11-05 01:27:48 +01:00
PiotrBLL
bee28658b9 Fix importing habana-frameworks library conditionally 2024-11-05 01:27:31 +01:00
PiotrBLL
52062dd798 Add hpu_utils 2024-10-31 13:56:55 +01:00
PiotrBLL
9269b2ac35 Add HPU support in transcribe and timing + tests 2024-10-31 13:56:46 +01:00
PiotrBLL
23651574df Update requirements 2024-10-31 13:55:24 +01:00
PiotrBLL
c7fe328c3f Add testing hpu file 2024-10-31 13:55:05 +01:00
PiotrBLL
8c3680cb02 Add requires_hpu marker in tests 2024-10-31 13:54:53 +01:00
PiotrBLL
b7069e579d Add initial imports and usage of wrap_in_hpu_graph 2024-10-31 13:54:29 +01:00
Jong Wook Kim
25639fc17d Release 20240930 v20240930 2024-09-30 11:20:53 -07:00
Jong Wook Kim
260bbcfcb3
allowing numpy 2 in tests (#2362)
* allowing numpy 2 in tests

* allowing numpy 2 in tests
2024-09-30 11:18:17 -07:00
Jong Wook Kim
25e5c364e0
large-v3-turbo model (#2361) 2024-09-30 10:59:51 -07:00
Jong Wook Kim
b66b46f32d
test on python/pytorch versions up to 3.12 and 2.4.1 (#2360) 2024-09-30 10:33:56 -07:00
Jong Wook Kim
27f971320a
using sdpa if available (#2359)
* using sdpa if available

* Update model.py
2024-09-30 10:27:14 -07:00
Jong Wook Kim
423492dda7 Release 20240927 v20240927 2024-09-27 16:43:58 -07:00
Jong Wook Kim
279133e310
pinning numpy<2 in tests (#2332)
* pinning numpy<2 in tests

* pip install together

* pip install together
2024-09-10 10:43:21 -07:00
Jianan Xing
32d55d5d76
Relax triton requirements for compatibility with pytorch 2.4 and newer (#2307)
* Relax triton requirements for compatibility with pytorch 2.4 and newer

Similar to https://github.com/openai/whisper/pull/1802, but now when pytorch upgrades to 2.4, it requires triton==3.0.0. I am not sure if it makes sense to remove the upper bound version constraints

* Update requirements.txt
2024-09-10 09:53:08 -07:00
ryanheise
ba3f3cd54b
Skip silence around hallucinations (#1838)
* Add clip_timestamps option

* Add hallucination_silence_threshold option

* Fix typing for python < 3.9

---------

Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-12-18 12:11:16 -08:00
Bob Lin
8bc8860694
Fix triton env marker (#1887) 2023-12-11 10:39:08 -05:00
Jong Wook Kim
e58f288045 Release 20231117 v20231117 2023-11-17 11:59:28 -08:00
Eugene Indenbom
1cea435768
Relax triton requirements for compatibility with pytorch 2.1 and newer (#1802) 2023-11-13 09:43:42 -08:00
Jong Wook Kim
fcfeaf1b61 Release 20231106 v20231106 2023-11-06 10:14:04 -08:00
Jong Wook Kim
c5d4256076
large-v3 (#1761)
* mel_filters() loads 128 mel bins

* can load 100-language models

* large-v3 checkpoint and evals

* add mandarin alias

* remove unused path

* flake8 fix

* formatting fix
2023-11-06 10:10:30 -08:00
Jong Wook Kim
f6f01c561c Release 20231105 v20231105 2023-11-06 03:08:56 -08:00
Jong Wook Kim
746aaaeafa
remove tiktoken pin (#1759) 2023-11-06 03:05:21 -08:00
Philippe Hebert
b9f17e1f2d
docs: Disambiguation of the term "relative speed" in the README (#1751)
* docs: defines relative speed in README

* combined paragraphs

---------

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2023-11-06 02:43:07 -08:00
Mohamad Zamini
7dfcd56304
allow_pickle=False while loading of mel matrix IN audio.py (#1511)
* Update audio.py

 The `mel_filters` function is using a `np.load` function to load a pre-computed mel filterbank matrix. This function is not thread-safe, which means that if it is called from multiple threads at the same time, it may corrupt the data.

To fix this, you can use the `torch.load` function instead. This function is thread-safe, so it will not corrupt the data if it is called from multiple threads at the same time.

* Update audio.py

updated the docstring

* allow_pickle=False

* newline

---------

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-11-06 02:28:51 -08:00
Marco Zucconelli
b7d277acd5
handling transcribe exceptions. (#1682)
* handling transcribe() exceptions.

* printing stacktrace

---------

Co-authored-by: invalid <invalid@email.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-11-06 02:06:19 -08:00
amosal
6ed314fe41
Add new option to generate subtitles by a specific number of words (#1729)
* ADD parser for new argument --max_words_count

* ADD max_words_count in words_options
ADD warning for max_line_width compatibility

* ADD logic for max_words_count

* rename to max_words_per_line

* make them kwargs

* allow specifying file path by --model

* black formatting

---------

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2023-11-06 01:49:33 -08:00
Jordi Mas
b38a1f20f4
Fix exception when an audio file with no speech is provided (#1396)
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-10-10 10:01:01 -07:00
Jong Wook Kim
0a60fcaa9b Release 20230918 v20230918 2023-09-18 17:13:19 -07:00
Jong Wook Kim
5f957da5ca
Update test.yml 2023-09-18 16:38:17 -07:00
Arthur Kim
8b330df096
Add .pre-commit-config.yaml (#1528)
* Add .pre-commit-config.yaml

Co-authored-by: arthur <arthur@rtzr.ai>

* flake8 E741

---------

Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-09-18 16:15:33 -07:00
sqhao
21010ef454
fix doc of TextDecoder (#1526)
Signed-off-by: haoshengqiang <haoshengqiang@xiaohongshu.com>
Co-authored-by: haoshengqiang <haoshengqiang@xiaohongshu.com>
2023-09-18 16:09:59 -07:00
Nino Risteski
29b7df6231
Update model-card.md (#1643)
fixed a few typos
2023-09-18 15:59:49 -07:00
taylorchu
e8622f9afc
word timing tweaks (#1559)
* word timing tweaks

* comment on eot

* clearer comments
2023-08-08 06:48:56 +09:00