PiotrBLL
adaec5a2e1
Add requirements_hpu.txt
2024-11-12 14:53:08 +01:00
PiotrBLL
e1545f4776
Fix: compute_device name in Whisper model
2024-11-11 22:22:08 +01:00
PiotrBLL
b0cf21b9b5
Add HPU checking in decoding and transcribe handle
2024-11-11 22:21:12 +01:00
PiotrBLL
d36696f808
Add util checking hpu tensor
2024-11-11 22:20:50 +01:00
PiotrBLL
e1d4b7b4d7
Add checking sparse tensors in Whisper model
2024-11-11 22:20:24 +01:00
PiotrBLL
32ec355dbf
Upgrade the Dockerfile.hpu ubuntu image
2024-11-11 22:19:54 +01:00
PiotrBLL
f6ccd01567
Wrap model to HPU + update Dockerfile packages
2024-11-11 16:52:01 +01:00
PiotrBLL
50adb7654b
Fix warnings
2024-11-09 01:20:02 +01:00
PiotrBLL
2d5a552596
Fix hpu device_name typo
2024-11-08 19:02:33 +01:00
PiotrBLL
82a5380ad7
Fix Dockerfile.hpu ffmpeg package install, update README.md whisper container usage
2024-11-08 18:17:37 +01:00
PiotrBLL
9e3936581a
Add optimum[habana]==1.13.1 in Dockerfile.hpu
2024-11-07 18:46:19 +01:00
PiotrBLL
3825dd0c5a
Rename and Fix Dockerfile.hpu, add .dockerignore, update in README.md
2024-11-07 17:23:30 +01:00
PiotrBLL
a49838ceb3
Add Pytorch Docker images info in README.md
2024-11-06 23:44:42 +01:00
PiotrBLL
1be8041ede
Add hpu usage info in jupyter notebook files
2024-11-06 00:47:12 +01:00
PiotrBLL
6e66088bd0
Add Dockerfile_hpu + README.md section of hpu usage
2024-11-06 00:35:03 +01:00
PiotrBLL
c479ff3614
Add hpu transcribe test
2024-11-05 01:27:48 +01:00
PiotrBLL
bee28658b9
Fix importing habana-frameworks library conditionally
2024-11-05 01:27:31 +01:00
PiotrBLL
52062dd798
Add hpu_utils
2024-10-31 13:56:55 +01:00
PiotrBLL
9269b2ac35
Add HPU support in transcribe and timing + tests
2024-10-31 13:56:46 +01:00
PiotrBLL
23651574df
Update requirements
2024-10-31 13:55:24 +01:00
PiotrBLL
c7fe328c3f
Add testing hpu file
2024-10-31 13:55:05 +01:00
PiotrBLL
8c3680cb02
Add requires_hpu marker in tests
2024-10-31 13:54:53 +01:00
PiotrBLL
b7069e579d
Add initial imports and usage of wrap_in_hpu_graph
2024-10-31 13:54:29 +01:00
Jong Wook Kim
25639fc17d
Release 20240930
v20240930
2024-09-30 11:20:53 -07:00
Jong Wook Kim
260bbcfcb3
allowing numpy 2 in tests ( #2362 )
...
* allowing numpy 2 in tests
* allowing numpy 2 in tests
2024-09-30 11:18:17 -07:00
Jong Wook Kim
25e5c364e0
large-v3-turbo model ( #2361 )
2024-09-30 10:59:51 -07:00
Jong Wook Kim
b66b46f32d
test on python/pytorch versions up to 3.12 and 2.4.1 ( #2360 )
2024-09-30 10:33:56 -07:00
Jong Wook Kim
27f971320a
using sdpa if available ( #2359 )
...
* using sdpa if available
* Update model.py
2024-09-30 10:27:14 -07:00
Jong Wook Kim
423492dda7
Release 20240927
v20240927
2024-09-27 16:43:58 -07:00
Jong Wook Kim
279133e310
pinning numpy<2 in tests ( #2332 )
...
* pinning numpy<2 in tests
* pip install together
* pip install together
2024-09-10 10:43:21 -07:00
Jianan Xing
32d55d5d76
Relax triton requirements for compatibility with pytorch 2.4 and newer ( #2307 )
...
* Relax triton requirements for compatibility with pytorch 2.4 and newer
Similar to https://github.com/openai/whisper/pull/1802 , but now when pytorch upgrades to 2.4, it requires triton==3.0.0. I am not sure if it makes sense to remove the upper bound version constraints
* Update requirements.txt
2024-09-10 09:53:08 -07:00
ryanheise
ba3f3cd54b
Skip silence around hallucinations ( #1838 )
...
* Add clip_timestamps option
* Add hallucination_silence_threshold option
* Fix typing for python < 3.9
---------
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-12-18 12:11:16 -08:00
Bob Lin
8bc8860694
Fix triton env marker ( #1887 )
2023-12-11 10:39:08 -05:00
Jong Wook Kim
e58f288045
Release 20231117
v20231117
2023-11-17 11:59:28 -08:00
Eugene Indenbom
1cea435768
Relax triton requirements for compatibility with pytorch 2.1 and newer ( #1802 )
2023-11-13 09:43:42 -08:00
Jong Wook Kim
fcfeaf1b61
Release 20231106
v20231106
2023-11-06 10:14:04 -08:00
Jong Wook Kim
c5d4256076
large-v3 ( #1761 )
...
* mel_filters() loads 128 mel bins
* can load 100-language models
* large-v3 checkpoint and evals
* add mandarin alias
* remove unused path
* flake8 fix
* formatting fix
2023-11-06 10:10:30 -08:00
Jong Wook Kim
f6f01c561c
Release 20231105
v20231105
2023-11-06 03:08:56 -08:00
Jong Wook Kim
746aaaeafa
remove tiktoken pin ( #1759 )
2023-11-06 03:05:21 -08:00
Philippe Hebert
b9f17e1f2d
docs: Disambiguation of the term "relative speed" in the README ( #1751 )
...
* docs: defines relative speed in README
* combined paragraphs
---------
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2023-11-06 02:43:07 -08:00
Mohamad Zamini
7dfcd56304
allow_pickle=False while loading of mel matrix IN audio.py ( #1511 )
...
* Update audio.py
The `mel_filters` function is using a `np.load` function to load a pre-computed mel filterbank matrix. This function is not thread-safe, which means that if it is called from multiple threads at the same time, it may corrupt the data.
To fix this, you can use the `torch.load` function instead. This function is thread-safe, so it will not corrupt the data if it is called from multiple threads at the same time.
* Update audio.py
updated the docstring
* allow_pickle=False
* newline
---------
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-11-06 02:28:51 -08:00
Marco Zucconelli
b7d277acd5
handling transcribe exceptions. ( #1682 )
...
* handling transcribe() exceptions.
* printing stacktrace
---------
Co-authored-by: invalid <invalid@email.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-11-06 02:06:19 -08:00
amosal
6ed314fe41
Add new option to generate subtitles by a specific number of words ( #1729 )
...
* ADD parser for new argument --max_words_count
* ADD max_words_count in words_options
ADD warning for max_line_width compatibility
* ADD logic for max_words_count
* rename to max_words_per_line
* make them kwargs
* allow specifying file path by --model
* black formatting
---------
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2023-11-06 01:49:33 -08:00
Jordi Mas
b38a1f20f4
Fix exception when an audio file with no speech is provided ( #1396 )
...
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-10-10 10:01:01 -07:00
Jong Wook Kim
0a60fcaa9b
Release 20230918
v20230918
2023-09-18 17:13:19 -07:00
Jong Wook Kim
5f957da5ca
Update test.yml
2023-09-18 16:38:17 -07:00
Arthur Kim
8b330df096
Add .pre-commit-config.yaml ( #1528 )
...
* Add .pre-commit-config.yaml
Co-authored-by: arthur <arthur@rtzr.ai>
* flake8 E741
---------
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2023-09-18 16:15:33 -07:00
sqhao
21010ef454
fix doc of TextDecoder ( #1526 )
...
Signed-off-by: haoshengqiang <haoshengqiang@xiaohongshu.com>
Co-authored-by: haoshengqiang <haoshengqiang@xiaohongshu.com>
2023-09-18 16:09:59 -07:00
Nino Risteski
29b7df6231
Update model-card.md ( #1643 )
...
fixed a few typos
2023-09-18 15:59:49 -07:00
taylorchu
e8622f9afc
word timing tweaks ( #1559 )
...
* word timing tweaks
* comment on eot
* clearer comments
2023-08-08 06:48:56 +09:00