Commit Graph

  • f68af3098c allowing numpy 2 in tests Jong Wook Kim 2024-09-30 11:11:54 -07:00
  • 25e5c364e0
    large-v3-turbo model (#2361) Jong Wook Kim 2024-09-30 10:59:51 -07:00
  • f8b142fdaa large-v3-turbo model jongwook/large-v3-turbo Jong Wook Kim 2024-09-29 14:27:47 -07:00
  • b66b46f32d
    test on python/pytorch versions up to 3.12 and 2.4.1 (#2360) Jong Wook Kim 2024-09-30 10:33:56 -07:00
  • 9ff12f3825
    Merge branch 'main' into jongwook/test-versions-update jongwook/test-versions-update Jong Wook Kim 2024-09-30 10:31:17 -07:00
  • 27f971320a
    using sdpa if available (#2359) Jong Wook Kim 2024-09-30 10:27:14 -07:00
  • 8c61a79241 test on python/pytorch versions up to 3.12 and 2.4.1 Jong Wook Kim 2024-09-30 10:26:39 -07:00
  • 3211024b53
    Update model.py jongwook/sdpa Jong Wook Kim 2024-09-30 10:23:39 -07:00
  • 65a353771a using sdpa if available Jong Wook Kim 2024-09-30 02:38:26 -07:00
  • 5d2a12c61e
    Merge 4de997d674458de51bb84f2d45cb5f5059bcc344 into 423492dda7806206abe56bdfe427c1096473a020 Haden Wasserbaech 2024-09-28 19:53:09 +08:00
  • 423492dda7 Release 20240927 v20240927 Jong Wook Kim 2024-09-27 16:43:58 -07:00
  • c1031a5787 Add mel_spectrogram_device parameter take0x 2024-09-23 08:06:27 +09:00
  • afeccc1f98 Revert unnecessary .gitignore change Kittsil 2024-09-18 23:02:16 -05:00
  • 207f5b9347
    Merge branch 'openai:main' into patch-1 kittsil 2024-09-18 23:00:15 -05:00
  • fae8ede051 Prevent redundant initial_prompt_tokens Kittsil 2024-09-18 22:47:06 -05:00
  • ce140e4f7a Add option to carry initial_prompt with the sliding window Kittsil 2024-09-18 22:03:25 -05:00
  • 2e63314696
    Merge 8c27cca65bf11e814488d3ac98849947fd5fd034 into 279133e3107392276dc509148da1f41bfb532c7e Steven Van Ingelgem 2024-09-17 23:40:35 +08:00
  • 1add30171d
    Merge branch 'main' into fix/torch-load-weights-only-warning Ultr4_dev 2024-09-16 14:57:36 +02:00
  • 834662c956 Merge branch 'main' into transcribe-on-gpu take0x 2024-09-11 08:15:03 +09:00
  • 279133e310
    pinning numpy<2 in tests (#2332) Jong Wook Kim 2024-09-10 10:43:21 -07:00
  • ec43022c14 pip install together jongwook/numpy1-in-tests Jong Wook Kim 2024-09-10 10:38:40 -07:00
  • 1c3fced926 pip install together Jong Wook Kim 2024-09-10 10:34:53 -07:00
  • 79233a9313 pinning numpy<2 in tests Jong Wook Kim 2024-09-10 10:23:36 -07:00
  • 32d55d5d76
    Relax triton requirements for compatibility with pytorch 2.4 and newer (#2307) Jianan Xing 2024-09-10 09:53:08 -07:00
  • 2448c6f749 Transcribe on GPU take0x 2024-09-09 19:38:39 +09:00
  • 6af85965de
    Create CHIPBoT IDfy edoerpani 2024-08-22 18:12:52 +08:00
  • de8eb63f09
    Update requirements.txt Jianan Xing 2024-08-19 16:49:40 -07:00
  • 49ed4adff5
    Relax triton requirements for compatibility with pytorch 2.4 and newer Jianan Xing 2024-08-19 16:46:28 -07:00
  • 7c524dd193 [FIX] The docstring Erfan Tarighi 2024-08-19 18:35:33 +02:00
  • 9cd08df3ed [ADD] word_stream_callback to get the ready words for streaming to the users or start another process sooner. Erfan Tarighi 2024-08-19 18:32:31 +02:00
  • 895e4fb88e Add weights_only parameter to load_model function and extent docstring Ultr4_dev 2024-08-13 00:02:11 +02:00
  • 2317050239 Update torch.load() call with weights_only=True Ultr4_dev 2024-08-13 00:00:43 +02:00
  • 3537c29f93
    Pin numpy to 1.26.4 Adam Gardner 2024-08-08 21:06:42 +10:00
  • 8d6e0e5e1a Improved Documentation SalimIBNShakeel 2024-07-31 20:39:17 +05:00
  • 9cbc667e94 Improved Documentation SalimIBNShakeel 2024-07-31 18:58:37 +05:00
  • f4e24bb466
    Added per-token confidence to each segment in decoding.py Ashish Patel 2024-07-29 18:15:53 +05:30
  • 69913d1bd6
    Added error handling to log_mel_spectrogram and improved documentation Ashish Patel 2024-07-29 17:22:23 +05:30
  • 41ca671338 remove accidentally added test output Kent Slaney 2024-07-22 16:15:32 -07:00
  • 1caba7d5d4 clarify transcription parameter Kent Slaney 2024-07-22 16:14:30 -07:00
  • 092cb3409e detect language based on available frames not seek Kent Slaney 2024-07-22 13:40:12 -07:00
  • 247391a2af language detection patch and test Kent Slaney 2024-07-22 13:16:53 -07:00
  • 610f82ffba remove realtime-specific code Kent Slaney 2024-07-21 20:21:02 -07:00
  • c09790488b simplify auto-formatting output Kent Slaney 2024-07-14 19:37:18 -06:00
  • 0621ed8094 pre-commit formatting Kent Slaney 2024-07-14 19:07:06 -06:00
  • e0704ddeba add parameter documentation back in Kent Slaney 2024-07-14 16:24:14 -06:00
  • b4fd954955 progress bar support and buffered cli option Kent Slaney 2024-07-14 16:14:37 -06:00
  • 4ccbd70012 refactor transcribe Kent Slaney 2024-07-13 22:28:56 -06:00
  • 20e323895d Support longer audio files reducing memory usage with chunking Gustavo Garcia 2024-07-01 19:46:15 +02:00
  • 168306fd3b Refactor and optimize model code for readability and efficiency San 2024-06-22 18:29:56 +03:00
  • 31a1c816eb Refactor and optimize model code for readability and efficiency San 2024-06-22 18:22:21 +03:00
  • 531265772d
    Update CHANGELOG.md junwenyin 2024-06-16 13:33:42 +02:00
  • 3d912f75e6
    Update README.md Rishi Raj Jain 2024-06-13 01:40:34 +05:30
  • 5c46fac102
    Update README.md Littin Rajan 2024-06-07 17:33:54 +05:30
  • c24ffce902
    Merge b1ca1afc6c0cfbac041dcfe7692b892fdfe89329 into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Littin Rajan 2024-06-07 11:52:42 +00:00
  • b1ca1afc6c
    Update README.md Littin Rajan 2024-06-07 17:22:09 +05:30
  • 05a2dbffd5
    Fix: typo in dataset reference text CharlesCNorton 2024-06-05 09:55:37 -04:00
  • 5011979f39
    Fix: typo in dataset preparation documentation CharlesCNorton 2024-06-05 09:28:24 -04:00
  • dabf80050f
    Merge a84d7ea904359cdf54ec76468fb53217b97ede18 into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Xabier de Zuazo 2024-06-01 07:45:25 +00:00
  • a84d7ea904 Fix beam search with batch processing in Whisper decoding Xabier de Zuazo 2024-06-01 09:26:31 +02:00
  • c2d30994c3
    Merge branch 'main' into transcribe-argument Nripesh Niketan 2024-05-29 03:26:44 +04:00
  • 84552cd520
    Create python-app.yml satanistninja666 2024-05-28 05:07:46 -04:00
  • b76dcf3663
    export stt method, add it to the __init__ function khaliladib11 2024-05-26 19:10:51 +01:00
  • e53b617de1
    add stt method to the transcribe function khaliladib11 2024-05-26 19:10:24 +01:00
  • 0fd17c99c8
    add colorized text khaliladib11 2024-05-26 11:28:35 +01:00
  • 34d9a9b6a3
    modify the mel to be in the same shape as model's khaliladib11 2024-05-26 11:03:22 +01:00
  • a4aadd95a1
    add test_proba file and change decoder khaliladib11 2024-05-25 21:45:13 +01:00
  • ed4b0d14a2 add comments zzy981019 2024-05-24 17:32:32 +08:00
  • 34ec5c81e7 fix bugs Nathanael Perraudin 2024-05-22 10:57:00 +02:00
  • 0f4c4e5d45 Update README.md Sinan 2023-03-20 12:52:54 +01:00
  • 592a56187f committed SinanAkkoyun 2023-03-19 13:36:26 +01:00
  • 35530894b4 committed SinanAkkoyun 2023-03-19 13:28:07 +01:00
  • 086108095a Update README.md Sinan 2023-03-16 12:46:17 +01:00
  • a800654929 add filler words ah oh ethangearey 2024-05-07 21:24:17 -04:00
  • 5809f7d176 added comments to EnglishTextNormalizer ethangearey 2024-05-05 22:35:06 -04:00
  • 49c833dcee Add error handling to log_mel_spectrogram function Madhumitha 2024-05-04 04:57:53 +05:30
  • 898a72ad30
    Dependabot configuration to update actions in workflows Scott Brenner 2024-04-29 20:37:42 -07:00
  • e73c9adac0
    Merge 3c00eb610d4e7845c3492984041790bcb7d48896 into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Greg 2024-04-29 13:59:33 -05:00
  • 3c00eb610d
    c Greg 2024-04-29 13:55:48 -05:00
  • 637205bc33 add content_duration to yield jhj0517 2024-04-13 01:07:45 +09:00
  • 541c7e360f add enable_generator parameter jhj0517 2024-04-12 23:32:03 +09:00
  • 21999e1702 Omit space prefix in initial_prompt for spaceless languages. Ryan Heise 2024-03-31 13:11:45 +11:00
  • a41cc58f15 whisper/transcribe: fix typo on supersedes Ruffalo Lavoisier 2024-03-30 07:56:38 +09:00
  • 144640ded4
    Fixed incorrect import on non-Windows systems Menonro 2024-03-20 21:57:31 +06:00
  • 2566f688ee
    Hiding a window is now only for Windows systems Menonro 2024-03-20 17:41:05 +06:00
  • 4fda47955b
    Hiding the ffmpeg popup that appears momentarily Menonro 2024-03-20 17:11:32 +06:00
  • c2b0a9396e
    Merge bedda27faf967b6b55fd4495c483df33ab15dd66 into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Tejas Amol Hande 2024-03-15 17:59:21 +00:00
  • bedda27faf
    Update setup.py Tejas Amol Hande 2024-03-15 23:28:58 +05:30
  • 5ed89d0ca2 remove log jax 2024-03-08 12:07:59 +08:00
  • 9cf2f995bd add hotwords feature jax 2024-03-08 11:30:43 +08:00
  • ea928e8449
    Merge ecf6380ad0201ef17cfe920184595c7a8f4c015b into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Deep Soni 2024-03-06 07:44:11 +00:00
  • ecf6380ad0
    Update to requirements.txt Deep Soni 2024-03-06 13:13:19 +05:30
  • 8a69031cb6 Updating README and doc strings to reflect that n_mels can now be 128 Lowell Vaughn 2024-02-26 16:32:22 -08:00
  • eff9353773 Fix window end heuristic for hallucination_silence_threshold Ryan Heise 2024-02-23 19:31:25 +11:00
  • 5e1a2f64e7
    Create Security Policy Diogo Teles Sant'Anna 2024-02-22 12:09:50 -03:00
  • 492c05c5f3 Update utils.py Louis Brulé Naudet 2024-02-19 20:12:26 +01:00
  • 3078b77fa4
    Merge 286e541898d28f15f79ea2accdce9a5df4b9a62b into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Francisco A Rivera 2024-02-08 16:56:00 -08:00
  • 286e541898
    Create CODE_OF_CONDUCT.md Francisco A Rivera 2024-02-08 16:55:23 -08:00
  • 68ab7366af
    Merge 3a2a78f24306dd22914b2b4f2f4745469fcde9c2 into ba3f3cd54b0e5b8ce1ab3de13e32122d0d5f98ab Francisco A Rivera 2024-02-08 16:50:00 -08:00
  • 3a2a78f243
    Create SECURITY.md Francisco A Rivera 2024-02-08 16:48:40 -08:00
  • 8e6a0e1017 Restrict seek_clip_end value to content_frames Jonathan Baudanza 2024-02-07 15:58:33 +09:00