├── README.md └── LICENSE /README.md: -------------------------------------------------------------------------------- 1 | # awesome-openai-whisper 2 | A curated list of awesome OpenAI's Whisper 3 | 4 | 5 | ## General Resources 6 | * [Introducing Whisper](https://openai.com/blog/whisper/) 7 | * [Whisper Paper](https://cdn.openai.com/papers/whisper.pdf) 8 | * [Whisper Code](https://github.com/openai/whisper) 9 | * [Introducing ChatGPT and Whisper APIs](https://openai.com/blog/introducing-chatgpt-and-whisper-apis) 10 | 11 | ## API Ready / Playground / Demo 12 | * [whisperx](https://replicate.com/daanelson/whisperx) 13 | * [WHISPER+](https://www.oneai.com/speech-to-text) 14 | * [Fine-Tuned Whisper API](https://www.assemblyai.com/) 15 | * [openai/whisper – Run with an API on Replicate](https://replicate.com/openai/whisper) 16 | * [Whisper - a Hugging Face Space by openai](https://huggingface.co/spaces/openai/whisper) 17 | * [Whisper Playground](https://whisperui.monsterapi.ai/) 18 | * [Source](https://github.com/saharmor/whisper-playground) 19 | * [Web Whisper - 🎶 Convert any audio to text 📝](https://whisper.r3d.red) 20 | * [Source](https://codeberg.org/pluja/web-whisper) 21 | 22 | ## Model Variants 23 | * [whisper-timestamped - Whisper with word-level timestamps and confidence ](https://github.com/linto-ai/whisper-timestamped) 24 | * [whisper.cpp - Port of OpenAI's Whisper model in C/C++](https://github.com/ggerganov/whisper.cpp) 25 | * [pywhispercpp - Python bindings for whisper.cpp ](https://github.com/abdeladim-s/pywhispercpp) 26 | * [Faster Whisper - reimplementation using CTranslate2 up to 4 times faster](https://github.com/guillaumekln/faster-whisper) 27 | * [Whisper JAX - optimised JAX code, largely built on the hugs Hugging Face Transformers Whisper implementation, over 70x faster](https://github.com/sanchit-gandhi/whisper-jax/) 28 | * [whisper.tflite](https://github.com/usefulsensors/openai-whisper) 29 | * [OpenAI Whisper - CPU](https://github.com/MiscellaneousStuff/openai-whisper-cpu) 30 | * [whisper_onnx](https://github.com/Fhrozen/whisper_onnx) 31 | * [whisper-export - openvino version of openai/whisper](https://github.com/axinc-ai/whisper-export) 32 | * [onnx-export](https://github.com/axinc-ai/whisper-export/tree/onnx-export) 33 | * [Whisper OpenVINO](https://github.com/zhuzilin/whisper-openvino) 34 | * [Whisper models on Hugging Face](https://huggingface.co/models?other=whisper) 35 | 36 | ## Applications 37 | * [React hook for OpenAI Whisper](https://github.com/chengsokdara/use-whisper) 38 | * [🎞️ Subtitles generation tool (Web-UI + CLI + Python package)](https://github.com/abdeladim-s/subsai) 39 | * [Whisper as a Service (GUI and API for OpenAI Whisper)](https://github.com/schibsted/WAAS) 40 | * [WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.](https://github.com/m-bain/whisperX) 41 | * [stable-ts - Stabilizing Timestamps for Whisper](https://github.com/jianfch/stable-ts) 42 | * [buzz - Buzz transcribes audio from your computer's microphones to text using OpenAI's Whisper](https://github.com/chidiwilliams/buzz) 43 | * [whispering - Streaming transcriber with whisper](https://github.com/shirayu/whispering) 44 | * [whisper-youtube - 🔉 Youtube Videos Transcription with OpenAI's Whisper](https://github.com/ArthurFDLR/whisper-youtube) 45 | * [Speaker Identification - Pyannote plays and Whisper rhymes](https://github.com/Majdoddin/nlp) 46 | * [Automatic YouTube subtitle generation](https://github.com/m1guelpf/yt-whisper) 47 | * [Whisper Webui - WebUI for Whisper that can transcribe and translate audio](https://gitlab.com/aadnk/whisper-webui/) 48 | * [AutoCut - generate video subtitles and edit the video by selecting subtitle clips](https://github.com/mli/autocut) 49 | * [AutoCut Client](https://github.com/zcf0508/autocut-client) 50 | * [Whisper Playground - Build real time speech2text web apps using OpenAI's Whisper](https://github.com/saharmor/whisper-playground) 51 | * [Subtitle Edit - a subtitle editor supporting audio to text (speech recognition) via Whisper or Vosk/Kaldi](https://www.nikse.dk/subtitleedit) 52 | * [WEB WHISPER - A light user interface for OpenAI's Whisper right into your browser!](https://codeberg.org/pluja/web-whisper) 53 | * [Whisper Mic - Project that allows one to use a microphone with OpenAI whisper](https://github.com/mallorbc/whisper_mic) 54 | * [Android Whisper ASR App](https://play.google.com/store/apps/details?id=com.whisper.android.tflitecpp) 55 | * [Source](https://github.com/usefulsensors/openai-whisper/tree/main/android_app) 56 | * [Apple Whisper ASR App](https://apps.apple.com/in/app/whisper-asr/id6444556326) 57 | * [💬 ASR FastAPI](https://github.com/Wordcab/wordcab-transcribe) 58 | 59 | ## Videos 60 | * [OpenAI Whisper - MultiLingual AI Speech Recognition Live App Tutorial](https://www.youtube.com/watch?v=ywIyc8l1K1Q) 61 | * [Complete Tutorial Video for OpenAI's Whisper Model for Windows Users](https://www.youtube.com/watch?v=msj3wuYf3d8) 62 | * [Open AI’s Whisper is Amazing!](https://www.youtube.com/watch?v=OCBZtgQGt1I) 63 | * [How to Use OpenAI Whisper to Fix YouTube Search](https://www.youtube.com/watch?v=vpU_6x3jowg) 64 | 65 | ## Tutorials 66 | * [Convert Podcasts to Text With OpenAI’s Whisper API Using Python](https://betterprogramming.pub/openais-whisper-tutorial-42140dd696ee) 67 | * [Create your own speech to text application with Whisper from OpenAI and Flask](https://blog.paperspace.com/whisper-openai-flask-application-deployment/) 68 | * [How to Run OpenAI’s Whisper Speech Recognition Model](https://www.assemblyai.com/blog/how-to-run-openais-whisper-speech-recognition-model/) 69 | * [Speech-to-Text with OpenAI’s Whisper](https://towardsdatascience.com/speech-to-text-with-openais-whisper-53d5cea9005e) 70 | 71 | ## Articles 72 | * [Whispers of A.I.’s Modular Future](https://www.newyorker.com/tech/annals-of-technology/whispers-of-ais-modular-future) 73 | * [OpenAI open-sources Whisper, a multilingual speech recognition system](https://techcrunch.com/2022/09/21/openai-open-sources-whisper-a-multilingual-speech-recognition-system/) 74 | * [OpenAI Releases 1.6 Billion Parameter Multilingual Speech Recognition AI Whisper](https://www.infoq.com/news/2022/10/openai-whisper-speech/) 75 | * [OpenAI Releases Whisper: A New Open-Source Machine Learning Model For Multi-Lingual Automatic Speech Recognition](https://www.marktechpost.com/2022/09/27/openai-releases-whisper-a-new-open-source-machine-learning-model-for-multi-lingual-automatic-speech-recognition/) 76 | -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | Creative Commons Legal Code 2 | 3 | CC0 1.0 Universal 4 | 5 | CREATIVE COMMONS CORPORATION IS NOT A LAW FIRM AND DOES NOT PROVIDE 6 | LEGAL SERVICES. DISTRIBUTION OF THIS DOCUMENT DOES NOT CREATE AN 7 | ATTORNEY-CLIENT RELATIONSHIP. CREATIVE COMMONS PROVIDES THIS 8 | INFORMATION ON AN "AS-IS" BASIS. CREATIVE COMMONS MAKES NO WARRANTIES 9 | REGARDING THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS 10 | PROVIDED HEREUNDER, AND DISCLAIMS LIABILITY FOR DAMAGES RESULTING FROM 11 | THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS PROVIDED 12 | HEREUNDER. 13 | 14 | Statement of Purpose 15 | 16 | The laws of most jurisdictions throughout the world automatically confer 17 | exclusive Copyright and Related Rights (defined below) upon the creator 18 | and subsequent owner(s) (each and all, an "owner") of an original work of 19 | authorship and/or a database (each, a "Work"). 20 | 21 | Certain owners wish to permanently relinquish those rights to a Work for 22 | the purpose of contributing to a commons of creative, cultural and 23 | scientific works ("Commons") that the public can reliably and without fear 24 | of later claims of infringement build upon, modify, incorporate in other 25 | works, reuse and redistribute as freely as possible in any form whatsoever 26 | and for any purposes, including without limitation commercial purposes. 27 | These owners may contribute to the Commons to promote the ideal of a free 28 | culture and the further production of creative, cultural and scientific 29 | works, or to gain reputation or greater distribution for their Work in 30 | part through the use and efforts of others. 31 | 32 | For these and/or other purposes and motivations, and without any 33 | expectation of additional consideration or compensation, the person 34 | associating CC0 with a Work (the "Affirmer"), to the extent that he or she 35 | is an owner of Copyright and Related Rights in the Work, voluntarily 36 | elects to apply CC0 to the Work and publicly distribute the Work under its 37 | terms, with knowledge of his or her Copyright and Related Rights in the 38 | Work and the meaning and intended legal effect of CC0 on those rights. 39 | 40 | 1. Copyright and Related Rights. A Work made available under CC0 may be 41 | protected by copyright and related or neighboring rights ("Copyright and 42 | Related Rights"). Copyright and Related Rights include, but are not 43 | limited to, the following: 44 | 45 | i. the right to reproduce, adapt, distribute, perform, display, 46 | communicate, and translate a Work; 47 | ii. moral rights retained by the original author(s) and/or performer(s); 48 | iii. publicity and privacy rights pertaining to a person's image or 49 | likeness depicted in a Work; 50 | iv. rights protecting against unfair competition in regards to a Work, 51 | subject to the limitations in paragraph 4(a), below; 52 | v. rights protecting the extraction, dissemination, use and reuse of data 53 | in a Work; 54 | vi. database rights (such as those arising under Directive 96/9/EC of the 55 | European Parliament and of the Council of 11 March 1996 on the legal 56 | protection of databases, and under any national implementation 57 | thereof, including any amended or successor version of such 58 | directive); and 59 | vii. other similar, equivalent or corresponding rights throughout the 60 | world based on applicable law or treaty, and any national 61 | implementations thereof. 62 | 63 | 2. Waiver. To the greatest extent permitted by, but not in contravention 64 | of, applicable law, Affirmer hereby overtly, fully, permanently, 65 | irrevocably and unconditionally waives, abandons, and surrenders all of 66 | Affirmer's Copyright and Related Rights and associated claims and causes 67 | of action, whether now known or unknown (including existing as well as 68 | future claims and causes of action), in the Work (i) in all territories 69 | worldwide, (ii) for the maximum duration provided by applicable law or 70 | treaty (including future time extensions), (iii) in any current or future 71 | medium and for any number of copies, and (iv) for any purpose whatsoever, 72 | including without limitation commercial, advertising or promotional 73 | purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each 74 | member of the public at large and to the detriment of Affirmer's heirs and 75 | successors, fully intending that such Waiver shall not be subject to 76 | revocation, rescission, cancellation, termination, or any other legal or 77 | equitable action to disrupt the quiet enjoyment of the Work by the public 78 | as contemplated by Affirmer's express Statement of Purpose. 79 | 80 | 3. Public License Fallback. Should any part of the Waiver for any reason 81 | be judged legally invalid or ineffective under applicable law, then the 82 | Waiver shall be preserved to the maximum extent permitted taking into 83 | account Affirmer's express Statement of Purpose. In addition, to the 84 | extent the Waiver is so judged Affirmer hereby grants to each affected 85 | person a royalty-free, non transferable, non sublicensable, non exclusive, 86 | irrevocable and unconditional license to exercise Affirmer's Copyright and 87 | Related Rights in the Work (i) in all territories worldwide, (ii) for the 88 | maximum duration provided by applicable law or treaty (including future 89 | time extensions), (iii) in any current or future medium and for any number 90 | of copies, and (iv) for any purpose whatsoever, including without 91 | limitation commercial, advertising or promotional purposes (the 92 | "License"). The License shall be deemed effective as of the date CC0 was 93 | applied by Affirmer to the Work. Should any part of the License for any 94 | reason be judged legally invalid or ineffective under applicable law, such 95 | partial invalidity or ineffectiveness shall not invalidate the remainder 96 | of the License, and in such case Affirmer hereby affirms that he or she 97 | will not (i) exercise any of his or her remaining Copyright and Related 98 | Rights in the Work or (ii) assert any associated claims and causes of 99 | action with respect to the Work, in either case contrary to Affirmer's 100 | express Statement of Purpose. 101 | 102 | 4. Limitations and Disclaimers. 103 | 104 | a. No trademark or patent rights held by Affirmer are waived, abandoned, 105 | surrendered, licensed or otherwise affected by this document. 106 | b. Affirmer offers the Work as-is and makes no representations or 107 | warranties of any kind concerning the Work, express, implied, 108 | statutory or otherwise, including without limitation warranties of 109 | title, merchantability, fitness for a particular purpose, non 110 | infringement, or the absence of latent or other defects, accuracy, or 111 | the present or absence of errors, whether or not discoverable, all to 112 | the greatest extent permissible under applicable law. 113 | c. Affirmer disclaims responsibility for clearing rights of other persons 114 | that may apply to the Work or any use thereof, including without 115 | limitation any person's Copyright and Related Rights in the Work. 116 | Further, Affirmer disclaims responsibility for obtaining any necessary 117 | consents, permissions or other rights required for any use of the 118 | Work. 119 | d. Affirmer understands and acknowledges that Creative Commons is not a 120 | party to this document and has no duty or obligation with respect to 121 | this CC0 or use of the Work. 122 | --------------------------------------------------------------------------------