├── LICENSE ├── README.md ├── Screenshot1_2023-07-18 14_58_00-Transcript Cleaner.png ├── Screenshot2_2023-07-18 14_58_06-Transcript Cleaner.png └── transcript_cleaner.html /LICENSE: -------------------------------------------------------------------------------- 1 | MIT License 2 | 3 | Copyright (c) 2023 Jeff Emanuel 4 | 5 | Permission is hereby granted, free of charge, to any person obtaining a copy 6 | of this software and associated documentation files (the "Software"), to deal 7 | in the Software without restriction, including without limitation the rights 8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell 9 | copies of the Software, and to permit persons to whom the Software is 10 | furnished to do so, subject to the following conditions: 11 | 12 | The above copyright notice and this permission notice shall be included in all 13 | copies or substantial portions of the Software. 14 | 15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR 16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, 17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE 18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER 19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, 20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE 21 | SOFTWARE. 22 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # YouTube Transcript Cleaner 2 | 3 | YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. It's designed to work best with transcripts obtained using the [Youtube Transcript Extractor Chrome Extension](https://chrome.google.com/webstore/detail/youtube-transcript-extrac/lclpibfglbkghjkdmpjkgehcnadcffdl). 4 | 5 | ## Features 6 | 7 | - Cleans transcripts by merging relevant lines and splitting text into sentences. 8 | - Provides a reader mode with adjustable settings such as font size, content width, and font selection. 9 | - Supports file upload and download for transcript cleaning. 10 | 11 | ## Screenshots 12 | 13 |  14 | 15 |  16 | 17 | ## How to Use 18 | 19 | 1. Download the HTML file from this repository to your local machine. 20 | 2. Open the downloaded HTML file in your web browser. 21 | 3. Use the [Youtube Transcript Extractor Chrome Extension](https://chrome.google.com/webstore/detail/youtube-transcript-extrac/lclpibfglbkghjkdmpjkgehcnadcffdl) to extract a transcript from a YouTube video. 22 | 4. Paste the transcript into the 'Original Transcript' text area or upload a .txt file containing the transcript. 23 | 5. Click the 'Clean Transcript' button to process the transcript. 24 | 6. The cleaned transcript will appear in the 'Cleaned Transcript' text area. 25 | 7. You can download the cleaned transcript as a .txt file or view it in reader mode. Reader mode provides options to customize the viewing experience, including adjusting the font size and width, and switching between different fonts. 26 | 27 | 28 | ## How I made it Quickly and Easily: 29 | 30 | [Transcript of Shared ChatGPT Conversation](https://chat.openai.com/share/ccefa576-b0af-49bf-8981-72bf78b45d69) 31 | 32 | --- 33 | 34 | Thanks for your interest in my open-source project! I hope you find it useful. You might also find my commercial web apps useful, and I would really appreciate it if you checked them out: 35 | 36 | **[YoutubeTranscriptOptimizer.com](https://youtubetranscriptoptimizer.com)** makes it really quick and easy to paste in a YouTube video URL and have it automatically generate not just a really accurate direct transcription, but also a super polished and beautifully formatted written document that can be used independently of the video. 37 | 38 | The document basically sticks to the same material as discussed in the video, but it sounds much more like a real piece of writing and not just a transcript. It also lets you optionally generate quizzes based on the contents of the document, which can be either multiple choice or short-answer quizzes, and the multiple choice quizzes get turned into interactive HTML files that can be hosted and easily shared, where you can actually take the quiz and it will grade your answers and score the quiz for you. 39 | 40 | **[FixMyDocuments.com](https://fixmydocuments.com/)** lets you submit any kind of document— PDFs (including scanned PDFs that require OCR), MS Word and Powerpoint files, images, audio files (mp3, m4a, etc.) —and turn them into highly optimized versions in nice markdown formatting, from which HTML and PDF versions are automatically generated. Once converted, you can also edit them directly in the site using the built-in markdown editor, where it saves a running revision history and regenerates the PDF/HTML versions. 41 | 42 | In addition to just getting the optimized version of the document, you can also generate many other kinds of "derived documents" from the original: interactive multiple-choice quizzes that you can actually take and get graded on; slick looking presentation slides as PDF or HTML (using LaTeX and Reveal.js), an in-depth summary, a concept mind map (using Mermaid diagrams) and outline, custom lesson plans where you can select your target audience, a readability analysis and grade-level versions of your original document (good for simplifying concepts for students), Anki Flashcards that you can import directly into the Anki app or use on the site in a nice interface, and more. 43 | -------------------------------------------------------------------------------- /Screenshot1_2023-07-18 14_58_00-Transcript Cleaner.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Dicklesworthstone/youtube_transcript_cleaner/ab0028ae0231e66019870fdb35b9ac1e2ed3d164/Screenshot1_2023-07-18 14_58_00-Transcript Cleaner.png -------------------------------------------------------------------------------- /Screenshot2_2023-07-18 14_58_06-Transcript Cleaner.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Dicklesworthstone/youtube_transcript_cleaner/ab0028ae0231e66019870fdb35b9ac1e2ed3d164/Screenshot2_2023-07-18 14_58_06-Transcript Cleaner.png -------------------------------------------------------------------------------- /transcript_cleaner.html: -------------------------------------------------------------------------------- 1 | 2 | 3 |
4 |