├── LICENSE └── README.md /LICENSE: -------------------------------------------------------------------------------- 1 | Creative Commons Legal Code 2 | 3 | CC0 1.0 Universal 4 | 5 | CREATIVE COMMONS CORPORATION IS NOT A LAW FIRM AND DOES NOT PROVIDE 6 | LEGAL SERVICES. DISTRIBUTION OF THIS DOCUMENT DOES NOT CREATE AN 7 | ATTORNEY-CLIENT RELATIONSHIP. CREATIVE COMMONS PROVIDES THIS 8 | INFORMATION ON AN "AS-IS" BASIS. CREATIVE COMMONS MAKES NO WARRANTIES 9 | REGARDING THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS 10 | PROVIDED HEREUNDER, AND DISCLAIMS LIABILITY FOR DAMAGES RESULTING FROM 11 | THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS PROVIDED 12 | HEREUNDER. 13 | 14 | Statement of Purpose 15 | 16 | The laws of most jurisdictions throughout the world automatically confer 17 | exclusive Copyright and Related Rights (defined below) upon the creator 18 | and subsequent owner(s) (each and all, an "owner") of an original work of 19 | authorship and/or a database (each, a "Work"). 20 | 21 | Certain owners wish to permanently relinquish those rights to a Work for 22 | the purpose of contributing to a commons of creative, cultural and 23 | scientific works ("Commons") that the public can reliably and without fear 24 | of later claims of infringement build upon, modify, incorporate in other 25 | works, reuse and redistribute as freely as possible in any form whatsoever 26 | and for any purposes, including without limitation commercial purposes. 27 | These owners may contribute to the Commons to promote the ideal of a free 28 | culture and the further production of creative, cultural and scientific 29 | works, or to gain reputation or greater distribution for their Work in 30 | part through the use and efforts of others. 31 | 32 | For these and/or other purposes and motivations, and without any 33 | expectation of additional consideration or compensation, the person 34 | associating CC0 with a Work (the "Affirmer"), to the extent that he or she 35 | is an owner of Copyright and Related Rights in the Work, voluntarily 36 | elects to apply CC0 to the Work and publicly distribute the Work under its 37 | terms, with knowledge of his or her Copyright and Related Rights in the 38 | Work and the meaning and intended legal effect of CC0 on those rights. 39 | 40 | 1. Copyright and Related Rights. A Work made available under CC0 may be 41 | protected by copyright and related or neighboring rights ("Copyright and 42 | Related Rights"). Copyright and Related Rights include, but are not 43 | limited to, the following: 44 | 45 | i. the right to reproduce, adapt, distribute, perform, display, 46 | communicate, and translate a Work; 47 | ii. moral rights retained by the original author(s) and/or performer(s); 48 | iii. publicity and privacy rights pertaining to a person's image or 49 | likeness depicted in a Work; 50 | iv. rights protecting against unfair competition in regards to a Work, 51 | subject to the limitations in paragraph 4(a), below; 52 | v. rights protecting the extraction, dissemination, use and reuse of data 53 | in a Work; 54 | vi. database rights (such as those arising under Directive 96/9/EC of the 55 | European Parliament and of the Council of 11 March 1996 on the legal 56 | protection of databases, and under any national implementation 57 | thereof, including any amended or successor version of such 58 | directive); and 59 | vii. other similar, equivalent or corresponding rights throughout the 60 | world based on applicable law or treaty, and any national 61 | implementations thereof. 62 | 63 | 2. Waiver. To the greatest extent permitted by, but not in contravention 64 | of, applicable law, Affirmer hereby overtly, fully, permanently, 65 | irrevocably and unconditionally waives, abandons, and surrenders all of 66 | Affirmer's Copyright and Related Rights and associated claims and causes 67 | of action, whether now known or unknown (including existing as well as 68 | future claims and causes of action), in the Work (i) in all territories 69 | worldwide, (ii) for the maximum duration provided by applicable law or 70 | treaty (including future time extensions), (iii) in any current or future 71 | medium and for any number of copies, and (iv) for any purpose whatsoever, 72 | including without limitation commercial, advertising or promotional 73 | purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each 74 | member of the public at large and to the detriment of Affirmer's heirs and 75 | successors, fully intending that such Waiver shall not be subject to 76 | revocation, rescission, cancellation, termination, or any other legal or 77 | equitable action to disrupt the quiet enjoyment of the Work by the public 78 | as contemplated by Affirmer's express Statement of Purpose. 79 | 80 | 3. Public License Fallback. Should any part of the Waiver for any reason 81 | be judged legally invalid or ineffective under applicable law, then the 82 | Waiver shall be preserved to the maximum extent permitted taking into 83 | account Affirmer's express Statement of Purpose. In addition, to the 84 | extent the Waiver is so judged Affirmer hereby grants to each affected 85 | person a royalty-free, non transferable, non sublicensable, non exclusive, 86 | irrevocable and unconditional license to exercise Affirmer's Copyright and 87 | Related Rights in the Work (i) in all territories worldwide, (ii) for the 88 | maximum duration provided by applicable law or treaty (including future 89 | time extensions), (iii) in any current or future medium and for any number 90 | of copies, and (iv) for any purpose whatsoever, including without 91 | limitation commercial, advertising or promotional purposes (the 92 | "License"). The License shall be deemed effective as of the date CC0 was 93 | applied by Affirmer to the Work. Should any part of the License for any 94 | reason be judged legally invalid or ineffective under applicable law, such 95 | partial invalidity or ineffectiveness shall not invalidate the remainder 96 | of the License, and in such case Affirmer hereby affirms that he or she 97 | will not (i) exercise any of his or her remaining Copyright and Related 98 | Rights in the Work or (ii) assert any associated claims and causes of 99 | action with respect to the Work, in either case contrary to Affirmer's 100 | express Statement of Purpose. 101 | 102 | 4. Limitations and Disclaimers. 103 | 104 | a. No trademark or patent rights held by Affirmer are waived, abandoned, 105 | surrendered, licensed or otherwise affected by this document. 106 | b. Affirmer offers the Work as-is and makes no representations or 107 | warranties of any kind concerning the Work, express, implied, 108 | statutory or otherwise, including without limitation warranties of 109 | title, merchantability, fitness for a particular purpose, non 110 | infringement, or the absence of latent or other defects, accuracy, or 111 | the present or absence of errors, whether or not discoverable, all to 112 | the greatest extent permissible under applicable law. 113 | c. Affirmer disclaims responsibility for clearing rights of other persons 114 | that may apply to the Work or any use thereof, including without 115 | limitation any person's Copyright and Related Rights in the Work. 116 | Further, Affirmer disclaims responsibility for obtaining any necessary 117 | consents, permissions or other rights required for any use of the 118 | Work. 119 | d. Affirmer understands and acknowledges that Creative Commons is not a 120 | party to this document and has no duty or obligation with respect to 121 | this CC0 or use of the Work. 122 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Deep-Learning-Applications-with-GUI 2 | 3 | List of FOSS Deep Learning (Desktop) Applications with GUI and/or CLI. Most Deep Learning models require a complicated setup and are not ready to use out of the box. At best there is a Colab or Jupiter notebook available, which is good for testing, but not for production use. 4 | 5 | The software in this list is ready to use (CLI or GUI). 6 | 7 | :camera_flash: NeRF 📃 Text 🎨 Image :notes: Audio 🎞️ Video frame interpolation 🖼️🎞️ Video upscaler 🖼️ Image/Video upscaling 🧩 Ready to use Plugins 8 | 9 | # :camera_flash: 3D / NeRF: Neural Radiance Fields 10 | 11 | https://github.com/NVlabs/instant-ngp#interactive-training-and-rendering 12 | 13 | # General 14 | https://github.com/pinokiocomputer/pinokio 15 | 16 | # 📃 Text 17 | 18 | ## Jan 19 | 20 | Jan is an open-source ChatGPT alternative that runs 100% offline on your computer. 21 | 22 | https://github.com/janhq/jan 23 | 24 | ## text-generation-webui 25 | 26 | A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA. 27 | 28 | Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation. 29 | 30 | https://github.com/oobabooga/text-generation-webui 31 | 32 | ## chatai 33 | 34 | https://github.com/Capsize-Games/chatai 35 | 36 | ## Audio to text 37 | https://github.com/Const-me/Whisper 38 | 39 | https://github.com/kaixxx/noScribe 40 | 41 | https://github.com/CheshireCC/fatser-whisper-GUI (highly recommended) 42 | 43 | https://github.com/Purfview/whisper-standalone-win (ready to use .exe for cli use) 44 | 45 | https://github.com/FL33TW00D/whisper-turbo (wip) 46 | 47 | ## subtitle 48 | https://github.com/URUWorks/TeroSubtitler (whisper support) 49 | 50 | ## video-subtitle-generator 51 | https://github.com/YaoFANGUK/video-subtitle-generator 52 | 53 | ## Translation 54 | https://github.com/argosopentech/argos-translate#installation 55 | https://github.com/ymoslem/DesktopTranslator 56 | 57 | ## Chat-Style LLM (Alpaca/Llama) (Chat GTP Clone) 58 | https://github.com/antimatter15/alpaca.cpp 59 | https://github.com/tatsu-lab/stanford_alpaca 60 | https://github.com/tloen/alpaca-lora 61 | 62 | ## dalai 63 | 64 | Run LLaMA and Alpaca on your computer. 65 | 66 | https://github.com/cocktailpeanut/dalai 67 | 68 | ## GTP4All 69 | 70 | Installs a native chat-client with auto-update functionality that runs on your desktop with the GPT4All-J model baked into it. 71 | 72 | https://github.com/nomic-ai/gpt4all 73 | 74 | ## Gpt4All Web UI 75 | 76 | This is a Flask web application that provides a chat UI for interacting with llamacpp based chatbots such as GPT4all, vicuna etc... 77 | 78 | https://github.com/nomic-ai/gpt4all-ui 79 | 80 | ## AutoGPT GUI 81 | 82 | A graphical user interface to AutoGPT 83 | 84 | https://github.com/thecookingsenpai/autogpt-gui 85 | 86 | # 🎨 Image 87 | 88 | ## Fooocus 89 | 90 | Fooocus is an image generating software. 91 | 92 | https://github.com/lllyasviel/Fooocus 93 | 94 | ## Style2Paints 95 | 96 | Style2paints V4 is the current best AI driven lineart **colorization tool**. 97 | 98 | https://github.com/lllyasviel/style2paints 99 | 100 | Source code not available for the latest release. 101 | 102 | JavaScript, Python, Tensorflow 103 | 104 | ## StableStudio 105 | 106 | StableStudio, the open-source version of DreamStudio! 107 | 108 | https://github.com/Stability-AI/StableStudio 109 | 110 | ## NMKD Stable Diffusion GUI - AI Image Generator 111 | 112 | https://github.com/n00mkrad/text2image-gui 113 | 114 | https://nmkd.itch.io/t2i-gui 115 | 116 | ## stable-diffusion-webui-forge 117 | 118 | https://github.com/lllyasviel/stable-diffusion-webui-forge 119 | 120 | ## stable-diffusion-ui 121 | 122 | Easiest 1-click way to install and use Stable Diffusion on your own computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image 123 | 124 | https://github.com/cmdr2/stable-diffusion-ui 125 | 126 | ## MochiDiffusion 127 | 128 | This app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements. 129 | 130 | https://github.com/godly-devotion/MochiDiffusion 131 | 132 | ## ComfyUI 133 | 134 | This ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. For some workflow examples and see what ComfyUI can do you can check out: 135 | 136 | https://github.com/comfyanonymous/ComfyUI 137 | 138 | ## Stable Diffusion on AMD GPU 139 | 140 | https://github.com/azuritecoin/OnnxDiffusersUI (deprecated) 141 | https://github.com/lshqqytiger/stable-diffusion-webui-directml 142 | 143 | ## Diffusion Bee - Stable Diffusion GUI App for MacOS 144 | 145 | https://github.com/divamgupta/diffusionbee-stable-diffusion-ui 146 | 147 | ## WebUI 148 | 149 | ready to use colab based notebooks with web ui 150 | 151 | https://github.com/camenduru/stable-diffusion-webui-colab 152 | 153 | ## InvokeAI 154 | 155 | InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products. 156 | 157 | https://github.com/invoke-ai/InvokeAI 158 | 159 | ## airunner 160 | 161 | https://github.com/Capsize-Games/airunner $ 162 | 163 | 164 | ## Lama Cleaner 165 | 166 | A free and open-source inpainting tool powered by SOTA AI model. 167 | 168 | https://github.com/Sanster/lama-cleaner 169 | 170 | https://panicbyte.itch.io/lama-cleaner 171 | 172 | ## carefree-creator 173 | 174 | AI magics meet Infinite draw board. 175 | 176 | https://github.com/carefree0910/carefree-creator 177 | 178 | ## photoshot 179 | 180 | An open-source AI avatar generator web app 181 | 182 | https://github.com/shinework/photoshot 183 | 184 | ## DeOldify.NET 185 | 186 | DeOldify.NET: cross-platform application for coloring black and white photos" 187 | 188 | https://github.com/ColorfulSoft/DeOldify.NET 189 | 190 | ## Queryable 191 | 192 | Offline photo search App for iOS 193 | 194 | https://github.com/mazzzystar/Queryable 195 | 196 | ## X-AnyLabeling 197 | 198 | Effortless data labeling with AI support from Segment Anything and other awesome models. 199 | 200 | https://github.com/CVHub520/X-AnyLabeling 201 | 202 | # :notes: Audio 203 | 204 | ## clone-voice 205 | 206 | A sound cloning tool with a web interface, using your voice or any sound to record audio 207 | 208 | https://github.com/jianchang512/clone-voice 209 | 210 | ## tacotron2-tts-GUI 211 | 212 | GUI wrapper for synthesize. Allows CPU-only synthesis via a toggleable switch. Portable exe file is available (that runs on CPU only). 213 | 214 | https://github.com/lokkelvin2/tacotron2-tts-GUI 215 | 216 | ## riffusion 217 | 218 | Riffusion is an app for real-time music generation with stable diffusion. 219 | 220 | https://github.com/riffusion/riffusion-app 221 | 222 | ## whisper-ui 223 | 224 | Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. 225 | 226 | This is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files. 227 | 228 | https://github.com/hayabhay/whisper-ui for https://github.com/openai/whisper 229 | 230 | https://grisk.itch.io/whisper-gui 231 | 232 | ## buzz 233 | 234 | GUI for whisper 235 | 236 | https://github.com/chidiwilliams/buzz 237 | 238 | ## Ultimate Vocal Remover GUI v5.5.1 239 | 240 | This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models). 241 | 242 | https://github.com/Anjok07/ultimatevocalremovergui 243 | 244 | ## SpleeterGUI 245 | 246 | SpleeterGUI - Music source separation desktop app 247 | 248 | Windows Desktop Front end for Spleeter - AI source separation 249 | 250 | https://github.com/boy1dr/SpleeterGui 251 | 252 | ## SoftVC VITS Singing Voice Conversion Fork 253 | 254 | https://github.com/34j/so-vits-svc-fork 255 | 256 | ## subsync subtitle alignment 257 | 258 | This is an automatic movie subtitle synchronization tool. Synchronization is done by listening to the audio track, translating it if necessary. 259 | 260 | https://github.com/sc0ty/subsync/ 261 | 262 | ## Handy (Speech to text) 263 | 264 | A free, open source, and extensible speech-to-text application that works completely offline. 265 | 266 | https://github.com/cjpais/Handy 267 | 268 | # 🎞️ Video frame interpolation 269 | 270 | ## Video Dubbing 271 | 272 | https://github.com/jianchang512/pyvideotrans 273 | 274 | ## DAIN-App 275 | 276 | The backbone of this project is a algorithm called DAIN. (https://github.com/baowenbo/DAIN) 277 | 278 | https://github.com/BurguerJohn/Dain-App 279 | 280 | https://grisk.itch.io/dain-app?download 281 | 282 | ## DAIN Vulkan GUI 283 | 284 | AI-Powered video interpolater (eg. 30fps -> 60fps) for Vulkan devices. Based on dain-ncnn-vulkan and ffmpeg 285 | 286 | https://github.com/Mar2ck/DAIN-Vulkan-GUI 287 | 288 | ## dain-ncnn-vulkan 289 | 290 | ncnn implementation of DAIN, Depth-Aware Video Frame Interpolation. 291 | 292 | https://github.com/nihui/dain-ncnn-vulkan 293 | 294 | ## Dain-App 295 | 296 | https://github.com/BurguerJohn/Dain-App 297 | 298 | ## cain-ncnn-vulkan 299 | 300 | Video Frame Interpolation 301 | 302 | https://github.com/nihui/cain-ncnn-vulkan 303 | 304 | ## Flowframes 305 | 306 | Flowframes is a simple but powerful app that utilizes AI frameworks to interpolate videos in order to increase their framerate with little to no noticable quality loss. (DAIN, CAIN, RIFE) 307 | 308 | https://github.com/n00mkrad/flowframes 309 | 310 | https://nmkd.itch.io/flowframes 311 | 312 | # 🖼️🎞️ Video upscaler 313 | 314 | ## video2x 315 | 316 | A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley 2, 2018. 317 | https://github.com/k4yt3x/video2x 318 | 319 | ## https://github.com/Communist02/AnimePlayer 320 | 321 | 322 | ## 🎞️ Video Deep Face 323 | 324 | https://github.com/iperov/DeepFaceLive 325 | 326 | # 🖼️ Image/Video upscaling 327 | 328 | ## Final2x 329 | 330 | This is a powerful tool that allows for image super-resolution to arbitrary sizes using multiple models, designed to enhance the resolution and quality of images, making them clearer and more detailed. Presently, it supports several models, such as RealCUGAN, RealESRGAN, Waifu2x, and SRMD. 331 | 332 | https://github.com/Tohrusky/Final2x 333 | 334 | ## Upscayl 335 | Upscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes :) Makes use of Real-ESRGAN 336 | 337 | https://github.com/upscayl/upscayl 338 | 339 | ## Anime4kSharp 340 | 341 | https://github.com/shadow578/Anime4kSharp 342 | 343 | ## dandere2x-tremx 344 | 345 | video compression 346 | https://github.com/Tremeschin/dandere2x-tremx 347 | 348 | ## Cupscale 349 | 350 | Image Upscaling GUI based on ESRGAN - WORK IN PROGRESS 351 | 352 | https://github.com/n00mkrad/cupscale 353 | 354 | ## IEU.Winforms 355 | 356 | Windows only GUI for ESRGAN with additional features 357 | https://github.com/ptrsuder/IEU.Winforms 358 | 359 | ## NVIDIA-Ansel-AI-Enhancer 360 | 361 | NVIDIA Upscaler, utilizes NVIDIA´s Ansel RTX. Requires latest drivers and a RTX GPU. Free, but not open source. 362 | 363 | https://github.com/dynamiquel/NVIDIA-Ansel-AI-Enhancer 364 | 365 | ## RealSR 366 | 367 | Real-World Super-Resolution via Kernel Estimation and Noise Injection 368 | 369 | https://github.com/jixiaozhong/RealSR 370 | 371 | (https://github.com/nihui/RealSR) 372 | 373 | ## realsr-ncnn-vulkan 374 | 375 | https://github.com/nihui/realsr-ncnn-vulkan 376 | 377 | ## Shell upscaler 378 | 379 | https://github.com/n00mkrad/shell-upscaler 380 | 381 | Acts as a GUI for esrgan-launcher. 382 | 383 | ## srmd-ncnn-vulkan 384 | 385 | https://github.com/nihui/srmd-ncnn-vulkan 386 | 387 | ## VapourSynth-Super-Resolution-Helper 388 | 389 | https://github.com/AlphaAtlas/VapourSynth-Super-Resolution-Helper 390 | 391 | ## Waifu2x-Extension-GUI 392 | 393 | Photo/Video/GIF enlargement using machine learning 394 | 395 | https://github.com/AaronFeng753/Waifu2x-Extension-GUI 396 | 397 | C++, python, qt, caffe, ncnn_vulkan 398 | 399 | AGPL-3.0 License 400 | 401 | ## waifu2x-ncnn-vulkan 402 | 403 | https://github.com/nihui/waifu2x-ncnn-vulkan 404 | 405 | # GAN / Style Transfer 406 | 407 | ## TorchRayLib 408 | 409 | TorchRayLib++: A CMake based AI & generative art platform, integrating the rayib GUI and the PyTorch C++ Deep Learning Library. 410 | 411 | https://github.com/QuantScientist/TorchRayLib 412 | 413 | ## more 414 | https://github.com/Seprentes/ScaleUp 415 | https://github.com/HolyWu/vs-realesrgan 416 | https://github.com/tumuyan/RealSR-NCNN-Android 417 | https://github.com/TransparentLC/realesrgan-gui/blob/master/README.en-US.md 418 | https://github.com/Lucchetto/SuperImage 419 | https://github.com/Tohrusky/Final2x 420 | 421 | # :toolbox: DL Training GUI 422 | 423 | ## AIDeveloper 424 | 425 | GUI-based software for training, evaluating and applying deep neural nets for image classification 426 | 427 | https://github.com/maikherbig/AIDeveloper 428 | 429 | # 🧩 Ready to use Plugins 430 | 431 | ## vstSpleeter 432 | 433 | audio source separation library vst plugin 434 | 435 | https://github.com/gvne/vstSpleeter 436 | 437 | 438 | --- 439 | 440 | 441 | ## various apps (untested) 442 | 443 | Paper2GUI is an AI desktop Application toolbox for everyone no matter your background with it being free and ready to use out of the box, already supporting 10+ AI models, covering the fields of speech synthesis, video frame filling, video overscoring, target detection, image stylization, OCR recognition, etc. Supporting Windows, Mac OS, Linux systems. 444 | 445 | https://github.com/Baiyuetribe/paper2gui/blob/main/README_en.md 446 | 447 | Free: https://www.nvidia.com/en-us/studio/canvas/ 448 | --------------------------------------------------------------------------------