├── LICENSE
└── README.md


/LICENSE:
--------------------------------------------------------------------------------
  1 | Creative Commons Legal Code
  2 | 
  3 | CC0 1.0 Universal
  4 | 
  5 |     CREATIVE COMMONS CORPORATION IS NOT A LAW FIRM AND DOES NOT PROVIDE
  6 |     LEGAL SERVICES. DISTRIBUTION OF THIS DOCUMENT DOES NOT CREATE AN
  7 |     ATTORNEY-CLIENT RELATIONSHIP. CREATIVE COMMONS PROVIDES THIS
  8 |     INFORMATION ON AN "AS-IS" BASIS. CREATIVE COMMONS MAKES NO WARRANTIES
  9 |     REGARDING THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS
 10 |     PROVIDED HEREUNDER, AND DISCLAIMS LIABILITY FOR DAMAGES RESULTING FROM
 11 |     THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS PROVIDED
 12 |     HEREUNDER.
 13 | 
 14 | Statement of Purpose
 15 | 
 16 | The laws of most jurisdictions throughout the world automatically confer
 17 | exclusive Copyright and Related Rights (defined below) upon the creator
 18 | and subsequent owner(s) (each and all, an "owner") of an original work of
 19 | authorship and/or a database (each, a "Work").
 20 | 
 21 | Certain owners wish to permanently relinquish those rights to a Work for
 22 | the purpose of contributing to a commons of creative, cultural and
 23 | scientific works ("Commons") that the public can reliably and without fear
 24 | of later claims of infringement build upon, modify, incorporate in other
 25 | works, reuse and redistribute as freely as possible in any form whatsoever
 26 | and for any purposes, including without limitation commercial purposes.
 27 | These owners may contribute to the Commons to promote the ideal of a free
 28 | culture and the further production of creative, cultural and scientific
 29 | works, or to gain reputation or greater distribution for their Work in
 30 | part through the use and efforts of others.
 31 | 
 32 | For these and/or other purposes and motivations, and without any
 33 | expectation of additional consideration or compensation, the person
 34 | associating CC0 with a Work (the "Affirmer"), to the extent that he or she
 35 | is an owner of Copyright and Related Rights in the Work, voluntarily
 36 | elects to apply CC0 to the Work and publicly distribute the Work under its
 37 | terms, with knowledge of his or her Copyright and Related Rights in the
 38 | Work and the meaning and intended legal effect of CC0 on those rights.
 39 | 
 40 | 1. Copyright and Related Rights. A Work made available under CC0 may be
 41 | protected by copyright and related or neighboring rights ("Copyright and
 42 | Related Rights"). Copyright and Related Rights include, but are not
 43 | limited to, the following:
 44 | 
 45 |   i. the right to reproduce, adapt, distribute, perform, display,
 46 |      communicate, and translate a Work;
 47 |  ii. moral rights retained by the original author(s) and/or performer(s);
 48 | iii. publicity and privacy rights pertaining to a person's image or
 49 |      likeness depicted in a Work;
 50 |  iv. rights protecting against unfair competition in regards to a Work,
 51 |      subject to the limitations in paragraph 4(a), below;
 52 |   v. rights protecting the extraction, dissemination, use and reuse of data
 53 |      in a Work;
 54 |  vi. database rights (such as those arising under Directive 96/9/EC of the
 55 |      European Parliament and of the Council of 11 March 1996 on the legal
 56 |      protection of databases, and under any national implementation
 57 |      thereof, including any amended or successor version of such
 58 |      directive); and
 59 | vii. other similar, equivalent or corresponding rights throughout the
 60 |      world based on applicable law or treaty, and any national
 61 |      implementations thereof.
 62 | 
 63 | 2. Waiver. To the greatest extent permitted by, but not in contravention
 64 | of, applicable law, Affirmer hereby overtly, fully, permanently,
 65 | irrevocably and unconditionally waives, abandons, and surrenders all of
 66 | Affirmer's Copyright and Related Rights and associated claims and causes
 67 | of action, whether now known or unknown (including existing as well as
 68 | future claims and causes of action), in the Work (i) in all territories
 69 | worldwide, (ii) for the maximum duration provided by applicable law or
 70 | treaty (including future time extensions), (iii) in any current or future
 71 | medium and for any number of copies, and (iv) for any purpose whatsoever,
 72 | including without limitation commercial, advertising or promotional
 73 | purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each
 74 | member of the public at large and to the detriment of Affirmer's heirs and
 75 | successors, fully intending that such Waiver shall not be subject to
 76 | revocation, rescission, cancellation, termination, or any other legal or
 77 | equitable action to disrupt the quiet enjoyment of the Work by the public
 78 | as contemplated by Affirmer's express Statement of Purpose.
 79 | 
 80 | 3. Public License Fallback. Should any part of the Waiver for any reason
 81 | be judged legally invalid or ineffective under applicable law, then the
 82 | Waiver shall be preserved to the maximum extent permitted taking into
 83 | account Affirmer's express Statement of Purpose. In addition, to the
 84 | extent the Waiver is so judged Affirmer hereby grants to each affected
 85 | person a royalty-free, non transferable, non sublicensable, non exclusive,
 86 | irrevocable and unconditional license to exercise Affirmer's Copyright and
 87 | Related Rights in the Work (i) in all territories worldwide, (ii) for the
 88 | maximum duration provided by applicable law or treaty (including future
 89 | time extensions), (iii) in any current or future medium and for any number
 90 | of copies, and (iv) for any purpose whatsoever, including without
 91 | limitation commercial, advertising or promotional purposes (the
 92 | "License"). The License shall be deemed effective as of the date CC0 was
 93 | applied by Affirmer to the Work. Should any part of the License for any
 94 | reason be judged legally invalid or ineffective under applicable law, such
 95 | partial invalidity or ineffectiveness shall not invalidate the remainder
 96 | of the License, and in such case Affirmer hereby affirms that he or she
 97 | will not (i) exercise any of his or her remaining Copyright and Related
 98 | Rights in the Work or (ii) assert any associated claims and causes of
 99 | action with respect to the Work, in either case contrary to Affirmer's
100 | express Statement of Purpose.
101 | 
102 | 4. Limitations and Disclaimers.
103 | 
104 |  a. No trademark or patent rights held by Affirmer are waived, abandoned,
105 |     surrendered, licensed or otherwise affected by this document.
106 |  b. Affirmer offers the Work as-is and makes no representations or
107 |     warranties of any kind concerning the Work, express, implied,
108 |     statutory or otherwise, including without limitation warranties of
109 |     title, merchantability, fitness for a particular purpose, non
110 |     infringement, or the absence of latent or other defects, accuracy, or
111 |     the present or absence of errors, whether or not discoverable, all to
112 |     the greatest extent permissible under applicable law.
113 |  c. Affirmer disclaims responsibility for clearing rights of other persons
114 |     that may apply to the Work or any use thereof, including without
115 |     limitation any person's Copyright and Related Rights in the Work.
116 |     Further, Affirmer disclaims responsibility for obtaining any necessary
117 |     consents, permissions or other rights required for any use of the
118 |     Work.
119 |  d. Affirmer understands and acknowledges that Creative Commons is not a
120 |     party to this document and has no duty or obligation with respect to
121 |     this CC0 or use of the Work.
122 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
  1 | # Deep-Learning-Applications-with-GUI
  2 | 
  3 | List of FOSS Deep Learning (Desktop) Applications with GUI and/or CLI. Most Deep Learning models require a complicated setup and are not ready to use out of the box. At best there is a Colab or Jupiter notebook available, which is good for testing, but not for production use.
  4 | 
  5 | The software in this list is ready to use (CLI or GUI).
  6 | 
  7 | :camera_flash: NeRF 📃 Text 🎨 Image :notes: Audio 🎞️ Video frame interpolation 🖼️🎞️ Video upscaler 🖼️ Image/Video upscaling 🧩 Ready to use Plugins
  8 | 
  9 | # :camera_flash: 3D / NeRF: Neural Radiance Fields
 10 | 
 11 | https://github.com/NVlabs/instant-ngp#interactive-training-and-rendering
 12 | 
 13 | # General
 14 | https://github.com/pinokiocomputer/pinokio
 15 | 
 16 | # 📃 Text
 17 | 
 18 | ## Jan
 19 | 
 20 | Jan is an open-source ChatGPT alternative that runs 100% offline on your computer.
 21 | 
 22 | https://github.com/janhq/jan
 23 | 
 24 | ## text-generation-webui
 25 | 
 26 | A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
 27 | 
 28 | Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.
 29 | 
 30 | https://github.com/oobabooga/text-generation-webui
 31 | 
 32 | ## chatai
 33 | 
 34 | https://github.com/Capsize-Games/chatai
 35 | 
 36 | ## Audio to text
 37 | https://github.com/Const-me/Whisper
 38 | 
 39 | https://github.com/kaixxx/noScribe
 40 | 
 41 | https://github.com/CheshireCC/fatser-whisper-GUI  (highly recommended)
 42 | 
 43 | https://github.com/Purfview/whisper-standalone-win (ready to use .exe for cli use)
 44 | 
 45 | https://github.com/FL33TW00D/whisper-turbo (wip)
 46 | 
 47 | ## subtitle
 48 | https://github.com/URUWorks/TeroSubtitler (whisper support)
 49 | 
 50 | ## video-subtitle-generator
 51 | https://github.com/YaoFANGUK/video-subtitle-generator
 52 | 
 53 | ## Translation
 54 | https://github.com/argosopentech/argos-translate#installation
 55 | https://github.com/ymoslem/DesktopTranslator
 56 | 
 57 | ## Chat-Style LLM (Alpaca/Llama) (Chat GTP Clone)
 58 | https://github.com/antimatter15/alpaca.cpp
 59 | https://github.com/tatsu-lab/stanford_alpaca
 60 | https://github.com/tloen/alpaca-lora
 61 | 
 62 | ## dalai
 63 | 
 64 | Run LLaMA and Alpaca on your computer.
 65 | 
 66 | https://github.com/cocktailpeanut/dalai
 67 | 
 68 | ## GTP4All
 69 | 
 70 | Installs a native chat-client with auto-update functionality that runs on your desktop with the GPT4All-J model baked into it.
 71 | 
 72 | https://github.com/nomic-ai/gpt4all
 73 | 
 74 | ## Gpt4All Web UI
 75 | 
 76 | This is a Flask web application that provides a chat UI for interacting with llamacpp based chatbots such as GPT4all, vicuna etc...
 77 | 
 78 | https://github.com/nomic-ai/gpt4all-ui
 79 | 
 80 | ## AutoGPT GUI
 81 | 
 82 | A graphical user interface to AutoGPT
 83 | 
 84 | https://github.com/thecookingsenpai/autogpt-gui
 85 | 
 86 | # 🎨 Image
 87 | 
 88 | ## Fooocus
 89 | 
 90 | Fooocus is an image generating software. 
 91 | 
 92 | https://github.com/lllyasviel/Fooocus
 93 | 
 94 | ## Style2Paints
 95 | 
 96 | Style2paints V4 is the current best AI driven lineart **colorization tool**.
 97 | 
 98 | https://github.com/lllyasviel/style2paints
 99 | 
100 | Source code not available for the latest release.
101 | 
102 | JavaScript, Python, Tensorflow
103 | 
104 | ## StableStudio
105 | 
106 | StableStudio, the open-source version of DreamStudio!
107 | 
108 | https://github.com/Stability-AI/StableStudio
109 | 
110 | ## NMKD Stable Diffusion GUI - AI Image Generator
111 | 
112 | https://github.com/n00mkrad/text2image-gui
113 | 
114 | https://nmkd.itch.io/t2i-gui
115 | 
116 | ## stable-diffusion-webui-forge
117 | 
118 | https://github.com/lllyasviel/stable-diffusion-webui-forge
119 | 
120 | ## stable-diffusion-ui
121 | 
122 | Easiest 1-click way to install and use Stable Diffusion on your own computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image
123 | 
124 | https://github.com/cmdr2/stable-diffusion-ui
125 | 
126 | ## MochiDiffusion
127 | 
128 | This app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements.
129 | 
130 | https://github.com/godly-devotion/MochiDiffusion
131 | 
132 | ## ComfyUI
133 | 
134 | This ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. For some workflow examples and see what ComfyUI can do you can check out:
135 | 
136 | https://github.com/comfyanonymous/ComfyUI
137 | 
138 | ## Stable Diffusion on AMD GPU
139 | 
140 | https://github.com/azuritecoin/OnnxDiffusersUI (deprecated)
141 | https://github.com/lshqqytiger/stable-diffusion-webui-directml
142 | 
143 | ## Diffusion Bee - Stable Diffusion GUI App for MacOS
144 | 
145 | https://github.com/divamgupta/diffusionbee-stable-diffusion-ui
146 | 
147 | ## WebUI 
148 | 
149 | ready to use colab based notebooks with web ui
150 | 
151 | https://github.com/camenduru/stable-diffusion-webui-colab
152 | 
153 | ## InvokeAI
154 | 
155 |  InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products. 
156 | 
157 | https://github.com/invoke-ai/InvokeAI
158 | 
159 | ## airunner
160 | 
161 | https://github.com/Capsize-Games/airunner $
162 | 
163 | 
164 | ## Lama Cleaner
165 | 
166 | A free and open-source inpainting tool powered by SOTA AI model.
167 | 
168 | https://github.com/Sanster/lama-cleaner
169 | 
170 | https://panicbyte.itch.io/lama-cleaner
171 | 
172 | ## carefree-creator
173 | 
174 |  AI magics meet Infinite draw board. 
175 | 
176 | https://github.com/carefree0910/carefree-creator
177 | 
178 | ## photoshot
179 | 
180 | An open-source AI avatar generator web app
181 | 
182 | https://github.com/shinework/photoshot
183 | 
184 | ## DeOldify.NET
185 | 
186 | DeOldify.NET: cross-platform application for coloring black and white photos"
187 | 
188 | https://github.com/ColorfulSoft/DeOldify.NET
189 | 
190 | ## Queryable
191 | 
192 | Offline photo search App for iOS
193 | 
194 | https://github.com/mazzzystar/Queryable
195 | 
196 | ## X-AnyLabeling
197 | 
198 | Effortless data labeling with AI support from Segment Anything and other awesome models. 
199 | 
200 | https://github.com/CVHub520/X-AnyLabeling
201 | 
202 | # :notes: Audio
203 | 
204 | ## clone-voice
205 | 
206 | A sound cloning tool with a web interface, using your voice or any sound to record audio
207 | 
208 | https://github.com/jianchang512/clone-voice
209 | 
210 | ## tacotron2-tts-GUI
211 | 
212 | GUI wrapper for synthesize. Allows CPU-only synthesis via a toggleable switch. Portable exe file is available (that runs on CPU only).
213 | 
214 | https://github.com/lokkelvin2/tacotron2-tts-GUI
215 | 
216 | ## riffusion
217 | 
218 | Riffusion is an app for real-time music generation with stable diffusion.
219 | 
220 | https://github.com/riffusion/riffusion-app
221 | 
222 | ## whisper-ui
223 | 
224 | Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
225 | 
226 | This is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files.
227 | 
228 | https://github.com/hayabhay/whisper-ui for https://github.com/openai/whisper
229 | 
230 | https://grisk.itch.io/whisper-gui
231 | 
232 | ## buzz 
233 | 
234 | GUI for whisper
235 | 
236 | https://github.com/chidiwilliams/buzz
237 | 
238 | ## Ultimate Vocal Remover GUI v5.5.1
239 | 
240 | This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
241 | 
242 | https://github.com/Anjok07/ultimatevocalremovergui
243 | 
244 | ## SpleeterGUI
245 | 
246 | SpleeterGUI - Music source separation desktop app
247 | 
248 | Windows Desktop Front end for Spleeter - AI source separation
249 | 
250 | https://github.com/boy1dr/SpleeterGui
251 | 
252 | ## SoftVC VITS Singing Voice Conversion Fork
253 | 
254 | https://github.com/34j/so-vits-svc-fork
255 | 
256 | ## subsync subtitle alignment
257 | 
258 | This is an automatic movie subtitle synchronization tool. Synchronization is done by listening to the audio track, translating it if necessary.
259 | 
260 | https://github.com/sc0ty/subsync/
261 | 
262 | ## Handy (Speech to text)
263 | 
264 | A free, open source, and extensible speech-to-text application that works completely offline.
265 | 
266 | https://github.com/cjpais/Handy
267 | 
268 | # 🎞️ Video frame interpolation
269 | 
270 | ## Video Dubbing
271 | 
272 | https://github.com/jianchang512/pyvideotrans
273 | 
274 | ## DAIN-App
275 | 
276 | The backbone of this project is a algorithm called DAIN. (https://github.com/baowenbo/DAIN)
277 | 
278 | https://github.com/BurguerJohn/Dain-App
279 | 
280 | https://grisk.itch.io/dain-app?download
281 | 
282 | ## DAIN Vulkan GUI
283 | 
284 | AI-Powered video interpolater (eg. 30fps -> 60fps) for Vulkan devices. Based on dain-ncnn-vulkan and ffmpeg
285 | 
286 | https://github.com/Mar2ck/DAIN-Vulkan-GUI
287 | 
288 | ## dain-ncnn-vulkan
289 | 
290 | ncnn implementation of DAIN, Depth-Aware Video Frame Interpolation.
291 | 
292 | https://github.com/nihui/dain-ncnn-vulkan
293 | 
294 | ## Dain-App
295 | 
296 | https://github.com/BurguerJohn/Dain-App
297 | 
298 | ## cain-ncnn-vulkan
299 | 
300 | Video Frame Interpolation
301 | 
302 | https://github.com/nihui/cain-ncnn-vulkan
303 | 
304 | ## Flowframes
305 | 
306 | Flowframes is a simple but powerful app that utilizes AI frameworks to interpolate videos in order to increase their framerate with little to no noticable quality loss. (DAIN, CAIN, RIFE)
307 | 
308 | https://github.com/n00mkrad/flowframes
309 | 
310 | https://nmkd.itch.io/flowframes
311 | 
312 | # 🖼️🎞️ Video upscaler
313 | 
314 | ## video2x
315 | 
316 | A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley 2, 2018. 
317 | https://github.com/k4yt3x/video2x
318 | 
319 | ## https://github.com/Communist02/AnimePlayer
320 | 
321 | 
322 | ## 🎞️ Video Deep Face
323 | 
324 | https://github.com/iperov/DeepFaceLive
325 | 
326 | # 🖼️ Image/Video upscaling
327 | 
328 | ## Final2x
329 | 
330 | This is a powerful tool that allows for image super-resolution to arbitrary sizes using multiple models, designed to enhance the resolution and quality of images, making them clearer and more detailed. Presently, it supports several models, such as RealCUGAN, RealESRGAN, Waifu2x, and SRMD.
331 | 
332 | https://github.com/Tohrusky/Final2x
333 | 
334 | ## Upscayl
335 | Upscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes :) Makes use of Real-ESRGAN
336 | 
337 | https://github.com/upscayl/upscayl
338 | 
339 | ## Anime4kSharp
340 | 
341 | https://github.com/shadow578/Anime4kSharp
342 | 
343 | ## dandere2x-tremx
344 | 
345 | video compression 
346 | https://github.com/Tremeschin/dandere2x-tremx
347 | 
348 | ## Cupscale
349 | 
350 | Image Upscaling GUI based on ESRGAN - WORK IN PROGRESS
351 | 
352 | https://github.com/n00mkrad/cupscale
353 | 
354 | ## IEU.Winforms
355 | 
356 | Windows only GUI for ESRGAN with additional features 
357 | https://github.com/ptrsuder/IEU.Winforms
358 | 
359 | ## NVIDIA-Ansel-AI-Enhancer
360 | 
361 | NVIDIA Upscaler, utilizes NVIDIA´s Ansel RTX. Requires latest drivers and a RTX GPU. Free, but not open source. 
362 | 
363 | https://github.com/dynamiquel/NVIDIA-Ansel-AI-Enhancer 
364 | 
365 | ## RealSR
366 | 
367 | Real-World Super-Resolution via Kernel Estimation and Noise Injection 
368 | 
369 | https://github.com/jixiaozhong/RealSR
370 | 
371 | (https://github.com/nihui/RealSR)
372 | 
373 | ## realsr-ncnn-vulkan
374 | 
375 | https://github.com/nihui/realsr-ncnn-vulkan
376 | 
377 | ## Shell upscaler
378 | 
379 | https://github.com/n00mkrad/shell-upscaler
380 | 
381 | Acts as a GUI for esrgan-launcher.
382 | 
383 | ## srmd-ncnn-vulkan
384 | 
385 | https://github.com/nihui/srmd-ncnn-vulkan
386 | 
387 | ## VapourSynth-Super-Resolution-Helper
388 | 
389 | https://github.com/AlphaAtlas/VapourSynth-Super-Resolution-Helper
390 | 
391 | ## Waifu2x-Extension-GUI
392 | 
393 | Photo/Video/GIF enlargement using machine learning
394 | 
395 | https://github.com/AaronFeng753/Waifu2x-Extension-GUI
396 | 
397 | C++, python, qt, caffe, ncnn_vulkan
398 | 
399 | AGPL-3.0 License
400 | 
401 | ## waifu2x-ncnn-vulkan
402 | 
403 | https://github.com/nihui/waifu2x-ncnn-vulkan
404 | 
405 | # GAN / Style Transfer
406 | 
407 | ## TorchRayLib
408 | 
409 | TorchRayLib++: A CMake based AI & generative art platform, integrating the rayib GUI and the PyTorch C++ Deep Learning Library.
410 | 
411 | https://github.com/QuantScientist/TorchRayLib
412 | 
413 | ## more
414 | https://github.com/Seprentes/ScaleUp
415 | https://github.com/HolyWu/vs-realesrgan
416 | https://github.com/tumuyan/RealSR-NCNN-Android
417 | https://github.com/TransparentLC/realesrgan-gui/blob/master/README.en-US.md
418 | https://github.com/Lucchetto/SuperImage
419 | https://github.com/Tohrusky/Final2x
420 | 
421 | # :toolbox: DL Training GUI
422 | 
423 | ## AIDeveloper
424 | 
425 | GUI-based software for training, evaluating and applying deep neural nets for image classification 
426 | 
427 | https://github.com/maikherbig/AIDeveloper
428 | 
429 | # 🧩 Ready to use Plugins
430 | 
431 | ## vstSpleeter
432 | 
433 | audio source separation library vst plugin
434 | 
435 | https://github.com/gvne/vstSpleeter
436 | 
437 | 
438 | ---
439 | 
440 | 
441 | ## various apps (untested)
442 | 
443 | Paper2GUI is an AI desktop Application toolbox for everyone no matter your background with it being free and ready to use out of the box, already supporting 10+ AI models, covering the fields of speech synthesis, video frame filling, video overscoring, target detection, image stylization, OCR recognition, etc. Supporting Windows, Mac OS, Linux systems.
444 | 
445 | https://github.com/Baiyuetribe/paper2gui/blob/main/README_en.md
446 | 
447 | Free: https://www.nvidia.com/en-us/studio/canvas/
448 | 


--------------------------------------------------------------------------------