├── README.md
└── README-CN.md
/README.md:
--------------------------------------------------------------------------------
1 |
2 |
Awesome AI Tools
3 |

4 |
5 |
6 | English | [中文](README-CN.md)
7 |
8 | This repo collects awesome AI tools. Welcome everyone to recommend more awesome AI tools together! Please use the following template as a reference for your recommendations. [issue](https://github.com/ikaijua/Awesome-AITools/issues/233)
9 |
10 |
11 |
12 |
13 | ## All Categories
14 | - [All Categories](#all-categories)
15 | - [ChatGPT and other AI chat assistant](#chatgpt-and-other-ai-chat-assistant)
16 | - [AI Search engine](#ai-search-engine)
17 | - [Open Source LLMs](#open-source-llms)
18 | - [LLM Leaderboard](#llm-leaderboard)
19 | - [GPT LLMs Applications](#gpt-llms-applications)
20 | - [Programming Development](#programming-development)
21 | - [AI Image Creation](#ai-image-creation)
22 | - [Video Creation](#video-creation)
23 | - [AI Cloud Platform](#ai-cloud-platform)
24 | - [LLM Prompts](#llm-prompts)
25 | - [LLM training platform](#llm-training-platform)
26 | - [AI Agent](#ai-agent)
27 | - [Writing](#writing)
28 | - [Translation](#translation)
29 | - [Speech Recognition](#speech-recognition)
30 | - [Text To Speech](#text-to-speech)
31 | - [Music Recognition](#music-recognition)
32 | - [Voice Processing](#voice-processing)
33 | - [AI generated music or sound effects](#ai-generated-music-or-sound-effects)
34 | - [Speech translation](#speech-translation)
35 | - [Video Content Summary](#video-content-summary)
36 | - [Academic research](#academic-research)
37 | - [OCR](#ocr)
38 |
39 | ### ChatGPT and other AI chat assistant
40 | | Name | Description | Links | Fees |
41 | | ---- | ----------------------------- | --- | --- |
42 | | Gemini| Google's LLM, including Gemini-3 pro|[gemini](https://gemini.google.com/)
[ai.google.dev](https://ai.google.dev/)|Free/Paid|
43 | | ChatGPT | OpenAI's AI assistant | [URL](https://chat.openai.com) | Free/Paid |
44 | | Claude| Anthropic's LLM|[URL](https://claude.ai/)| Free/Paid|
45 | | DeepSeek | DeepSeek's AI assistant. [API](https://platform.deepseek.com/api_keys)|[URL](https://chat.deepseek.com/)|Free/Paid|
46 | | Grok | xAI's AI assistant |1.[x.com/grok](https://x.com/i/grok)
2.[grok.com](https://grok.com/)|Free|
47 | | Microsoft Copilot| Microsoft's AI assistant.|[URL](https://copilot.microsoft.com/)|Free|
48 | | Le Chat| Mistral.ai's conversational, AI chat service|[URL](https://chat.mistral.ai/chat)|Free|
49 | | qwen | Alibaba's AI assistant. Includes Qwen3, Qwen3-Code and other Qwen LLMs|[URL](https://chat.qwen.ai/)|Free|
50 |
51 | ### AI Search engine
52 | | Name | Description | Links | Fees |
53 | | --- | --- | --- | --- |
54 | | Perplexity.ai | AI-driven conversational search engine. | [URL](https://www.perplexity.ai) | Free|
55 | | You.com | A search engine in conversation mode | [URL](https://you.com) | Free |
56 | | Morphik.ai | Open source AI-driven search engine for private documents | [URL](https://morphik.ai) [Github](https://github.com/morphik-org/morphik-core) | Free |
57 |
58 | ### Open Source LLMs
59 | | Name | Description | Links | Fees |
60 | | ---- | ----------------------------- | --- | --- |
61 | | DeepSeek-R1 |DeepSeek's first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning.|[Github](https://github.com/deepseek-ai/DeepSeek-R1) |Free|
62 | | DeepSeek-V3 |A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.|[Github](https://github.com/deepseek-ai/DeepSeek-V3) |Free|
63 | | Qwen3 |Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.|[Github](https://github.com/QwenLM/Qwen3) |Free|
64 | | Llama 3 | Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model.
Online test address:
[huggingface.co/Meta-Llama-3-70B-Instruct](https://huggingface.co/chat/models/meta-llama/Meta-Llama-3-70B-Instruct) |[GitHub](https://github.com/meta-llama/llama3) | Free |
65 | | Mixtral |Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT3.5 on most standard benchmarks.
paper:https://arxiv.org/pdf/2401.04088.pdf
news:https://mistral.ai/news/mixtral-of-experts/ |[mistral-inference](https://github.com/mistralai/mistral-inference) 
[mistral-finetune](https://github.com/mistralai/mistral-finetune) |Free|
66 | |grok-1|A large language model open sourced by xAI|[Github](https://github.com/xai-org/grok-1) |Free|
67 | |Phi-3| Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.|[Github](https://github.com/microsoft/Phi-3CookBook) |Free|
68 |
69 | ### LLM Leaderboard
70 | | Name | Description | Links | Fees |
71 | | ---- | ----------------------------- | --- | --- |
72 | |LMSYS Chatbot Arena Leaderboard|LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. Collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale. |[URL](https://lmarena.ai/leaderboard) |Free|
73 | |Artificial Analysis|Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost.|[URL](https://artificialanalysis.ai/)|Free|
74 | |LiveCodeBench|LiveCodeBench is a holistic and contamination-free evaluation benchmark of LLMs for code that continuously collects new problems over time. Particularly, LiveCodeBench also focuses on broader code-related capabilities, such as self-repair, code execution, and test output prediction, beyond mere code generation. |[URL](https://livecodebench.github.io/leaderboard.html)|Free|
75 | |LLM Stats|LLM Stats, the most comprehensive LLM leaderboard, benchmarks and compares API models using daily‑updated, open‑source community data on capability, price, speed, and context length.|[URL](https://llm-stats.com/)|Free|
76 |
77 | ### GPT LLMs Applications
78 | | Name | Description | Links | Fees |
79 | -|-|-|-
80 | | Poe | AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free | [URL](https://poe.com/) | Free, with paid upgrades|
81 | |Cherry Studio|Cherry Studio is a desktop client that supports for multiple LLM providers, available on Windows, Mac and Linux. Support major LLM Cloud Services: OpenAI, Gemini, Anthropic, and more AI Web Service Integration: Claude, Peplexity, Poe, and others Local Model Support with Ollama, LM Studio|[Github](https://github.com/CherryHQ/cherry-studio) |Free|
82 | | HuggingChat|Open source codebase powering the HuggingChat app. [URL](https://huggingface.co/chat/)|[Github](https://github.com/huggingface/chat-ui) |Free|
83 | | Google AI Studio|Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app development. [Available regions](https://ai.google.dev/gemini-api/docs/available-regions#available_regions)|[URL](https://aistudio.google.com/)|Free|
84 | | NotebookLM |AI Research Assistant developed by Google. Upload PDFs, websites, YouTube videos, audio files, Google Docs, or Google Slides, and NotebookLM will summarize them and make interesting connections between topics. Audio Overview feature can turn your sources into engaging “Deep Dive” discussions with one click. |[URL](https://notebooklm.google.com/)|Free|
85 | | Learn about |AI learning Assistant developed by Google.Grasp new topics and deepen your understanding with a conversational learning companion that adapts to your unique curiosity and learning goals.|[URL](https://learning.google.com/experiments/learn-about)|Free|
86 | | monica | AI assistant that provides help with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available| [URL](https://monica.im)
[chrome extension](https://chromewebstore.google.com/detail/monica-your-ai-copilot-po/ofpnmcalabcbjgholdjcjblkibolbppb)|Free, with paid upgrades|
87 | | ollama |Get up and running with Llama 2, Mistral, Gemma, and other large language models.|[Github](https://github.com/ollama/ollama) | Free |
88 | | openai/openai-python | The official Python library for the OpenAI API, It is generated from [OpenAPI specification ](https://github.com/openai/openai-openapi) with [Stainless](https://stainlessapi.com/) | [Github](https://github.com/openai/openai-python)| Free, need OpenAPI [apikey](https://platform.openai.com/account/api-keys) |
89 | |sashabaranov/go-openai|This library provides unofficial Go clients for OpenAI API. support: ChatGPT, GPT-3, GPT-4, DALL·E 2|[Github](https://github.com/sashabaranov/go-openai)|Free|
90 | |langchain|LangChain is a framework for developing applications powered by language models.|[Github](https://github.com/langchain-ai/langchain) |Free|
91 | |Helicone AI|Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications.|[Github](https://github.com/Helicone/helicone) |Free|
92 | |ChatGPT-Next-Web|One-Click to get a well-designed cross-platform ChatGPT web UI, with GPT3, GPT4 & Gemini Pro support.|[Github](https://github.com/ChatGPTNextWeb/ChatGPT-Next-Web) |Free|
93 | | screenshot-to-code | This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website! | [GitHub](https://github.com/abi/screenshot-to-code) | Free, need access to GPT-4 Vision|
94 | | Chatbox | Desktop application that uses ChatGPT API (OpenAI API) to store all chat messages and prompts locally, thus reducing the risk of data loss. A bit more stable to use than the web version| [GitHub](https://github.com/Bin-Huang/chatbox) | Free, requires [apikey with OpenAPI](https://platform.openai.com/account/api-keys)|
95 | |together.ai chat|Similar to HuggingChat, with the option of different open source models, support for DeepSeek R1, LLaMA, QWen, Flux Schnell. 60 free messages per day.|[URL](https://chat.together.ai/)|Free/Paid|
96 | | gpt-crawler | Crawl a site to generate knowledge files to create your own custom GPT from a URL | [Github](https://github.com/BuilderIO/gpt-crawler)| Free |
97 | | ChatGPT-Shortcut | Open source, ChatGPT shortcut commands that double productivity, partitioned by domain and function, can filter prompt words by tag, keyword search and one-click copy. |[GitHub](https://github.com/rockbenben/ChatGPT-Shortcut) |Free|
98 | |ChatGPT Sidebar|ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website. |[URL](https://chrome.google.com/webstore/detail/chatgpt-sidebar-support-g/difoiogjjojoaoomphldepapgpbgkhkb)|Free|
99 | | WebChatGPT | Open source, expand the ability of networking to chatgpt | [GitHub](https://github.com/qunash/chatgpt-advanced) | Free|
100 | | AIPRM for ChatGPT |Browser plug-in, providing a series of selected ChatGPT instruction templates, and even creating your own, and adjusting AI tone and writing style| [URL](https://chrome.google.com/webstore/detail/aiprm-for-chatgpt/ojnbohmppadfgpejeebfnmnknjdlckgj) | Free|
101 | | MindMac | Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages. | [URL](https://mindmac.app/) | Free, with paid upgrades|
102 | | chathub | Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc. | [GitHub](https://github.com/chathub-dev/chathub) |Free/Paid|
103 | | Harbor | Effortlessly run LLM backends, APIs, frontends, and services with one command. | [GitHub](https://github.com/av/harbor) | Free |
104 | |gemini-fullstack-langgraph-quickstart|Get started with building Fullstack Agents using Gemini 2.5 and LangGraph|[Github](https://github.com/google-gemini/gemini-fullstack-langgraph-quickstart) |Free|
105 |
106 | ### Programming Development
107 | | Name | Description | Links | Fees |
108 | | ---- | ----------------------------- | --- | --- |
109 | | Cursor | A collaborative code editor using GPT | [URL](https://www.cursor.so) | Paid/Free Trial |
110 | | GitHub Copilot | A code writing assistant developed by GitHub and OpenAI | [URL](https://github.com/features/copilot) | Paid|
111 | | Trae | Trae is your helpful coding partner. It offers features like AI Q&A, code auto-completion, and agent-based AI programming capabilities. | [URL](https://www.trae.ai/) | Free|
112 | | MarsCode |Built-in AI programming assistant with capabilities like code completion, explanation, and debugging for faster development.|[URL](https://www.marscode.com/)|Free|
113 | | Amazon CodeWhisperer | A code writing assistant developed by Amazon| [URL](https://aws.amazon.com/cn/codewhisperer)| Free for Individual Use|
114 | | Codeium | Powerful in-IDE AI coding assistant|[URL](https://codeium.com/)|Free/Paid|
115 | | scalene |Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals|[Github](https://github.com/plasma-umass/scalene) |Free|
116 | | Fitten Code | Fitten Code is an AI programming assistant driven by Fitten LLM models, which can automatically generate code, improve development efficiency, help you debug, and save your time. It can also chat with you and solve your programming problems.freeand supports over 80 languages: Python, C++,JavaScript, TypeScript, Java, etc. Fitten Code supports Visual Studio Code and JetBrains series IDEs, including IntelliJ IDEA, PyCharm, WebStorm, etc.|[URL](https://code.fittentech.com/en?lang=en)| Free |
117 | | Plandex | Open source, terminal-based AI programming engine for complex tasks | [GitHub](https://github.com/plandex-ai/plandex) | Free |
118 | | Roundtable | Zero-configuration MCP server that unifies multiple AI coding assistants for enhanced development workflows. Intelligent client management platform enabling seamless coordination between Claude Code, Cursor, GPT-4, and other AI development tools. | [GitHub](https://github.com/askbudi/roundtable)  [Website](https://askbudi.ai/roundtable) | Free |
119 | | Mistral/Codestral|[Empowering developers and democratising coding with Mistral AI.](https://mistral.ai/news/codestral/), models:https://huggingface.co/mistralai/Codestral-22B-v0.1|[URL](https://chat.mistral.ai/chat)|Free|
120 | | Kodus | Open Source Code Review Agent | [GitHub](https://github.com/kodustech/kodus-ai/)
| Free/Paid|
121 |
122 | ### AI Image Creation
123 | | Name | Description | Links | Fees |
124 | | ---- | ----------------------------- | --- | --- |
125 | | Nano Banana/Nano Banana Pro|Google's advanced AI model for image generation and editing. No. 1 in the LMArea Text to Image and Image Edit leadboard. Online website:
1. [gemini](https://gemini.google.com/app)
2.[aistudio](https://aistudio.google.com/prompts/new_chat?model=gemini-2.5-flash-image-preview)
3. [lmarea.ai](https://lmarena.ai/?mode=direct&chat-modality=image)|[URL](https://aistudio.google.com/prompts/new_chat?model=gemini-2.5-flash-image-preview)|Free/Paid|
126 | | Z-Image | Z-Image is a high-performance image generation model recently open-sourced by Alibaba's Tongyi Lab. It strikes a balance between "extreme speed" and "high quality," making it highly suitable for scenarios requiring rapid image generation. Z-Image-Turbo Online Demo: https://huggingface.co/spaces/mrfakename/Z-Image-Turbo | [Github](https://github.com/Tongyi-MAI/Z-Image)  | Free |
127 | | Midjourney | Enter text or pictures to create pictures | [URL](https://www.midjourney.com) | Paid |
128 | | ChatGPT Images |GPT Image 1.5|[URL](https://chatgpt.com/images)|Free/Paid|
129 | | Photoshop AI| Adobe Photoshop generative-fill| [URL](https://www.adobe.com/products/photoshop/generative-fill.html) |Paid|
130 | | Stable diffusion webui | Open source project, input text or pictures to create pictures, Stable diffusion webui is the GUI of Stable diffusion, and it is an image user interface that visualizes stable diffusion. It also integrates many other useful extension scripts. | [GitHub](https://github.com/AUTOMATIC1111/stable-diffusion-webui) | Free|
131 | | civitai | civitai.com is a website platform for sharing AI image creation model resources, with a large number of models, has become the main model exchange place in the SD open source community | [URL](https://civitai.com/) | Free|
132 | | clipdrop | clipdrop by stability.ai. Has many AI image processing tools, such as stable diffusion XL, uncrop, reimage XL, stable doodle. | [URL](https://clipdrop.co/) | Free/Paid |
133 | | firefly | Adobe's AI image processing web site |[URL](https://firefly.adobe.com/)|Free/Paid|
134 | | ideogram.ai | Enter text to create pictures. A product developed by a company founded by many ex-Googlers |[URL](https://ideogram.ai/)| Free/Paid |
135 | | Nero AI | AI picture upscale, AI repair scratches, AI picture coloring, AI picture noise removal, AI one-click to change the background, AI magical erasing pen, AI portrait. API doc:https://ai.nero.com/ai-api/docs/|[URL](https://ai.nero.com/)|Paid/Trial|
136 | | Skybox AI | Generate 360-degree panoramic images using text prompts | [URL](https://skybox.blockadelabs.com/)| Free/Paid|
137 | | remove.bg |Remove Image Background|[URL](https://www.remove.bg/)|Free/Paid|
138 | | ControlNet |ControlNet is a neural network structure to control diffusion models by adding extra conditions.|[Github](https://github.com/lllyasviel/ControlNet) |Free|
139 |
140 | ### Video Creation
141 |
142 | | Name | Description | Links | Fees |
143 | | ---- | ----------------------------- | --- | --- |
144 | | Wan2.6 |AI Video Creation Tool by Alibaba | [URL](https://create.wan.video/) | Paid/Free trial |
145 | | Sora | Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions. | [URL](https://openai.com/sora) | Paid |
146 | | KLING AI|AI Video Creation Tool by kuaishou. |[URL](https://klingai.com/)|Free/Paid|
147 | | hailuoai|AI Video Creation Tool by Minimax|[URL](https://hailuoai.com/video)|Free/Paid|
148 | | Dream Machine|By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images.[Official introductory video](https://www.youtube.com/watch?v=Zb3tffmBPRE)|[URL](https://lumalabs.ai/dream-machine)|Free/Paid|
149 | | capcut | Subtitle-generated speech, speech recognition, and very convenient and powerful video editing|[URL](https://www.capcut.com/)|Free/Paid|
150 | | Runway | Gen-2: Text/Image to video
Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks | [URL](https://runwayml.com/) | Paid/Free trial|
151 | | pixverse | Create Amazing AI Videos from Text & Photos |[URL](https://app.pixverse.ai/)|Paid/Free trial|
152 | | Pika | Text/Image to video |[URL](https://pika.art/home)|Paid/Free trial|
153 | | Fliki | A website that converts text into audio and video | [URL](https://fliki.ai) | Free/Paid |
154 | | d-id | Generate digital human dubbing video based on text | [URL](https://studio.d-id.com) | Paid/Free trial|
155 | | HeyGen | Generate digital human dubbing video based on text | [URL](https://app.heygen.com/) | Paid/Free trial|
156 | | AnimateDiff | AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training.| [Github](https://github.com/guoyww/AnimateDiff) |Free|
157 | |vivago.ai/video|Text to Video; Image to Video; 4K enhance|[URL](https://vivago.ai/video)|Free|
158 | | MaxVideoAI | Multi-engine AI video generation platform (Sora-style, Pika, Veo, Kling & more) | [URL](https://maxvideoai.com) | Free/Paid |
159 |
160 |
161 | ### AI Cloud Platform
162 | | Name | Description | Links | Fees |
163 | | ---- | ----------------------------- | --- | --- |
164 | |together.ai|The AI Acceleration Cloud. Train, fine-tune-and run inference on AI models blazing fast, at low cost, and at production scale.|[URL](https://www.together.ai/) |Free/Paid|
165 |
166 | ### LLM Prompts
167 | | Name | Description | Links | Fees |
168 | | ---- | ----------------------------- | --- | --- |
169 | |f/awesome-chatgpt-prompts|This repo includes ChatGPT prompt curation to use ChatGPT better.|[Github](https://github.com/f/awesome-chatgpt-prompts)  |Free|
170 |
171 | ### LLM training platform
172 | | Name | Description | Links | Fees |
173 | | ---- | ----------------------------- | --- | --- |
174 | | lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. | [Github](https://github.com/lm-sys/FastChat) | Free |
175 |
176 |
177 | ### AI Agent
178 | | Name | Description | Links | Fees |
179 | | ---- | ----------------------------- | --- | --- |
180 | |Gemini CLI|An open-source AI agent that brings the power of Gemini directly into your terminal.|[Github](https://github.com/google-gemini/gemini-cli/)|Free|
181 | |agentscope|Agent-Oriented Programming for Building LLM Applications, Open-sourced by Alibaba|[Github](https://github.com/agentscope-ai/agentscope)|Free|
182 | |Auto-GPT|Open source, An experimental open-source attempt to make GPT-4 fully autonomous.|[GitHub](https://github.com/Torantulino/Auto-GPT) |Free|
183 | |OthersideAI/self-operating-computer|A framework to enable multimodal models to operate a computer.|[Github](https://github.com/OthersideAI/self-operating-computer) |Free,GPT-4v required|
184 | |AppAgent|Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.|[Github](https://github.com/mnotgod96/AppAgent) |Free|
185 | |microsoft/autogen|AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks. |[Github](https://github.com/microsoft/autogen) |Free|
186 | |potpie-ai/potpie|Open Source AI Agents for your codebase in minutes. Use pre-built agents for Q&A, Testing, Debugging and System Design or create your own purpose-built agents. |[URL](https://potpie.ai) , [Github](https://github.com/potpie-ai/potpie) |Free Trial|
187 | |saplings|A framework for building agents that use search algorithms to complete tasks. |[Github](https://github.com/shobrook/saplings) |Free|
188 | |MastraAI|Mastra is an opinionated TypeScript framework that helps you build AI applications and features quickly. It gives you the set of primitives you need: workflows, agents, RAG, integrations and evals|[Github](https://github.com/mastra-ai/mastra) |Free|
189 |
190 | ### Writing
191 | | Name | Description | Links | Fees |
192 | | ---- | ----------------------------- | --- | --- |
193 | | Notion AI | AI-assisted note-taking software | [URL](https://www.notion.so)| with certain free AI trials, AI features $10/month |
194 | | Deep L Write | English and German writing tools to fix writing errors and rewrite sentences promptly. | [URL](https://www.deepl.com/write) | Free version to use with text word limit / paid upgrade available |
195 | | grammarly | Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor.| [URL](https://app.grammarly.com/) | Free/Paid|
196 | | TextCraft | Add-in for Microsoft Word that seamlessly integrates essential AI tools, including text generation, proofreading, and more, directly into the user interface. | [URL](https://github.com/suncloudsmoon/TextCraft) | Free |
197 |
198 |
199 |
200 | ### Translation
201 | | Name | Description | Links | Fees |
202 | | ---- | ----------------------------- | --- | --- |
203 | | Google Translate|Support text, picture, document and URL|[URL](https://translate.google.com/)|Free|
204 | | Deep L | Accurate and instant translation tool, currently supporting 31 languages | [URL](https://www.deepl.com/translator) | Free/Paid|
205 | | immersive-translate | Open source project. Immersive bilingual web translation extension | [GitHub](https://github.com/immersive-translate/immersive-translate/) | Free |
206 | | openai-translator | Open source project. Crossword translation browser plugin and cross-platform desktop application based on ChatGPT API | [GitHub](https://github.com/yetone/openai-translator) | Free, requires OpenAI API key |
207 | |RTranslator |RTranslator is an open-source, free, and offline real-time translation app for Android.|[Github](https://github.com/niedev/RTranslator) |Free|
208 |
209 | ### Speech Recognition
210 | | Name | Description | Links | Fees |
211 | | ---- | ----------------------------- | --- | --- |
212 | | whisper | OpenAPI open source robust speech recognition model through large-scale weak supervision | [GitHub](https://github.com/openai/whisper) | Free |
213 | | whisper.cpp | Port of OpenAI's Whisper model in C/C++|[Github](https://github.com/ggml-org/whisper.cpp) |Free|
214 | | buzz | An open source desktop software based on OpenAI's Whisper to recognize speech and generate subtitles | [GitHub](https://github.com/chidiwilliams/buzz) | Free |
215 | | WhisperDesktop| Open source, OpenAI-based Whisper, a desktop application for Windows, uses the GPU for processing, which will be faster than on the CPU with good GPU performance.|[GitHub](https://github.com/Const-me/Whisper) |Free|
216 | | whisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)| [whisperX](https://github.com/m-bain/whisperX)  |Free|
217 | | whisper-web | ML-powered speech recognition directly in your browser. Built with [Transformers.js](https://github.com/xenova/transformers.js). [Demo](https://huggingface.co/spaces/Xenova/whisper-web) | [GitHub](https://github.com/xenova/whisper-web) |Free|
218 |
219 | ### Text To Speech
220 | | Name | Description | Links | Fees |
221 | | ---- | ----------------------------- | --- | --- |
222 | | index-tts2 |Bilibili's Open-Source Industrial-Grade Controllable High-Efficiency Zero-Sample Text-to-Speech System.
Online Demo: https://huggingface.co/spaces/IndexTeam/IndexTTS-2-Demo
Paper: https://arxiv.org/abs/2506.21619|[Github](https://github.com/index-tts/index-tts)  |Free|
223 | | Azure Text to speech| The best and most realistic voice tools currently available| [URL](https://speech.microsoft.com/portal/voicegallery) |Paid / 500,000 characters per month free|
224 | | Hailuo AI Text to Speech | Offer over 300 voices in 17 languages and multiple accents, covering a wide range of styles and age groups to provide the voice effects you need.|[URL](https://www.hailuo.ai/audio)|Limited-time Free|
225 | | coqui-ai/tts | A deep learning toolkit for Text-to-Speech, battle-tested in research and production
Online Demo: https://huggingface.co/spaces/coqui/xtts| [Github](https://github.com/coqui-ai/tts)  | Free|
226 | | elevenlabs | Intelligent AI Text to Speech |[URL](https://elevenlabs.io/)|Free/Paid|
227 | | netease-youdao/EmotiVoice | A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, and with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others.|[Github](https://github.com/netease-youdao/EmotiVoice) | Free|
228 | | tetos |A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS|[Github](https://github.com/frostming/tetos) |Free|
229 | | ChatTTS |ChatTTS is a text-to-speech model designed specifically for dialogue scenario such as LLM assistant. It supports both English and Chinese languages. Our model is trained with 100,000+ hours composed of chinese and english. Website:https://chattts.com/|[Github](https://github.com/2noise/ChatTTS)|Free|
230 |
231 | ### Music Recognition
232 | | Name | Description | Links | Fee |
233 | | ---- | ----------------------------- | --- | --- |
234 | |shazam| Download the shazaom app for music recognition, which is pretty fast |[URL](https://www.shazam.com/)| Free|
235 |
236 | ### Voice Processing
237 | | Name | Description | Links | Fees |
238 | | ---- | ----------------------------- | --- | --- |
239 | |so-vits-svc| SoftVC VITS Singing Voice Conversion.|[GitHub](https://github.com/svc-develop-team/so-vits-svc) |Free|
240 | |vocalremover| Extract vocal and music|[URL](https://vocalremover.org/)|Free|
241 | |lala.ai|Extract vocal, accompaniment and various instruments from any audio and video|[URL](https://www.lalal.ai/)|Free/Paid|
242 |
243 | ### AI generated music or sound effects
244 | | Name | Description | Link | Fees |
245 | | ---- | -------------------------- | --- | --- |
246 | |suno.ai|The AI music creation tool Suno can generate custom songs based on text prompts in mere second|[URL](https://www.suno.ai/)||Free/Paid|
247 | |udio|Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks.|[URL](https://www.udio.com/)||
248 | |mureka.ai|Text to music|[URL](https://www.mureka.ai/)|Free/Paid|
249 | |elevenlabs/sound-effects|Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community.|[URL](https://elevenlabs.io/app/sound-effects)|Free|
250 | |suno-ai/bark|Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects.|[Github](https://github.com/suno-ai/bark) |Free|
251 | |audiocraft|Open source library for audio/music generation by Meta, which mainly includes two models, MusicGen: text-to-music model, AudioGen: text-generated sound model. [MusicGen Online Demo](https://huggingface.co/spaces/facebook/MusicGen)|[GitHub](https://github.com/facebookresearch/audiocraft) |Free|
252 | |Stable Audio|AI music and sound effect generation application by stability.ai|[URL](https://www.stableaudio.com/)|Free/Paid|
253 | |OptimizerAI|Sound effect generation
[Official Introduction](https://twitter.com/OptimizerAI/status/1779881263358419243)|[URL](https://www.optimizerai.xyz/) |Free/Paid|
254 | |SFX Engine|AI Sound effect generation |[URL](https://sfxengine.com/) |Free/Paid|
255 | |MuseGen|An AI music studio for lyric writing and song generation |[URL](https://musegen.org) |Free/Paid|
256 |
257 | ### Speech translation
258 | | Name | Description | Links | Fees |
259 | | ---- | ----------------------------- | --- | --- |
260 | | Seamless |Seamless is a family of AI models that enable more natural and authentic communication across languages.[Online Demo](https://seamless.metademolab.com/expressive?utm_source=metaai&utm_medium=web&utm_campaign=fair10&utm_content=blog)|[Github](https://github.com/facebookresearch/seamless_communication) |Free|
261 |
262 |
263 | ### Video Content Summary
264 | | Name | Description | Links | Fees |
265 | | ---- | ----------------------------- | --- | --- |
266 | | ChatGPT for YouTube | Chrome plugin, quickly summarize Youtube video content, need to log in chatgpt account or apikey | [URL](https://chatgpt4youtube.com/)| Free |
267 | | Chat Youtube | Give a Youtube link, it will give a summary, and you can ask it questions about the content of the video |[URL](https://chatyoutube.com) | Free |
268 |
269 | ### Academic research
270 | | Name | Description | Links | Fees |
271 | | ---- | ----------------------------- | --- | --- |
272 | | alphaxiv | An open academic discussion community based on the arXiv platform that allows users to comment line-by-line, ask questions, and interact in real-time by replacing the paper's linking domain (arxiv.org for alphaxiv.org) directly on the paper's page. And provides AI features such as Ask AI and AI-generated article blogs | [URL](https://www.alphaxiv.org/)| Free |
273 |
274 |
275 | ### OCR
276 | | Name | Description | Links | Fees |
277 | | ---- | ----------------------------- | --- | --- |
278 | |Umi-OCR|Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services.|[Github](https://github.com/hiroi-sora/Umi-OCR) |Free|
279 | |allenai/olmocr|A toolkit for training language models to work with PDF documents in the wild. Online demo: https://olmocr.allenai.org/|[Github](https://github.com/allenai/olmocr) |Free|
280 |
281 |
282 | [](https://star-history.com/#ikaijua/Awesome-AITools&Date)
283 |
284 |
--------------------------------------------------------------------------------
/README-CN.md:
--------------------------------------------------------------------------------
1 | [English](README.md) | 中文
2 |
3 | **这个仓库收集整理AI相关的实用工具,欢迎大家一起推荐更多实用的AI工具,[推荐参考模板](https://github.com/ikaijua/Awesome-AITools/issues/232)**
4 |
5 | - [AI新闻动态](https://github.com/ikaijua/Awesome-AITools/discussions?discussions_q=is%3Aopen+label%3A%22AI+news%22)
6 | - [赞赏支持](#赞赏支持)
7 |
8 |
9 | ## 全部分类
10 | - [ChatGPT及类似大语言模型AI助手](#chatgpt及类似大语言模型ai助手)
11 | - [开源大语言模型](#开源大语言模型)
12 | - [大语言模型排行榜](#大语言模型排行榜)
13 | - [GPT/LLMs 应用](#gpt-llms应用)
14 | - [编程开发](#编程开发)
15 | - [AI图像创作](#ai图像创作)
16 | - [AI视频创作](#ai视频创作)
17 | - [AI云平台](#ai云平台)
18 | - [ChatGPT Prompts](#chatgpt-prompts)
19 | - [大语言模型训练-评估平台](#大语言模型训练-评估平台)
20 | - [AI工具箱类软件](#ai工具箱类软件)
21 | - [AI Agent](#ai-agent)
22 | - [AI搜索](#ai搜索)
23 | - [阅读](#阅读)
24 | - [写作](#写作)
25 | - [翻译工具](#翻译工具)
26 | - [语音识别-生成字幕](#语音识别-生成字幕)
27 | - [文字转语音](#文字转语音)
28 | - [音乐识别](#音乐识别)
29 | - [变声软件](#变声软件)
30 | - [声音克隆](#声音克隆)
31 | - [语音翻译](#语音翻译)
32 | - [语音合成](#语音合成)
33 | - [语音处理](#语音处理)
34 | - [AI生成音乐-音效](#ai生成音乐-音效)
35 | - [视频翻译](#视频翻译)
36 | - [学术科研](#学术科研)
37 | - [OCR图像识别文字](#ocr图像识别文字)
38 | - [视频内容总结](#视频内容总结)
39 | - [AI生成模特试装和商品图](#ai生成模特试装和商品图)
40 | - [人形机器人](#人形机器人)
41 |
42 | ## 评测
43 | - [大语言模型评测](#大语言模型评测)
44 |
45 | ## 精选文章
46 | - [chatgpt相关文章](#chatgpt相关文章)
47 |
48 | ### ChatGPT及类似大语言模型AI助手
49 | | 名称 | 说明 | 链接 | 费用 |
50 | | ---- | ----------------------------- | --- | --- |
51 | | Gemini| Google 的对话式AI工具和大语言模型,最新的 Gemini 2.5 pro 和 Gemini 2.5 Flash 模型。最新推出的Gemini 2.5 Flash Image (Nano Banana)在LMArea文本转图像和图像编辑排行榜中位列榜首,人物一致性实现了突破性的进步|1. [gemini](https://gemini.google.com/)
2. [aistudio](https://aistudio.google.com)|免费|
52 | | 通义千问 |阿里的大语言模型
qwen.ai中可体验最新的模型和不同的模型,最强的 Qwen3-Max-Thinking-Preview,有深度研究的选项|[URL](https://www.qianwen.com/)|免费|
53 | | ChatGPT | openAI的chatgpt,最新模型 GPT-5 应用示例: [B站视频:豆包 vs GPT,语音对决!豆包的魅力女友让人难以招架~](https://www.bilibili.com/video/BV1EgymYmEhB/)[B站视频:这9款工具帮你榨干ChatGPT,解锁隐藏玩法](https://www.bilibili.com/video/BV1qs4y1D7ED) [B站视频:格斗之王!AI写出来的AI竟然这么强!](https://www.bilibili.com/video/BV1DT411H7ph)
[可汗学院创始人Khan最新TED演讲:GPT-4作为AI学习私教,可能带来教育史上最大变革](https://www.bilibili.com/video/BV1Xa4y137rR)|[URL](https://chat.openai.com) | 免费/付费|
54 | | 豆包 | 字节跳动旗下的AI聊天软件 ; [豆包chrome插件](https://chromewebstore.google.com/detail/dbjibobgilijgolhjdcbdebjhejelffo)
体验测试视频:
[B站视频:豆包 vs GPT,语音对决!豆包的魅力女友让人难以招架~](https://www.bilibili.com/video/BV1EgymYmEhB/)
[B站视频:百模大战-抖音子公司推出AI聊天机器人豆包](https://www.bilibili.com/video/BV1b84y1o7E4/)|[URL](https://www.doubao.com/)|免费|
55 | | 腾讯元宝/混元 |腾讯元宝提供了DeepSeek R1和腾讯自家的混元模型可使用;腾讯混元 AI Studio 提供了各种 AI 工具,包括 AI 对话助手、文生图、文/图生视频等各种模型和工具|1.[腾讯元宝智能助手](https://hunyuan.tencent.com/bot)
2. [混元 AI Studio](https://hunyuan.tencent.com/)|免费|
56 | | DeepSeek | DeepSeek的AI助手。 [API](https://platform.deepseek.com/api_keys)|[URL](https://chat.deepseek.com/)|免费/付费|
57 | | Claude|Anthropic研发的AI助手Claude。以编程能力强著称。最新模型是 Claude Opus 4 和Claude Sonnet 4|[URL](https://claude.ai/)| 免费/付费|
58 | | 月之暗面的Kimi Chat|支持联网,文章总结能力比较强。[chrome插件:Kimi浏览器助手](https://chromewebstore.google.com/detail/icmdpfpmbfijfllafmfogmdabhijlehn)
[张鹏对谈月之暗面杨植麟:大模型创业需要新的组织范式](https://www.xiaoyuzhoufm.com/episode/659d17352e26fb9934b8dceb)|1. [kimi](https://kimi.moonshot.cn/)
2. [Moonshot AI开放平台](https://platform.moonshot.cn/)|免费|
59 | | Grok | xAI研发的AI助手,结合了x上的文章内容。马斯克的AI公司的产品 |[URL](https://x.com/i/grok)|免费|
60 | | 微软Copilot | 微软的Copilot,包含了多种AI工具和插件 | [URL](https://copilot.microsoft.com/) | 免费 |
61 | | Le Chat| Mistral AI 推出了为 Le Chat 的聊天助手 |[URL](https://chat.mistral.ai/chat)|免费|
62 | | 智谱AI | 最新的GLM-4.6模型 | 1. [URL](https://chat.z.ai/)
2. [API开发者网站](https://open.bigmodel.cn/)| 免费|
63 |
64 | ### 开源大语言模型
65 | | 名称 | 说明 | 链接 | 费用 |
66 | | ---- | ----------------------------- | --- | --- |
67 | | DeepSeek-R1 |DeepSeek 的第一代推理模型 DeepSeek-R1-Zero 和 DeepSeek-R1。DeepSeek-R1-Zero 是一种通过大规模强化学习(RL)训练的模型,没有监督微调(SFT)作为初步步骤,在推理性能表现卓越。|[Github](https://github.com/deepseek-ai/DeepSeek-R1) |免费|
68 | | DeepSeek-V3 |DeepSeek推出的大语言模型,MoE 模型,671B 参数,激活 37B,在 14.8T token 上进行了预训练。|[Github](https://github.com/deepseek-ai/DeepSeek-V3) |免费|
69 | | Llama 3 | Llama3是Meta AI开发的开源的大型语言模型, 它是Llama 语言模型v3版本。
Llama3在线测试地址:[huggingface.co/Meta-Llama-3-70B-Instruct](https://huggingface.co/chat/models/meta-llama/Meta-Llama-3-70B-Instruct)|[GitHub](https://github.com/meta-llama/llama3) | 免费 |
70 | | Mixtral-8x7B |法国人工智能初创公司 Mistral AI开源的一种具有开放权重的稀疏专家混合模型 (SMoE),在大多数基准测试中都优于 Llama 2 70B 和 GPT-3.5
论文地址:https://arxiv.org/pdf/2401.04088.pdf
论文主页:https://mistral.ai/news/mixtral-of-experts/ |[Github](https://github.com/mistralai/mistral-src) |免费|
71 | |grok-1|马斯克的xAI公司开源的大语言模型|[Github](https://github.com/xai-org/grok-1) |免费|
72 | | Qwen(通义千问) |阿里研发的通义千问大模型系列
在线Demo地址:
[Qwen-7B-Chat-Demo](https://modelscope.cn/studios/qwen/Qwen-7B-Chat-Demo/summary)
[Qwen-72B-Chat-Demo](https://modelscope.cn/studios/qwen/Qwen-72B-Chat-Demo/summary)
[Qwen1.5 72B 在线体验](https://huggingface.co/spaces/Qwen/Qwen1.5-72B-Chat)| [Qwen-7B](https://github.com/QwenLM/Qwen-7B) 
[Qwen1.5](https://github.com/QwenLM/Qwen1.5)| 免费 |
73 | | ChatGLM2-6B | 中英双语对话模型 ChatGLM-6B 的第二代版本 | [GitHub](https://github.com/THUDM/ChatGLM2-6B) | 免费|
74 | | Phi-3| Phi-3是微软开发的开放式人工智能模型系列。Phi-3 模型是目前能力最强、最具成本效益的小型语言模型(SLM),在各种语言、推理、编码和数学基准测试中,其性能均优于相同大小和更大的模型。|[Github](https://github.com/microsoft/Phi-3CookBook) |免费|
75 |
76 | ### 大语言模型排行榜
77 | | Name | Description | Links | Fees |
78 | | ---- | ----------------------------- | --- | --- |
79 | |LMSYS Chatbot Arena Leaderboard|LMSYS Chatbot Arena 是一个用于大语言模型评估的众包开放平台。收集了超过 1,000,000 次人类成对比较,用 Bradley-Terry 模型对 LLM 进行排名,并以 Elo 标度显示模型评级。
B站视频:[量子位/1v1单挑90万轮之后,最强大模型是……](https://www.bilibili.com/video/BV1Qs421w7df/) |[URL](https://lmarena.ai/leaderboard) |免费|
80 | |Artificial Analysis|Artificial Analysis 是一个提供 AI 模型和服务商比较及基准测试的资源平台,帮助用户在选择 AI 模型和服务提供商时做出明智决策。平台提供多种流行 AI 模型的比较数据,包括 OpenAI 的 GPT-4、Meta 的 Llama 3 和 Anthropic 的 Claude 系列,涵盖了响应速度、延迟和成本等性能指标。|[URL](https://artificialanalysis.ai/)|免费|
81 | |LiveCodeBench|LiveCodeBench 是一个全面且无污染的 LLM 代码评估基准,它会持续收集新的问题。LiveCodeBench 尤其关注更广泛的代码相关功能,例如自我修复、代码执行和测试输出预测,而不仅仅是代码生成。 |[URL](https://livecodebench.github.io/leaderboard.html)|免费|
82 |
83 | ### GPT-LLMs应用
84 | | 名称 | 说明 | 链接 | 费用 |
85 | | ---- | ----------------------------- | --- | --- |
86 | | Google AI Studio|Google AI Studio 是一个基于 Web 的免费平台,允许开发者使用 Google 的大型语言模型(如 Gemini)进行原型设计和实验。它提供了一个易于使用的界面,你可以快速构建文本生成、代码生成、聊天机器人等应用。[可用的国家和地区](https://ai.google.dev/gemini-api/docs/available-regions#available_regions)
介绍:B站视频:[一枚卓子/Google AI Studio教程|体验Gemini 2.0 flash 模型,和它视频聊天,创造提示词机器人](https://www.bilibili.com/video/BV1ejkgYcEi5/)|[URL](https://aistudio.google.com/)|免费|
87 | |Cherry Studio|Cherry Studio 是一款支持多个大语言模型(LLM)服务商的桌面客户端,兼容 Windows、Mac 和 Linux 系统。支持主流 LLM 云服务:OpenAI、Gemini、Anthropic、硅基流动等;集成了流行 AI Web 服务:Claude、Peplexity、Poe、腾讯元宝、知乎直答等;支持 Ollama、LM Studio 本地模型部署|[Github](https://github.com/CherryHQ/cherry-studio) |免费|
88 | | NotebookLM |NotebookLM是谷歌推出的一款强大的虚拟研究助手,它可以将各种类型的文件,包括文本、视频、音频甚至数据集,转化成生动有趣的播客节目(播客音频目前只支持英语)。除此之外,NotebookLM 还可以生成常见问题解答、学习指南、目录、时间轴和简报等,并支持用户进行自由对话和事实核查。|[URL](https://notebooklm.google.com/)|免费|
89 | | Learn about |谷歌开发的人工智能学习助手。它是一个会话式的学习伙伴,能适应您独特的好奇心和学习目标,帮助您掌握新主题并加深理解。|[URL](https://learning.google.com/experiments/learn-about)|免费|
90 | | Poe | 美版知乎 Quora 构建的AI 产品,有web和客户端。目前的情况是ChatGPT、Sage、Dragonfly、Claude 机器人可以免费、无限制、实时使用。只需要一个邮箱即可注册。可以随时切换AI而对话不中断,并且对话记录是在线保存并且同步到客户端的。chatgpt-4可以每天免费使用一次 视频介绍:[B站视频:神器!与chatGPT类似的新人工智能问答AI:Poe, 美国知乎Quaro最新产品,专业回答](https://www.bilibili.com/video/BV13Y411B7Az)| [URL](https://poe.com/) |免费,有付费升级版|
91 | | bot.360|360构建的AI对话机器人,集合了国内主要的一些大模型比如豆包、kimi、MiniMax、通义千问等|[URL](https://bot.360.com/)|免费|
92 | | HuggingChat|Hugging Face 的开源聊天应用程序 Hugging Chat. [URL](https://huggingface.co/chat/)|[Github](https://github.com/huggingface/chat-ui) |免费|
93 | | monica | AI助手,提供搜索、阅读、写作、翻译、绘画等多种任务的帮助。有独立应用和浏览器插件| [URL](https://monica.im)
[chrome插件](https://chromewebstore.google.com/detail/monica-your-ai-copilot-po/ofpnmcalabcbjgholdjcjblkibolbppb)|免费/付费|
94 | | ollama | 在本地环境中轻松运行和管理大型语言模型,如Llama 、Mistral、Gemma2等|[Github](https://github.com/ollama/ollama)  |免费|
95 | | openai/openai-python | OpenAI API 的官方 Python 库,它是使用[Stainless](https://stainlessapi.com/)根据[OpenAPI 规范]((https://github.com/openai/openai-openapi))生成的 | [Github](https://github.com/openai/openai-python)| 免费,需要使用OpenAPI的[apikey](https://platform.openai.com/account/api-keys) |
96 | |sashabaranov/go-openai|OpenAI API的Go语言非官方的SDK,支持ChatGPT、GPT-3、 GPT-4、DALL·E 2|[Github](https://github.com/sashabaranov/go-openai)|免费|
97 | |langchain|是一个强大的框架,旨在帮助开发人员使用语言模型构建端到端的应用程序。它提供了一套工具、组件和接口,可简化创建由大型语言模型 (LLM) 和聊天模型提供支持的应用程序的过程。LangChain 可以轻松管理与语言模型的交互,将多个组件链接在一起,并集成额外的资源,例如 API 和数据库。|[Github](https://github.com/langchain-ai/langchain) |免费|
98 | |ChatGPT-Next-Web|一键免费部署你的跨平台私人 ChatGPT 应用, 支持 GPT3, GPT4 & Gemini Pro 模型|[Github](https://github.com/ChatGPTNextWeb/ChatGPT-Next-Web) |免费|
99 | |anything-llm|开源的文档聊天机器人解决方案|[Github](https://github.com/Mintplex-Labs/anything-llm) |免费|
100 | | screenshot-to-code | 插入截图并将其转换为简洁的 HTML/Tailwind/JS 代码,使用了GPT-4 Vision来生成代码,使用DALL-E 3生成图片 | [GitHub](https://github.com/abi/screenshot-to-code) | 免费,需要有GPT-4 Vision的授权|
101 | | Chatbox | 使用ChatGPT API(OpenAI API)的桌面应用程序, 将所有的聊天信息和提示信息存储在本地,从而减少了数据丢失的风险。比网页版使用更稳定些| [GitHub](https://github.com/Bin-Huang/chatbox) | 免费,需要使用OpenAPI的[apikey](https://platform.openai.com/account/api-keys)|
102 | |together.ai chat|与 HuggingChat 类似,可选择不同的开源模型,支持 DeepSeek R1、LLaMA、QWen 和 Flux Schnell。每天 60 条免费信息。|[URL](https://chat.together.ai/)|免费/付费|
103 | | ChatGPT for Google |开源项目,浏览器插件,在搜索页面增加chatgpt的内容和对话框|[GitHub](https://github.com/wong2/chatgpt-google-extension) |免费,需要chatgpt账号|
104 | | gpt-crawler | 可以爬取指定网站中的内容,并生成json文件,可以直接上传到GPTs的知识库使用 | [Github](https://github.com/BuilderIO/gpt-crawler)| 免费|
105 | | ChatGPT-Shortcut | 开源,让生产力加倍的 ChatGPT 快捷指令,按照领域和功能分区,可对提示词进行标签筛选、关键词搜索和一键复制。| [GitHub](https://github.com/rockbenben/ChatGPT-Shortcut) |免费|
106 | |ChatGPT Sidebar|ChatGPT 边栏是您在浏览任何网站时可以使用的人工智能助手。 视频介绍:[B站视频:CharGPT初体验,浏览器安装人工智能侧边栏AI Sidebar扩展程序](https://www.bilibili.com/video/BV1Y24y1L7JA)|[URL](https://chrome.google.com/webstore/detail/chatgpt-sidebar-support-g/difoiogjjojoaoomphldepapgpbgkhkb)|免费|
107 | | WebChatGPT |开源程序,给chatgpt扩展联网的能力 视频介绍:[B站视频:可以让ChatGPT直接联网的扩展程序](https://www.bilibili.com/video/BV1bY4y1C7N3) | [GitHub](https://github.com/qunash/chatgpt-advanced) | 免费|
108 | | AIPRM for ChatGPT |浏览器插件,提供一系列精选ChatGPT 指令模板,甚至还能够自己创建,还可以调整AI 语气和写作风格 B站视频:[集大成者!ChatGPT百宝箱,内置多种功能,所见即所得!](https://www.bilibili.com/video/BV1LT411S7GK)| [URL](https://chrome.google.com/webstore/detail/aiprm-for-chatgpt/ojnbohmppadfgpejeebfnmnknjdlckgj) | 免费|
109 | | MindMac | 功能丰富、隐私第一的 macOS 原生 ChatGPT 应用程序,可在一个地方使用 OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter,旨在实现最大生产力。 目前有 15 种语言版本。| [URL](https://mindmac.app/) | 免费,有付费升级版 |
110 | | chathub | 浏览器插件,在一个应用中使用不同的聊天机器人,目前支持 ChatGPT、新的 Bing Chat、Google Bard 和 Claude (via Poe),未来将集成更多机器人, 同时与多个聊天机器人聊天,方便比较它们的答案 | [GitHub](https://github.com/chathub-dev/chathub) |免费,付费支持更多功能|
111 |
112 | ### 编程开发
113 | | 名称 | 说明 | 链接 | 费用 |
114 | | ---- | ----------------------------- | --- | --- |
115 | | Trae | 字节跳动推出的类似Cursor的AI编程IDE|[URL](http://trae.com.cn)|免费|
116 | | Cursor | 使用 GPT进行协作的代码编辑器 | [URL](https://www.cursor.so) | 付费/免费试用 |
117 | | GitHub Copilot | GitHub 和 OpenAI 合作开发的一个代码编写助手 [Github Copilot技巧和窍门](https://bilibili.com/video/BV1ic411T7Jd) [Github Copilot X的Chat功能介绍](https://www.bilibili.com/video/BV1Ho4y137Tu/),[Copilot X申请页面](https://github.com/features/preview/copilot-x)| [URL](https://github.com/features/copilot) | 付费 |
118 | | 通义灵码|阿里云开发的代码编写助手,可根据当前代码文件及跨文件的上下文,为你生成行级/函数级代码、单元测试、代码注释等,支持 Java、Python、Go、JavaScript、TypeScript、C/C++、C# 等主流语言,同时兼容 Visual Studio Code、JetBrains IDEs 等主流编程工具|[URL](https://tongyi.aliyun.com/lingma/)|免费|
119 | | 豆包MarsCode|字节跳动旗下的AI代码助手,提供智能补全、智能预测、智能问答等能力|[URL](https://www.marscode.cn/)|免费|
120 | | CodeGeeX | 智谱AI旗下的代码生成大模型,支持200多种主流编程语言的生成及翻译。开源模型:
[CodeGeeX2](https://github.com/THUDM/CodeGeeX2/) 
[CodeGeex4](https://github.com/THUDM/CodeGeeX4)  [【项目原作解读】清华大学郑勤锴:CodeGeeX大规模多语言代码生成模型](https://www.bilibili.com/video/BV1wT41127Tq/) | [URL](https://codegeex.cn/) |免费|
121 | | Amazon CodeWhisperer | 亚马逊开放的AI编程辅助工具,根据你的注释和现有代码,实时生成从片段到完整功能的代码建议。在各种IDE的插件中可以安装,支持15种语言, 包括 Python, Java, and JavaScript等。只需要按照流程注册一个aws builder账号即可。| [URL](https://aws.amazon.com/cn/codewhisperer)| 免费|
122 | | Fitten Code | Fitten Code是由非十大模型驱动的AI编程助手,可以自动生成代码,提升开发效率,调试Bug。还可以对话聊天,解决您编程碰到的问题。免费且支持80多种语言:Python、C++、Javascript、Typescript、Java等。并提供丰富的IDE支持,包括Visual Studio Code、JetBrains系列IDE等。
“技术胖”B站视频:[清华初创对决微软Github,哪家AI编程助手更强](https://www.bilibili.com/video/BV1MH4y1s7sU/)| [URL](https://code.fittentech.com/) | 免费 |
123 | |腾讯云AI代码助手|腾讯云 AI 代码助手主要提供两类功能:AI 助手对话功能和代码补全功能。|[URL](https://console.cloud.tencent.com/acc)|免费|
124 | |Mistral/Codestral|Mistral.ai的代码生成大语言模型,官方介绍:[Empowering developers and democratising coding with Mistral AI.](https://mistral.ai/news/codestral/), 模型下载:https://huggingface.co/mistralai/Codestral-22B-v0.1|[URL](https://chat.mistral.ai/chat) 模型选择Codestral|免费|
125 |
126 | ### AI图像创作
127 | | 名称 | 说明 | 链接 | 费用 |
128 | | ---- | ----------------------------- | --- | --- |
129 | | Nano Banana/Nano Banana Pro|谷歌用于图像生成与编辑的先进人工智能模型。在LMArea文本转图像和图像编辑排行榜中位列榜首。
[Nano Banana 的用法合集](https://github.com/ikaijua/Awesome-AITools/wiki/Nano-Banana-%E7%9A%84%E7%94%A8%E6%B3%95%E5%90%88%E9%9B%86)
在线网站:
1. [aistudio](https://aistudio.google.com/prompts/new_chat?model=gemini-2.5-flash-image-preview)
2. [gemini](https://gemini.google.com/app)
3. [lmarea.ai](https://lmarena.ai/?mode=direct&chat-modality=image)|[URL](https://aistudio.google.com/prompts/new_chat?model=gemini-2.5-flash-image-preview) |免费/付费|
130 | |Z-Image|Z-Image是阿里巴巴通义实验室(Tongyi Lab)于近期开源的一款高性能图像生成模型。它主打“极速”与“高质量”的平衡,非常适合需要快速出图的场景。Z-Image-Turbo在线demo: https://huggingface.co/spaces/mrfakename/Z-Image-Turbo|[Github](https://github.com/Tongyi-MAI/Z-Image) |免费|
131 | | Midjourney | 输入文字或图片进行图片创作。应用示例:
[尝试用chatGPT+midjourney进行科研绘图,被效果震惊到了。。。](https://www.bilibili.com/video/BV1XM411T7uP) | [URL](https://www.midjourney.com) | 付费 |
132 | | Stable diffusion webui | 开源项目,输入文字或图片进行图片创作, Stable diffusion webui是Stable diffusion的GUI是将stable diffusion实现可视化的图像用户操作界面,它本身还集成了很多其它有用的扩展脚本。
新手入门教程:https://www.bilibili.com/video/BV1Qo4y167AK/ AI风格化视频或AI真人视频的效果:1. [【AI动画】欣小萌天台蹦迪 动画版](https://www.bilibili.com/video/BV1RL411U7wR),2. [死磕真人AI动作,人物和背景的终于不闪了,你们觉得哪个更好点?](https://www.bilibili.com/video/BV1Fs4y1V7f7)3. [5分钟,教会你如何生成AI动画](https://www.bilibili.com/video/BV13s4y1D7Ni)| [GitHub](https://github.com/AUTOMATIC1111/stable-diffusion-webui) | 免费|
133 | | 即梦AI|字节跳动旗下的文生图、AI视频生成和AI图片编辑应用|[URL](https://jimeng.jianying.com/ai-tool/home)|免费/付费|
134 | | Photoshop 生成式AI功能| 在Adobe Photoshop中使用生成式AI填充功能。功能介绍: 1. [B站视频:Photoshop 革命性新功能-生成式填充功能介绍](https://www.bilibili.com/video/BV1su411Y79Z/)
2. [巫师后期B站视频:引爆点——Photoshop核弹级更新(创成式AI填充)彻底改变图片行业!](https://www.bilibili.com/video/BV1qo4y1E7tK)| [URL](https://www.adobe.com/products/photoshop/generative-fill.html) |Photoshop 订阅会员可下载Beta版本试用|
135 | | firefly |Adobe 的AI图片处理网站|[URL](https://firefly.adobe.com/)|免费/付费|
136 | | clipdrop | stability.ai 公司旗下的图像处理网站,包含文生图、AI扩图、图生图、去除背景等功能 | [URL](https://clipdrop.co/)| 免费/ 付费|
137 | | civitai | Civitai(C站)是一个用于分享AI图像创作模型资源的网站平台,拥有大量模型,已成为SD开源社区主要的模型交流场所 |[URL](https://civitai.com/)|免费|
138 | | 文心一格 | 百度旗下的文生图和AI图片编辑应用| [URL](https://yige.baidu.com/)| 免费/付费 |
139 | | 通义万相 | 阿里旗下的文生图和AI图片创作应用| [URL](https://wanxiang.aliyun.com/) | 免费 |
140 | | 美图的奇想智能MiracleVision|美图的文生图应用|[URL](https://www.miraclevision.com/text-to-image/)|免费|
141 | | ideogram.ai | AI 文字生成图片的网站。前谷歌AI绘画4位大牛创立的公司推出的产品 | [URL](https://ideogram.ai/) | 免费 |
142 | | Skybox AI | 输入文字生成360度全景图片 | [URL](https://skybox.blockadelabs.com/)| 免费/ 付费|
143 | | Nero AI | AI图片放大、修复划痕、AI图片上色、AI图片去噪、AI一键抠图换背景、AI神奇擦除笔、AI写真;[介绍](https://github.com/ikaijua/Awesome-AITools/issues/100)。API文档:https://ai.nero.com/ai-api/docs/|[URL]( https://ai.nero.com/)|付费/试用|
144 | | remove.bg |一键删除图片背景|[URL](https://www.remove.bg/)|免费/付费|
145 | |ControlNet|能够在一个text2image上训练的扩散模型进行高效finetune,并且结合特定的condition输入,得到可控的效果|[Github](https://github.com/lllyasviel/ControlNet) |免费|
146 | |black-forest-labs/flux|FLUX.1 模型的官方推理资源库|[Github](https://github.com/black-forest-labs/flux) |免费|
147 | |GeminiImageApp|一个现代化的全栈 AI 图像处理平台,集成了 Google Gemini、OpenCV 和 YOLO 等先进技术,提供图像问答、生成、编辑、目标检测、图像分割和视频生成等功能。|[Github](https://github.com/0xsline/GeminiImageApp)|免费|
148 |
149 | ### AI视频创作
150 | | 名称 | 说明 | 链接 | 费用 |
151 | | ---- | ----------------------------- | --- | --- |
152 | | 通义万相 | 阿里旗下AI图片和视频创作应用| [URL](https://tongyi.aliyun.com/wanxiang/videoCreation) | 免费/付费 |
153 | | 海螺AI| Minimax的AI视频生成平台|[URL](https://hailuoai.com/video)|免费/付费|
154 | | 快手可灵|支持文生视频和图生视频|[URL](https://kling.kuaishou.com/)|免费/付费|
155 | | 即梦AI|字节跳动旗下的文生图、AI视频生成和AI图片编辑应用|[URL](https://jimeng.jianying.com/ai-tool/home)|免费/付费|
156 | | 剪映 |字幕生成语音、语音生成字幕、字幕翻译、一键图文成片,还有很便捷、强大的视频剪辑功能
识别字幕是vip功能|[URL](https://www.capcut.cn/)|免费/付费|
157 | | PixVerse | 利用文本和照片创建令人惊叹的人工智能视频 |[URL](https://app.pixverse.ai/)|付费/试用|
158 | | 腾讯混元AI视频|文生视频、图生视频功能;对口型和动作驱动功能:可以通过上传照片和音频或选择动作模版生成视频; 需要排队|[URL](https://video.hunyuan.tencent.com/)|免费|
159 | | Sora | OpenAI的文本生成视频的模型。Sora技术报告:https://github.com/ikaijua/Awesome-AITools/discussions/54| [URL](https://sora.com) | 付费 |
160 | | Dream Machine|由 Luma AI 提供。Dream Machine 是一个人工智能模型,能根据文本和图像快速制作出高质量、逼真的视频。[官方介绍视频](https://www.youtube.com/watch?v=Zb3tffmBPRE)|[URL](https://lumalabs.ai/dream-machine)|免费/付费|
161 | | Runway | Gen-2: 文本/图像 AI生成视频
Gen-1: 根据视频AI生成视频
应用示例:
[B站视频:数字生命卡兹克/我用AI做了一部《流浪地球3》的预告片](https://www.bilibili.com/video/BV1hF411f7rg)
精选视频:https://runwayml.com/staff-picks | [URL](https://runwayml.com/) | 免费试用/付费|
162 | | MOKI |美图的AI短片创作工具|[URL](www.moki.cn)| 免费试用/付费|
163 | | Pika | 文本/图像 AI生成视频| [URL](https://pika.art/home)| 免费试用/付费|
164 | | krea.ai| 提供文生图/视频、图片放大、模型训练等功能,Krea ai想做视频和图片界的 POE,目前集成了海螺、luma、Runway和可灵四家最好的视频生成模型。|[URL](https://www.krea.ai/)|免费试用/付费|
165 | | Fliki | 將文字生成音频和视频的网站 | [URL](https://fliki.ai) | 免费试用/付费 |
166 | | d-id | 根据文字生成数字人的配音视频 | [URL](https://studio.d-id.com) | 免费试用/付费 |
167 | | HeyGen | 根据文字生成数字人的配音视频 | [URL](https://app.heygen.com/) | 免费试用/付费 |
168 | | AnimateDiff | Animatediff是香港中文大学团队开源的AI视频生成方法,基于Stable DIffusion的开源基建,8月份开源模型之后,一个月就把AI视频生成的质量提高了几个等级。
介绍文章:[这款工具让你一秒成AI版宫崎骏,AI视频“ChatGPT时刻”快到了](https://mp.weixin.qq.com/s/NgYv6VBSBRIBOFuyUnMnxA)| [Github](https://github.com/guoyww/AnimateDiff) |免费|
169 | |vivago.ai/video| 文本/图像生成视频; 4K视频增强|[URL](https://vivago.ai/video)| 免费|
170 |
171 | ### AI云平台
172 | | 名称 | 说明 | 链接 |费用|
173 | | ---- | ----------------------------- | --- | --- |
174 | | Together AI |Together AI是一个专为生成式AI设计的云平台,提供了从模型推理、微调到GPU集群部署等多种服务。相比其他传统云平台,Together AI 主要聚焦于高效处理开源生成式模型,并为开发者和企业提供更灵活、定制化的解决方案。Together AI 支持多个开源模型,包括 LLaMA、Falcon、FLUX1 等。这些模型覆盖了从自然语言处理、对话系统到代码生成等多个领域,满足了不同场景下的应用需求。用户可以直接调用这些模型,也可以上传自己的数据进行微调,提升模型在特定任务中的表现。 文章介绍:
[Together AI是一个生成式AI服务平台](https://mp.weixin.qq.com/s/qyFPqlotBayTDHaZSmSogw) |[URL](https://www.together.ai/)|免费/付费|
175 |
176 | ### ChatGPT Prompts
177 | | 名称 | 说明 | 链接 |费用|
178 | | ---- | ----------------------------- | --- | --- |
179 | |f/awesome-chatgpt-prompts|This repo includes ChatGPT prompt curation to use ChatGPT better.|[Github](https://github.com/f/awesome-chatgpt-prompts)  |Free|
180 |
181 | ### 大语言模型训练-评估平台
182 | | Name | Description | Links | Fees |
183 | | ---- | ----------------------------- | --- | --- |
184 | | FastChat | 用于训练、服务和评估大型语言模型的开放平台。Vicuna 和 Chatbot Arena 的发布仓库。| [Github](https://github.com/lm-sys/FastChat) | Free |
185 |
186 | ### AI工具箱类软件
187 | | 名称 | 说明 | 链接 | 费用 |
188 | | ---- | ----------------------------- | --- | --- |
189 | |Paper2GUI|一款面向普通人的 AI 桌面 APP 工具箱,免安装即开即用,已支持 40+AI 模型,内容涵盖 AI 绘画、语音合成、视频补帧、视频超分、目标检测、图片风格化、OCR 识别等领域。支持 Windows、Mac、Linux 系统。[B站视频介绍:补帧超分抠图配音,这个开源AI工具箱对小白太友好了!](https://www.bilibili.com/video/BV1jY411u7yU/)|[GitHub](https://github.com/Baiyuetribe/paper2gui) |免费|
190 |
191 | ### AI Agent
192 | | 名称 | 说明 | 链接 | 费用 |
193 | | ---- | ----------------------------- | --- | --- |
194 | |Gemini CLI|一个开源的基于Gemini的命令行终端AI智能体|[Github](https://github.com/google-gemini/gemini-cli/)|免费|
195 | |agentscope|面向Agent的编程:构建大型语言模型应用程序。阿里开源|[Github](https://github.com/agentscope-ai/agentscope)|免费|
196 | |Auto-GPT|开源项目,使用gpt自主地实现你设定的任何目标。演示示例:[爆火的自主人工智能AutoGPT,程序员表示开始真正有点担忧会失业了!](https://www.bilibili.com/video/BV1Ph4y1W7Yj)|[GitHub](https://github.com/Torantulino/Auto-GPT) |免费,需要OpenAI API key|
197 | |OthersideAI/self-operating-computer|一个使用多模态模型(默认模型为GPT-4v)能够操作计算机的框架|[Github](https://github.com/OthersideAI/self-operating-computer) |免费,需要GPT-4v|
198 | |AppAgent|可以操作手机应用程序的AI Agent|[Github](https://github.com/mnotgod96/AppAgent) |免费|
199 | |microsoft/autogen|AutoGen 是一个开源编程框架,用于构建人工智能Agent,并促进多个Agent之间的合作,以解决任务。 |[Github](https://github.com/microsoft/autogen) |免费|
200 | |Taskade AI| 在统一的工作空间内构建、训练和部署自主AI代理,用于任务管理、团队协作和工作流自动化。通过结构化列表、笔记和思维导图提升团队生产力。 | [URL](https://www.taskade.com/) | 每日免费AI额度 / 支持付费升级 |
201 |
202 | ### ai搜索
203 | | 名称 | 说明 | 链接 | 费用 |
204 | | --- | --- | --- | --- |
205 | | 秘塔搜索 | 搜索网络信息并提供汇总信息,并附有参考链接,还创建话题知识库|[URL](https://metaso.cn/)|免费|
206 | | 知乎直答 |知乎的AI搜索,有通用搜索和专业搜索;介绍:B站视频[朋克周/专业报告和学术期刊为你所用,AI搜索迎来新选择](https://www.bilibili.com/video/BV1U6SXYFECC/)|[URL](https://zhida.zhihu.com/)|免费|
207 | | IMA |IMA是腾讯推出的一款AI智能工作台,它集成了搜索、阅读、写作、知识库管理等多种功能。目前只有Mac和Windows客户端。搜索相比其他搜索能覆盖微信公众号文章,支持知识库管理比如上传本地文件、公众号文章或网页链接,构建个人知识库。支持写作但目前不支持文件夹的功能。|[URL](https://ima.qq.com/) |免费|
208 | | You.com | 结合对话模式的搜索引擎 | [URL](https://you.com) | 免费 |
209 | | Perplexity.ai | Perplexity.ai 是一个基于 GPT-3 的 AI 工具,类似 New Bing 的搜寻引擎、会附上参考结果 | [URL](https://www.perplexity.ai) | 免费|
210 | | MindSearch |中科大和上海人工智能实验室联合研发国产开源搜索引擎MindSearch(思・索),采用分层检索策略,先广泛搜索再精确选择,有效管理互联网上的海量信息。[在线Demo](https://mindsearch.openxlab.org.cn/)|[Github](https://github.com/InternLM/mindsearch) |免费|
211 |
212 |
213 | ### 阅读
214 | | 名称 | 说明 | 链接 | 费用 |
215 | | --- | --- | --- | --- |
216 | | 微信读书 | “AI问书”功能,在阅读时遇到不理解的内容,可以通过AI问书功能获得即时解释。AI问书的回答通常包含注释和相关书籍推荐,并且可以通过点击回答中的链接跳转到相关书籍的特定选段,增加回答的可信度[更多介绍](https://github.com/ikaijua/Awesome-AITools/discussions/77#discussioncomment-9559619) | [URL](https://weread.qq.com/) | 免费/付费 |
217 |
218 | ### 写作
219 | | 名称 | 说明 | 链接 | 费用 |
220 | | ---- | ----------------------------- | --- | --- |
221 | | Notion AI | AI辅助的笔记软件,主要包括AI创作文章、翻译、修正语法、摘要和总结等 视频示例:[B站视频:Notion AI完整介绍 \| 十个节省时间的神功能(ChatGPT般强大)](https://www.bilibili.com/video/BV1Lg411b7Cx) | [URL](https://www.notion.so)| 有一定免费的AI试用次数,AI功能10$/每月 |
222 | | verse | 印象笔记推出的AI写作工具 |[URL](https://verse.app.yinxiang.com/product)|免费|
223 | | 写作猫 | 集AI写作、多人协作、文本校对、改写润色、自动配图等功能为一体AI Native内容创作平台| [URL](https://xiezuocat.com/)| 免费|
224 | | Deep L Write | 英文、德文写作工具,可以及時修正写作錯誤、改写句子。 | [URL](https://www.deepl.com/write) | 免費版本使用有文字字数限制/有付费升级版 |
225 | | grammarly | 纠正语法、拼写、标点符号等错误的写作助手| [URL](https://app.grammarly.com/) | 免费/有付费升级版|
226 | | 火山写作 | 写作润色、翻译 | [URL](https://www.writingo.net/document) |免费|
227 | | TextCraft | Microsoft Word 的加载项,无缝集成了包括文本生成、校对等在内的核心 AI 工具,直接嵌入用户界面。| [URL](https://github.com/suncloudsmoon/TextCraft) | 免费 |
228 |
229 |
230 | ### 翻译工具
231 | | 名称 | 说明 | 链接 | 费用 |
232 | | ---- | ----------------------------- | --- | --- |
233 | | Google 翻译|支持不同的格式,包括文本、图片、文档和网址|[URL](https://translate.google.com/)|免费|
234 | | Deep L | 准确即时的翻译工具,目前支持 31 种语言 | [URL](https://www.deepl.com/translator) | 免费/付费|
235 | | immersive-translate | 开源的,沉浸式双语网页翻译扩展 | [GitHub](https://github.com/immersive-translate/immersive-translate/)  | 免费 |
236 | | openai-translator | 基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 | [GitHub](https://github.com/yetone/openai-translator) | 免费,需要OpenAI API key |
237 | |RTranslator | RTranslator 是一款开源、免费的安卓离线实时翻译应用程序。|[Github](https://github.com/niedev/RTranslator) |免费|
238 |
239 | ### 语音识别-生成字幕
240 | | 名称 | 说明 | 链接 | 费用 |
241 | | ---- | ----------------------------- | --- | --- |
242 | | whisper | 开源,OpenAPI 开源的通过大规模的弱监督进行鲁棒性的语音识别的模型 | [GitHub](https://github.com/openai/whisper)  | 免费 |
243 | | whisper.cpp | OpenAI 的 Whisper 模型在 C/C++ 中的实现|[Github](https://github.com/ggml-org/whisper.cpp) |Free|
244 | | VideoCaptioner |基于 LLM 的智能字幕助手,无需GPU一键高质量字幕视频合成!支持生成、断句、优化、翻译全流程。让视频字幕制作简单高效!
视频介绍:[痕继痕迹/开源免费!一键生成字幕并翻译,中日英多语言支持!- 字幕生成、断句、优化、翻译全流程处理](https://www.bilibili.com/video/BV1giBqYtEqG/)| [Github](https://github.com/WEIFENG2333/VideoCaptioner) |免费|
245 | | buzz | 开源,基于OpenAI的Whisper识别语音并生成字幕的开源桌面软件,使用CPU进行处理 | [GitHub](https://github.com/chidiwilliams/buzz) | 免费 |
246 | | WhisperDesktop| 开源,基于OpenAI的Whisper,Windows系统的桌面应用,使用GPU进行处理,GPU性能好的话会比CPU上更快。使用介绍:https://www.appinn.com/const-me-whisper/|[GitHub](https://github.com/Const-me/Whisper) |免费|
247 | | whisperX | 开源,一位来自牛津大学的博士生Max Bain开源的模型,WhisperX可以按照单词对齐时间戳,**基本上生成的字幕都是完整的句子**。生成结果除了srt还有json文件,里面有每一行里面单词的时间戳,可以根据需要二次整理字幕。还能识别发言人,准确率还可以。使用示例: 1. **在google colab上使用whisperX生成youtube视频字幕的代码**:[whisperx_youtube_subtitle](https://github.com/JimLiu/whisper-subtitles/blob/main/whisperx_youtube_subtitle.ipynb),可以免费使用colab的GPU,使用GPU T4,2小时40分钟的视频字幕生成6分钟左右,挺快的。| [whisperX](https://github.com/m-bain/whisperX)  |免费|
248 | | 飞书秒记 | 上传视频或者音频可转录为文字,并可一键导出到飞书文档。处理速度很快,一个将近 2 个多小时的视频,约 6 分钟完成。 | [URL](https://www.feishu.cn/product/minutes)| 免费,有企业付费版|
249 | | 通义听悟 | 阿里旗下的语音转录应用 | [URL](https://tingwu.aliyun.com/) | 免费/付费 |
250 | | whisper-web | 在浏览器中运行ML驱动的语音识别! 使用[Transformers.js](https://github.com/xenova/transformers.js)构建。[Demo链接](https://huggingface.co/spaces/Xenova/whisper-web) | [GitHub](https://github.com/xenova/whisper-web) |免费|
251 | |阿里云智能语音交互-语音识别API|试用版3个月免费试用期,录音文件识别免费额度:2小时/日|[URL](https://ai.aliyun.com/nls)|付费/免费试用|
252 |
253 | ### 文字转语音
254 | | 名称 | 说明 | 链接 | 费用 |
255 | | ---- | ----------------------------- | --- | --- |
256 | | index-tts2 |B站开源的一个工业级可控且高效的零样本文本到语音系统。在线 Demo: https://huggingface.co/spaces/IndexTeam/IndexTTS-2-Demo
论文: https://arxiv.org/abs/2506.21619|[Github](https://github.com/index-tts/index-tts)  |免费|
257 | | 微软Azure 文本转语音| 目前最好用最真实的语音工具,包括自媒体配音最常见的云希和晓晓的声音;
效果演示:[痕继痕迹:啊?这是AI合成的?- 盘点那些超逼真的AI语音!](https://www.bilibili.com/video/BV1DC411G7Av/)教程:[免费使用微软的Azure;Azure使用详细教程](https://www.youtube.com/watch?v=YzNfMY_oqhA);| [URL](https://speech.microsoft.com/portal/voicegallery) |付费/每个月有50万字符的免费额度|
258 | | 海螺 AI 语音转文字 | 提供 17 种语言、多种口音的 300 多种声音,涵盖多种风格和年龄段|[URL](https://www.hailuo.ai/audio)|限时免费|
259 | | FireRedTTS‑2 |FireRedTTS‑2 是一种用于多说话人对话生成的长格式流式 TTS 系统,可提供稳定、自然的语音,具有可靠的说话人切换和上下文感知的韵律。小红书开源的。|[URL](https://github.com/FireRedTeam/FireRedTTS2)|免费|
260 | | 剪映 |文本朗读有很多的音色选择|[URL](https://www.capcut.cn/)|免费/vip|
261 | | TTS-Online | 提供超过160种声音选项 美真人配音选择,包含主流的小帅 小美 微软的一些语音,如果你是二次元游戏迷之类网站还提供超过1000+的动漫游戏角色的声音。网站可以提供api。分享者:[issue](https://github.com/ikaijua/Awesome-AITools/issues/31) | [URL](https://www.ttson.cn/)|免费 |
262 | | 火山引擎TTS| 火山引擎的语音合成| [URL](https://www.volcengine.com/product/tts)|付费|
263 | | 配音神器 | 有网页端、windows客户端工具,使用比较方便 |[URL](https://peiyinshenqi.club/)|付费/非 VIP 每天可试用 5 次|
264 | | coqui-ai/tts | 用于文本到语音的深度学习工具包
在线体验Demo网页:https://huggingface.co/spaces/coqui/xtts| [Github](https://github.com/coqui-ai/tts)  | 免费|
265 | | elevenlabs | 文字转语音的服务,提供多种语言 |[URL](https://elevenlabs.io/)|免费/付费|
266 | | netease-youdao/EmotiVoice | EmotiVoice是一个强大的开源TTS引擎,支持中英文双语,包含2000多种不同的音色,以及特色的情感合成功能,支持合成包含快乐、兴奋、悲伤、愤怒等广泛情感的语音。|[Github](https://github.com/netease-youdao/EmotiVoice) | Free|
267 | | tetos |适用于多个文本转语音 (TTS) 提供程序的统一接口,支持Edge TTS、OpenAI TTS、Azure TTS、Google TTS、火山引擎TTS、百度TTS|[Github](https://github.com/frostming/tetos) |免费|
268 | | ChatTTS |ChatTTS是专门为对话场景设计的文本转语音模型,例如LLM助手对话任务。它支持英文和中文两种语言。最大的模型使用了10万小时以上的中英文数据进行训练。官网:https://chattts.com/|[Github](https://github.com/2noise/ChatTTS)|免费|
269 | |FunAudioLLM/CosyVoice|阿里开源的TTS模型|[Github](https://github.com/FunAudioLLM/CosyVoice) |免费|
270 | |fish-speech|输入 10 到 30 秒的声音样本即可生成高质量的 TTS 输出|[Github](https://github.com/fishaudio/fish-speech) |免费|
271 |
272 | ### 音乐识别
273 | | 名称 | 说明 | 链接 | 费用 |
274 | | ---- | ----------------------------- | --- | --- |
275 | |shazam|下载shazaom app可以进行音乐识别,识别速度挺快的|[URL](https://www.shazam.com/)|免费|
276 |
277 | ### 变声软件
278 | | 名称 | 说明 | 链接 | 费用 |
279 | | ---- | ----------------------------- | --- | --- |
280 | |大饼 AI 变声|提供实时的 AI 变声功能|[URL](https://dubbing.tech/)|免费/付费|
281 |
282 | ### 声音克隆
283 | | 名称 | 说明 | 链接 | 费用 |
284 | | ---- | ----------------------------- | --- | --- |
285 | | 剪映 |目前只有APP端有声音克隆的功能,朗读一小段文字就能完成音色的克隆,音色效果很牛。当你添加文本时,在“文本朗读”那个功能中,点击“我的”tab,就能看到这个功能了|[URL](https://www.capcut.cn/)|限免|
286 | | 豆包 |字节跳动的AI聊天应用,豆包app中声音设置可以选择“创建我的声音”,回答问题的时候就可以用克隆的声音来回答了|[URL](https://www.doubao.com/)|免费|
287 |
288 | ### 语音翻译
289 | | 名称 | 说明 | 链接 | 费用 |
290 | | ---- | ----------------------------- | --- | --- |
291 | | Seamless |可以实时翻译100多种语言,延迟不到2秒钟,说话者仍在讲话时就开始翻译。Seamless翻译不仅仅是文字上的转换,还能保持说话者的情感和语气、语调等,使得翻译后的语音更加自然和真实。Seamless模型统一了SeamlessExpressive、SeamlessStreaming和SeamlessM4T v2的功能。旨在实现多语言、表达性和流畅的语音翻译。在线体验[Demo地址](https://seamless.metademolab.com/expressive?utm_source=metaai&utm_medium=web&utm_campaign=fair10&utm_content=blog)|[Github](https://github.com/facebookresearch/seamless_communication) |Free|
292 |
293 | ### 语音合成
294 | | 名称 | 说明 | 链接 | 费用 |
295 | | ---- | ----------------------------- | --- | --- |
296 | |so-vits-svc| So-vits-svc(也称Sovits)是基于VITS、soft-vc、VISinger2等一系列项目开发的一款开源免费 AI 语音转换软件,用户只需准备几十分钟到几个小时不等的语音或歌声数据,就能制作属于自己的 AI 声库,将一段语音或歌声转换为你想要的音色。[更多介绍](https://zh.moegirl.org.cn/zh-hans/So-vits-svc) [B站视频:手把手教学!如何自己训练一个AI歌手 - sovits本地&云端训练教程](https://www.bilibili.com/video/BV1ea4y1G7gx)|[GitHub](https://github.com/svc-develop-team/so-vits-svc) |免费|
297 | |open-mmlab/Amphion|开源音频、音乐和语音生成工具包, 在线使用:https://huggingface.co/amphion
文章介绍:机器之心:[霉霉演唱《稻香》,国内团队的Amphion音频生成火了](https://mp.weixin.qq.com/s/2oR7tu-ltnXnZqNCi-unlA)| [Github](https://github.com/open-mmlab/Amphion) |免费|
298 |
299 | ### 语音处理
300 | | 名称 | 说明 | 链接 | 费用 |
301 | | ---- | ----------------------------- | --- | --- |
302 | |vocalremover|分离人声和伴奏|[URL](https://vocalremover.org/)|有免费的试用额度/付费|
303 | |lala.ai|从任何音频和视频中提取人声、伴奏和各种乐器|[URL](https://www.lalal.ai/)|有免费的试用额度/付费|
304 |
305 | ### AI生成音乐-音效
306 | | 名称 | 说明 | 链接 | 费用 |
307 | | ---- | ----------------------------- | --- | --- |
308 | |海绵音乐|字节跳动推出的AI音乐创作网站,输入提示词和风格来创作音乐|[URL](https://www.haimian.com/)|免费|
309 | |suno.ai|使用AI通过文本来创作音乐 [suno专题页面](https://github.com/ikaijua/Awesome-AITools/discussions/63)
应用示例:
韩雪:[【AI音乐家】我在古镇用AI写歌!](https://www.bilibili.com/video/BV13a4y1m7A5/)
|[URL](https://www.suno.ai/)|免费/付费|
310 | |udio|使用AI通过文本来创作音乐|[URL](https://www.udio.com/)|免费/付费|
311 | |mureka.ai|昆仑万维的AI生成音乐应用|[URL](https://www.mureka.ai/)|Free/Paid|
312 | |elevenlabs/sound-effects|elevenlabs 提供的通过文本生成音效的工具|[URL](https://elevenlabs.io/app/sound-effects)|免费|
313 | |suno-ai/bark|文本转音频模型|[Github](https://github.com/suno-ai/bark) |免费|
314 | |audiocraft|Meta开源的一个用于音频/音乐生成的开源库,其中主要包括两个模型,MusicGen:文本到音乐模型,AudioGen:文本生成声音模型。[MusicGen在线Demo](https://huggingface.co/spaces/facebook/MusicGen)|[GitHub](https://github.com/facebookresearch/audiocraft)
|免费|
315 | |Stable Audio|stability.ai旗下的AI音乐、音效生成应用|[URL](https://www.stableaudio.com/)|免费/付费|
316 | |OptimizerAI|音效生成|[URL](https://www.optimizerai.xyz/) [官方推文介绍](https://twitter.com/OptimizerAI/status/1779881263358419243)|免费/付费|
317 |
318 | ### 视频翻译
319 | | 名称 | 说明 | 链接 | 费用 |
320 | | ---- | ----------------------------- | --- | --- |
321 | |easyvideotrans|着眼于从原始视频到翻译后最终视频的整个工作流程,[在线网站](https://easyvideotrans.com/)|[Github](https://github.com/sutro-planet/easyvideotrans) |免费|
322 | |VideoLingo|VideoLingo 是一站式视频翻译本地化配音工具,能够一键生成 Netflix 级别的高质量字幕,告别生硬机翻,告别多行字幕,还能加上高质量的克隆配音。|[Github](https://github.com/Huanshere/VideoLingo) |免费|
323 |
324 | ### 学术科研
325 | | 名称 | 说明 | 链接 | 费用 |
326 | | ---- | ----------------------------- | --- | --- |
327 | |gpt_academic|为GPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,新增Python和C++项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型。兼容llama,rwkv,盘古大模型等。|[GitHub](https://github.com/binary-husky/gpt_academic) |免费|
328 | |alphaxiv|一个基于arXiv平台的开放学术讨论社区,允许用户通过替换论文链接域名(arxiv.org替换为alphaxiv.org)直接在论文页面上进行逐行评论、提问和实时互动。并提供了 Ask AI 和 AI 生成文章博客等 AI 功能|[URL](https://www.alphaxiv.org/)|免费|
329 |
330 | ### OCR图像识别文字
331 | | 名称 | 说明 | 链接 | 费用 |
332 | | ---- | ----------------------------- | --- | --- |
333 | |微信|微信对话框中的图片有提取文字的选项,识别效果很好,使用了几次基本没有什么识别错误。
[2021-03月份 微信AI对OCR功能的介绍:三年磨一剑——微信OCR图片文字提取](https://mp.weixin.qq.com/s/8Odh9TKKoxIYDpr1h-5Y5Q)||免费|
334 | |Umi-OCR|开源、免费的离线OCR软件。支持截屏/粘贴/批量导入图片,段落排版/排除水印,扫描/生成二维码。内置多国语言库。|[Github](https://github.com/hiroi-sora/Umi-OCR) |免费|
335 | |allenai/olmocr|一个用于训练语言模型以处理实际PDF文档的工具包。Demo网址: https://olmocr.allenai.org/|[Github](https://github.com/allenai/olmocr) |免费|
336 |
337 | ### 视频内容总结
338 | | 名称 | 说明 | 链接 | 费用 |
339 | | ---- | ----------------------------- | --- | --- |
340 | | ChatGPT for YouTube | Chrome 插件,快速总结 Youtube 视频內容,需要登录chatgpt账号或者apikey | [URL](https://chatgpt4youtube.com/)| 免费 |
341 | | Chat Youtube | 给一个Youtube 链接,它能给出总结,还可以向它提视频內容相关的问题 |[URL](https://chatyoutube.com) | 免费 |
342 | | BibiGPT | 开源项目,音视频内容 AI 一键总结:哔哩哔哩、YouTube、网页、播客、会议、本地文件等| [GitHub](https://github.com/JimmyLv/BibiGPT) |免费|
343 |
344 | ### AI生成模特试装和商品图
345 | | 名称 | 说明 | 链接 | 费用 |
346 | | ---- | ----------------------------- | --- | --- |
347 | |淘宝的万相营造|AI生成图,包括商品图、服饰图、智能试衣、家居图|[URL](https://agi.taobao.com/image/goods)|免费|
348 | |PhotoStudio|虹软PhotoStudio AI智能商拍为商家设置了极为简单便捷的使用流程:上传衣服图/人台图/真人图,选择模特库中的模特和场景,只需3步即可瀑布式产出服装模特商拍大片。|[URL](www.psai.cn)|付费/试用|
349 |
350 | ### 人形机器人
351 | | 名称 | 说明 | 链接 | 费用 |
352 | | ---- | ----------------------------- | --- | --- |
353 | |Figure 03|获得了微软、OpenAI、英伟达和亚马逊等投资方的投资|[URL](https://www.figure.ai/)|
354 | |Altlas|波士顿动力新的电动人形机器人|[URL](https://bostondynamics.com/atlas/)|
355 | |Optimus Gen 2|特斯拉的人形机器人|[URL](https://www.youtube.com/watch?v=cpraXaw7dyc)|
356 | |Apollo|Apptronik公司的人形机器人|[URL](https://apptronik.com/apollo)|
357 | |GR-1|傅利叶公司的人形机器人|[URL](https://fourierintelligence.com/gr1/)|
358 | |Digit|Agility公司的人形机器人|[URL](https://agilityrobotics.com/products/digit)|
359 | |NEO|1x公司的人形机器人
[Neo Gamma家务机器人视频](https://www.bilibili.com/video/BV1a3PMeGE4s/)|[URL](https://www.1x.tech/androids/neo)|
360 | |H1|宇树科技的人形机器人|[URL](https://www.unitree.com/h1/)|
361 | |Phoenix|sanctuary.ai公司的人形机器人|[URL](https://sanctuary.ai/resources/news/sanctuary-ai-unveils-phoenix-a-humanoid-general-purpose-robot-designed-for-work/)|
362 | |MenteeBot|以色列人形机器人公司 Meetee Robotics 发布的首款双足人形机器人|[URL](https://www.menteebot.com/)|
363 |
364 | ## 精选文章
365 | ### chatgpt相关文章
366 | - [Sparks of Artificial General Intelligence:
367 | Early experiments with GPT-4](https://arxiv.org/pdf/2303.12712v1.pdf): 该论文是一篇长达154页的对 GPT-4 的测试。微软的研究院在很早期就接触到了 GPT-4 的非多模态版本,并进行了详尽的测试。这篇论文不管是测试方法还是测试结论都非常精彩,强烈推荐看一遍。
368 | - [《GPT-4 ,通用人工智能的火花》论文内容精选与翻译](https://orangeblog.notion.site/GPT-4-8fc50010291d47efb92cbbd668c8c893): [Sparks of Artificial General Intelligence:
369 | Early experiments with GPT-4](https://arxiv.org/pdf/2303.12712v1.pdf) 这篇论文的精选和中文翻译。
370 |
371 | ## 其他
372 | ### 赞赏支持
373 | 如果您喜欢这个项目,可以赞赏一下支持我们,谢谢您的支持!ღ( ´・ᴗ・` )ღ
374 |
375 |
376 |
377 | ### Star 历史记录
378 |
379 | [](https://star-history.com/#ikaijua/Awesome-AITools&Date)
380 |
381 |
382 |
383 |
384 |
--------------------------------------------------------------------------------