448 | 📣 Generate Audio (experimental)
449 |
450 |
451 | Converts text to speech and saves the output as an audio file.
452 |
453 | #### Required Inputs
454 |
455 | #### `text`
456 | The text to be converted into speech.
457 | #### `filename_prefix`
458 | Prefix for naming the audio file. (relative to .\ComfyUI\output)
459 |
460 | This node uses a text-to-speech pipeline to convert input text into spoken words, saving the result as a WAV file. The generated audio file is named using the provided filename prefix and is stored relative to the .\ComfyUI-Mana-Nodes directory.
461 |
462 | Model: [https://huggingface.co/spaces/suno/bark](https://huggingface.co/suno/bark)
463 |
464 | #### Foreign Language
465 |
466 | Bark supports various languages out-of-the-box and automatically determines language from input text. When prompted with code-switched text, Bark will even attempt to employ the native accent for the respective languages in the same voice.
467 |
468 | Example:
469 | Buenos días Miguel. Tu colega piensa que tu alemán es extremadamente malo. But I suppose your english isn't terrible.
470 |
471 | #### Non-Speech Sounds
472 |
473 | Below is a list of some known non-speech sounds, but we are finding more every day.
474 |
475 | [laughter]
476 | [laughs]
477 | [sighs]
478 | [music]
479 | [gasps]
480 | [clears throat]
481 | — or … for hesitations
482 | ♪ for song lyrics
483 | capitalization for emphasis of a word
484 | MAN/WOMAN: for bias towards speaker
485 |
486 |
487 | Example:
488 | " [clears throat] Hello, my name is Suno. And, uh — and I like pizza. [laughs] But I also have other interests such as... ♪ singing ♪."
489 |
490 | #### Music
491 |
492 | Bark can generate all types of audio, and, in principle, doesn’t see a difference between speech and music. Sometimes Bark chooses to generate text as music, but you can help it out by adding music notes around your lyrics.
493 |
494 | Example:
495 | ♪ In the jungle, the mighty jungle, the lion barks tonight ♪
496 |
497 | #### Speaker Prompts
498 |
499 | You can provide certain speaker prompts such as NARRATOR, MAN, WOMAN, etc. Please note that these are not always respected, especially if a conflicting audio history prompt is given.
500 |
501 | Example:
502 | WOMAN: I would like an oatmilk latte please.
503 | MAN: Wow, that's expensive!
504 |
505 |
506 |
507 |
508 |