├── .github └── ISSUE_TEMPLATE │ ├── ask_questions.md │ ├── bug_report.md │ ├── config.yaml │ └── error_docs.md ├── .gitignore ├── LICENSE ├── README.md ├── README_ja.md ├── README_zh.md ├── api.py ├── data ├── train_example.jsonl └── val_example.jsonl ├── deepspeed_conf └── ds_stage1.json ├── demo1.py ├── demo2.py ├── demo_libtorch.py ├── demo_onnx.py ├── export.py ├── export_meta.py ├── finetune.sh ├── image ├── aed_figure.png ├── asr_results.png ├── asr_results1.png ├── asr_results2.png ├── dingding_funasr.png ├── dingding_sv.png ├── inference.png ├── sensevoice.png ├── sensevoice2.png ├── ser_figure.png ├── ser_table.png ├── webui.png └── wechat.png ├── model.py ├── requirements.txt ├── utils ├── __init__.py ├── ctc_alignment.py ├── export_utils.py ├── frontend.py ├── infer_utils.py └── model_bin.py └── webui.py /.github/ISSUE_TEMPLATE/ask_questions.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/.github/ISSUE_TEMPLATE/ask_questions.md -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE/bug_report.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/.github/ISSUE_TEMPLATE/bug_report.md -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE/config.yaml: -------------------------------------------------------------------------------- 1 | blank_issues_enabled: false -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE/error_docs.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/.github/ISSUE_TEMPLATE/error_docs.md -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/.gitignore -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | Ref to https://github.com/modelscope/FunASR?tab=readme-ov-file#license 2 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/README.md -------------------------------------------------------------------------------- /README_ja.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/README_ja.md -------------------------------------------------------------------------------- /README_zh.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/README_zh.md -------------------------------------------------------------------------------- /api.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/api.py -------------------------------------------------------------------------------- /data/train_example.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/data/train_example.jsonl -------------------------------------------------------------------------------- /data/val_example.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/data/val_example.jsonl -------------------------------------------------------------------------------- /deepspeed_conf/ds_stage1.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/deepspeed_conf/ds_stage1.json -------------------------------------------------------------------------------- /demo1.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/demo1.py -------------------------------------------------------------------------------- /demo2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/demo2.py -------------------------------------------------------------------------------- /demo_libtorch.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/demo_libtorch.py -------------------------------------------------------------------------------- /demo_onnx.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/demo_onnx.py -------------------------------------------------------------------------------- /export.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/export.py -------------------------------------------------------------------------------- /export_meta.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/export_meta.py -------------------------------------------------------------------------------- /finetune.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/finetune.sh -------------------------------------------------------------------------------- /image/aed_figure.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/aed_figure.png -------------------------------------------------------------------------------- /image/asr_results.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/asr_results.png -------------------------------------------------------------------------------- /image/asr_results1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/asr_results1.png -------------------------------------------------------------------------------- /image/asr_results2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/asr_results2.png -------------------------------------------------------------------------------- /image/dingding_funasr.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/dingding_funasr.png -------------------------------------------------------------------------------- /image/dingding_sv.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/dingding_sv.png -------------------------------------------------------------------------------- /image/inference.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/inference.png -------------------------------------------------------------------------------- /image/sensevoice.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/sensevoice.png -------------------------------------------------------------------------------- /image/sensevoice2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/sensevoice2.png -------------------------------------------------------------------------------- /image/ser_figure.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/ser_figure.png -------------------------------------------------------------------------------- /image/ser_table.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/ser_table.png -------------------------------------------------------------------------------- /image/webui.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/webui.png -------------------------------------------------------------------------------- /image/wechat.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/image/wechat.png -------------------------------------------------------------------------------- /model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/model.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/requirements.txt -------------------------------------------------------------------------------- /utils/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /utils/ctc_alignment.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/utils/ctc_alignment.py -------------------------------------------------------------------------------- /utils/export_utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/utils/export_utils.py -------------------------------------------------------------------------------- /utils/frontend.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/utils/frontend.py -------------------------------------------------------------------------------- /utils/infer_utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/utils/infer_utils.py -------------------------------------------------------------------------------- /utils/model_bin.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/utils/model_bin.py -------------------------------------------------------------------------------- /webui.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/FunAudioLLM/SenseVoice/HEAD/webui.py --------------------------------------------------------------------------------