Wesep

We aim to build a toolkit focusing on front-end processing in the cocktail party setup, including target speaker extraction and ~~speech separation (Future work)~~.

Install for development & deployment

Clone this repo

git clone https://github.com/wenet-e2e/wesep.git
cd wesep

Create a conda environment (PyTorch >= 1.12.0 is required)

conda create -n wesep python=3.9
conda activate wesep
conda install pytorch=1.12.1 torchaudio=0.12.1 cudatoolkit=11.3 -c pytorch -c conda-forge
pip install -r requirements.txt
pre-commit install  # for clean and tidy code
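After installation, it can be handy to confirm that the environment meets the PyTorch >= 1.12.0 requirement stated above. The following is a minimal sketch, not part of WeSep: the helper names `parse_version`, `meets_minimum` and `check_torch_installed` are hypothetical, and it assumes the installed distribution is registered under the name `torch`.

```python
from importlib import metadata
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a version such as '1.12.1+cu113' into (1, 12, 1).

    Local build tags after '+' and non-numeric suffixes are ignored,
    and the result is padded with zeros to at least three fields.
    """
    core = text.split("+", 1)[0]
    parts = []
    for piece in core.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(installed: str, minimum: str = "1.12.0") -> bool:
    """Return True if `installed` is at least `minimum`."""
    return parse_version(installed) >= parse_version(minimum)


def check_torch_installed(minimum: str = "1.12.0") -> bool:
    """Return True only if a `torch` distribution is found and is new enough."""
    try:
        installed = metadata.version("torch")  # assumed distribution name
    except metadata.PackageNotFoundError:
        return False
    return meets_minimum(installed, minimum)


if __name__ == "__main__":
    print("PyTorch requirement satisfied:", check_torch_installed())
```

Running the script inside the activated `wesep` environment prints whether a suitable PyTorch was found; it only reads package metadata and does not import `torch`.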

The Target Speaker Extraction Task

Target speaker extraction (TSE) focuses on isolating the speech of a specific target speaker from overlapping multi-talker speech, which is a typical setup in the cocktail party problem. WeSep features flexible target speaker modeling, scalable data management, effective on-the-fly data simulation, structured recipes and deployment support.
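To make the idea of on-the-fly simulation concrete, the sketch below mixes a target utterance with an interfering one at a chosen signal-to-noise ratio. It illustrates the general technique only and is not WeSep's implementation: the function name `mix_at_snr` is hypothetical, and signals are plain Python lists of floats to keep the example dependency-free.

```python
import math
from typing import List, Sequence


def mix_at_snr(target: Sequence[float],
               interferer: Sequence[float],
               snr_db: float) -> List[float]:
    """Scale `interferer` so the target-to-interferer power ratio is `snr_db`,
    then add it to `target`. Both signals are truncated to the shorter length.
    """
    n = min(len(target), len(interferer))
    if n == 0:
        return []
    tgt = list(target[:n])
    itf = list(interferer[:n])
    power_tgt = sum(x * x for x in tgt) / n
    power_itf = sum(x * x for x in itf) / n
    if power_itf == 0.0:
        # A silent interferer cannot be scaled; return the target unchanged.
        return tgt
    # Want power_tgt / (gain**2 * power_itf) == 10 ** (snr_db / 10).
    gain = math.sqrt(power_tgt / (power_itf * 10.0 ** (snr_db / 10.0)))
    return [a + gain * b for a, b in zip(tgt, itf)]


if __name__ == "__main__":
    target = [1.0, -1.0, 1.0, -1.0]
    interferer = [2.0, 2.0, -2.0, -2.0]
    print(mix_at_snr(target, interferer, snr_db=0.0))
```

In a TSE training setup, a mixture produced this way would be paired with an enrollment utterance of the target speaker, and the unmixed target signal would serve as the training reference.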

Features (To Do List)

Data Pipe Design

Following WeNet and WeSpeaker, WeSep organizes its data processing modules as a pipeline of processors. The following figure shows such a pipeline with the essential processors.
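The processor-pipeline idea can be sketched with plain Python generators, where each processor consumes a stream of samples and yields a transformed stream. This is a minimal illustration under stated assumptions, not WeSep's actual API: the processor names (`filter_short`, `peak_normalize`, `batch`), the `build_pipeline` helper and the sample layout (a dict with `key` and `wav` fields) are all hypothetical.

```python
from typing import Any, Callable, Dict, Iterable, Iterator, List, Tuple

Sample = Dict[str, Any]
Stage = Tuple[Callable[..., Iterator[Any]], Dict[str, Any]]


def filter_short(samples: Iterable[Sample], min_len: int = 2) -> Iterator[Sample]:
    """Drop utterances with fewer than `min_len` waveform samples."""
    for s in samples:
        if len(s["wav"]) >= min_len:
            yield s


def peak_normalize(samples: Iterable[Sample]) -> Iterator[Sample]:
    """Scale each waveform so its largest absolute value is 1."""
    for s in samples:
        peak = max((abs(x) for x in s["wav"]), default=0.0) or 1.0
        yield {**s, "wav": [x / peak for x in s["wav"]]}


def batch(samples: Iterable[Sample], batch_size: int = 2) -> Iterator[List[Sample]]:
    """Group consecutive samples into lists of at most `batch_size`."""
    buf: List[Sample] = []
    for s in samples:
        buf.append(s)
        if len(buf) == batch_size:
            yield buf
            buf = []
    if buf:
        yield buf


def build_pipeline(source: Iterable[Sample], *stages: Stage) -> Iterator[Any]:
    """Chain processors lazily: each stage wraps the stream from the previous one."""
    stream: Iterator[Any] = iter(source)
    for fn, kwargs in stages:
        stream = fn(stream, **kwargs)
    return stream


if __name__ == "__main__":
    utterances = [
        {"key": "a", "wav": [0.5, -1.0, 0.25]},
        {"key": "b", "wav": [0.1]},
        {"key": "c", "wav": [2.0, -4.0]},
    ]
    pipeline = build_pipeline(
        utterances,
        (filter_short, {"min_len": 2}),
        (peak_normalize, {}),
        (batch, {"batch_size": 2}),
    )
    for group in pipeline:
        print([s["key"] for s in group])
```

Because every stage is a generator, nothing is computed until the training loop pulls a batch, so stages can be added, removed or reordered without touching the others.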

Discussion

Chinese users can scan the QR code on the left to join our group directly. If it has expired, please scan the personal WeChat QR code on the right.

Citations

If you find WeSep useful, please cite it as:

@inproceedings{wang24fa_interspeech,
  title     = {WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction},
  author    = {Shuai Wang and Ke Zhang and Shaoxiong Lin and Junjie Li and Xuefei Wang and Meng Ge and Jianwei Yu and Yanmin Qian and Haizhou Li},
  year      = {2024},
  booktitle = {Interspeech 2024},
  pages     = {4273--4277},
  doi       = {10.21437/Interspeech.2024-1840},
}
