Add CTC recipe to AISHELL-1 by BenoitWang · Pull Request #1576 · speechbrain/speechbrain

BenoitWang · 2022-09-14T18:04:15Z

Hi @mravanelli @TParcollet , this PR adds a typical CTC-wav2vec recipe to AISHELL-1.
Test CER: 5.06%
Dev CER: 4.52%

Some points:

chinese-wav2vec2-large (from Tencent) is used; it is pretrained on 10k hours of Chinese data.
bert-base-chinese is used as the tokenizer, and CTC is trained on characters.
In prepare.py, pandas is not needed to generate the CSV files, so it is removed along with some unused variables.
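
For readers less familiar with character-level CTC, here is a minimal sketch of how per-frame predictions can be turned into a character sequence with greedy decoding: collapse consecutive repeats, then drop the blank symbol. The blank index `0`, the toy vocabulary, and the function name `ctc_greedy_decode` are assumptions for illustration, not taken from the recipe.

```python
from itertools import groupby

BLANK = 0  # assumed blank index; the recipe's actual blank id may differ


def ctc_greedy_decode(frame_ids, blank=BLANK):
    """Collapse consecutive repeated labels, then remove blanks."""
    collapsed = [label for label, _ in groupby(frame_ids)]
    return [label for label in collapsed if label != blank]


# Toy character vocabulary with hypothetical ids.
vocab = {1: "你", 2: "好", 3: "吗"}

# Per-frame argmax ids, as a CTC model might emit them.
frames = [0, 1, 1, 0, 0, 2, 2, 2, 0, 3, 0]

decoded = ctc_greedy_decode(frames)
text = "".join(vocab[i] for i in decoded)
print(text)  # 你好吗
```

Note that a blank between two identical labels keeps them distinct, which is how CTC can emit a doubled character.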

TParcollet · 2022-09-14T21:00:34Z

Huge! Is this comparable to the current SOTA?

BenoitWang · 2022-09-14T22:04:53Z

Hi @TParcollet, I think it's good for a pure-CTC system with greedy decoding and no LM.
Hybrid models from ESPnet get a better CER:

| Model | Test CER | Dev CER | LM |
| --- | --- | --- | --- |
| our ctc-wav2vec | 5.06% | 4.52% | No |
| espnet: branchformer-beam10-ctc0.4 | 4.4% | 4.1% | No |
| espnet: conformer-beam20-ctc0.3 | 4.9% | 4.5% | No |
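
As a reminder of what the CER figures measure, here is a minimal sketch that computes character error rate as the total character-level edit distance divided by the total number of reference characters. The helper names and the two toy sentence pairs are made up for illustration; this is not the scoring code used by either toolkit.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (row-by-row DP)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(refs, hyps):
    """Character error rate in percent over a list of utterances."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total = sum(len(r) for r in refs)
    return 100.0 * errors / total


refs = ["今天天气很好", "你好"]
hyps = ["今天天汽很好", "你好吗"]  # one substitution, one insertion
print(cer(refs, hyps))  # 25.0
```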

TParcollet · 2022-09-14T22:32:59Z

I see, not bad, but we use extra pre-training while they don't, correct?

BenoitWang · 2022-09-14T22:52:24Z

Yes, exactly. Fair enough, but their Branchformer is quite something according to the results.

anautsch

Hi @BenoitWang minor details only.

The YAML file combines the related hparam files well, and the same goes for the train script.

Is the AISHELL-1 prepare script now completely free of the extra pandas dependency for all of its recipes?
(Reducing dependencies is not a bad thing, although pandas is neat. Just asking: pandas is not in the SB requirements, nor is it explicitly stated for AISHELL-1, so it's a good catch.)

BenoitWang · 2022-09-19T14:41:41Z

Hi @anautsch, thanks for the review; the fix is done. And yes, that's why I want to remove pandas: it was only used to generate the CSV files in all the recipes.
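
To illustrate the point about pandas, here is a minimal sketch of writing a data manifest with only the standard-library `csv` module. The column names (`ID`, `duration`, `wav`, `transcript`), the example row, and the function name `write_manifest` are assumptions for illustration and may not match what prepare.py actually writes.

```python
import csv
import io

# Hypothetical manifest columns; the real prepare script may differ.
FIELDS = ["ID", "duration", "wav", "transcript"]


def write_manifest(rows, fileobj):
    """Write a list of dict rows as CSV without needing pandas."""
    writer = csv.DictWriter(fileobj, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


rows = [
    {"ID": "utt_0001", "duration": 4.2,
     "wav": "/path/to/utt_0001.wav", "transcript": "你好"},
]

# An in-memory buffer keeps the example self-contained; for a real file,
# open it with newline="" and encoding="utf-8" instead.
buffer = io.StringIO(newline="")
write_manifest(rows, buffer)
lines = buffer.getvalue().splitlines()
print(lines)
```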

BenoitWang · 2022-10-06T15:27:50Z

Hi @TParcollet @anautsch @Adel-Moumen,

Thank you all for the reviews and tests! The HF link is added, here's a brief summary of the PR:

add a CTC recipe
fix naming problems
fix dynamic batching conflicts for seq2seq & transformer recipes
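
For context on the dynamic batching item, here is a simplified sketch of the general idea: sort utterances by duration and fill each batch until its padded length (number of items times the longest item) would exceed a budget. This is only an illustration under those assumptions, not SpeechBrain's sampler; the function name `dynamic_batches` and the parameter `max_batch_len` are chosen here for the example.

```python
def dynamic_batches(durations, max_batch_len):
    """Group utterance indices so that padded batch length stays in budget.

    Padded length is taken as (items in batch) * (longest item), since
    every item is padded to the longest one in its batch.
    """
    order = sorted(range(len(durations)), key=lambda i: durations[i])
    batches, current = [], []
    for idx in order:
        # Ascending order: the incoming item is the longest in the batch.
        padded_len = durations[idx] * (len(current) + 1)
        if current and padded_len > max_batch_len:
            batches.append(current)
            current = []
        current.append(idx)
    if current:
        batches.append(current)
    return batches


durations = [2.0, 9.0, 3.0, 4.0, 8.0]  # seconds, toy values
print(dynamic_batches(durations, max_batch_len=10.0))
# [[0, 2], [3], [4], [1]]
```

Short utterances end up sharing a batch while long ones are batched alone, so the amount of padding per batch stays bounded.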

anautsch · 2022-10-07T13:55:44Z

LGTM.

Tested the recipes in --debug mode, and the wav2vec2 one with DDP.

Side note: we have an internal issue with --debug and eval checkpointing. It becomes clear when running this transformer wav2vec2 recipe; here's the relevant log:

   asr_brain.evaluate(
  File "speechbrain/core.py", line 1260, in evaluate
    self.on_evaluate_start(max_key=max_key, min_key=min_key)
  File "train_with_wav2vect.py", line 272, in on_evaluate_start
    ckpt = sb.utils.checkpoints.average_checkpoints(
  File "speechbrain/utils/checkpoints.py", line 1174, in average_checkpoints
    return averager(parameter_iterator)
  File "speechbrain/utils/checkpoints.py", line 1080, in average_state_dicts
    raise ValueError("No state dicts to average.")
ValueError: No state dicts to average.
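
To show why an averaging step can fail this way, here is a minimal sketch of parameter averaging over a list of state dicts, with plain floats standing in for tensors. It is not SpeechBrain's implementation; the function name `average_params` is hypothetical, and only the error message is copied from the log above. One plausible reading of the log is that no checkpoints were available to average in --debug mode, so the input was empty.

```python
def average_params(state_dicts):
    """Average parameter values key by key across several state dicts."""
    state_dicts = list(state_dicts)
    if not state_dicts:
        # Nothing to average: mirrors the error message seen in the log.
        raise ValueError("No state dicts to average.")
    n = len(state_dicts)
    keys = state_dicts[0].keys()
    return {key: sum(sd[key] for sd in state_dicts) / n for key in keys}


# Two toy "checkpoints" with scalar parameters.
ckpts = [{"w": 1.0, "b": 0.0}, {"w": 3.0, "b": 1.0}]
print(average_params(ckpts))  # {'w': 2.0, 'b': 0.5}

# With an empty list, the averaging step has nothing to work with.
try:
    average_params([])
except ValueError as err:
    print(err)  # No state dicts to average.
```

A caller could guard against this by skipping the averaging step when the list of checkpoints is empty.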

BenoitWang added 9 commits September 14, 2022 19:11

add ctc recipe

a05c989

add readme

4a38c31

clean prepare

fd720bf

update train_with_wav2vec.yaml

5bd8def

update train_with_wav2vec.yaml

f8a5779

Merge remote-tracking branch 'upstream/develop' into aishell-ctc

b66436b

pre-commit tests

b2ccd9b

add to recipes.csv

4aaf376

consistency tests

1062a06

anautsch suggested changes Sep 19, 2022

Comment thread recipes/AISHELL-1/ASR/CTC/README.md Outdated

Comment thread recipes/AISHELL-1/ASR/CTC/train_with_wav2vec.py Outdated

minor fixes

dd93c85

anautsch approved these changes Sep 19, 2022

TParcollet reviewed Sep 19, 2022

Comment thread recipes/AISHELL-1/ASR/CTC/README.md

TParcollet reviewed Sep 19, 2022

Comment thread recipes/AISHELL-1/ASR/CTC/README.md Outdated

TParcollet reviewed Sep 19, 2022

Comment thread recipes/AISHELL-1/ASR/CTC/prepare.py Outdated

TParcollet reviewed Sep 19, 2022

Comment thread recipes/AISHELL-1/ASR/CTC/train_with_wav2vec.py Outdated

BenoitWang and others added 6 commits September 19, 2022 20:57

fix filenames for all aishell recipes and other fixes

2a2309f

add new files names to recipes.csv

6d82dea

Update train_with_wav2vec.yaml

436387c

add num_buckets and fix Tokenizer readme

1cb1652

Merge branch 'aishell-ctc' of https://github.com/BenoitWang/speechbrain…

8881bc3

… into aishell-ctc

fix/add dynamic batching for all the recipes

04a116e

anautsch approved these changes Sep 26, 2022

BenoitWang added 2 commits October 6, 2022 16:48

add hf link

28ba0b9

fix conflict

72d5203

BenoitWang added 3 commits October 6, 2022 17:09

fix conflict

f0450cd

add to recipes.csv

f15b8aa

fix names

ccb88d3

anautsch merged commit 39f9f39 into speechbrain:develop Oct 7, 2022

anautsch merged 21 commits into speechbrain:develop from BenoitWang:aishell-ctc

3 participants
