Skip to content

Neural Rescoring#1986

Closed
Adel-Moumen wants to merge 35 commits into
speechbrain:unstable-v0.6from
Adel-Moumen:neural-rescoring
Closed

Neural Rescoring#1986
Adel-Moumen wants to merge 35 commits into
speechbrain:unstable-v0.6from
Adel-Moumen:neural-rescoring

Conversation

@Adel-Moumen

@Adel-Moumen Adel-Moumen commented May 22, 2023

Copy link
Copy Markdown
Collaborator

This PR implements neural rescoring which can ranks the top-K candidates produced by the beam search decoding.

The interface lets you use any LMs that you wants. For instance, in general when someone wants to train an ASR system he has to make sure that the tokens used by the system are the same than the pretained LM. However, in this PR, you can take a BPE LM and use it with a CHAR system. No need to match the same granularity!

Todo:

  • add beta/alpha parameters
  • update PR with results
  • documentation
  • TransformerLMRescorer
  • RNNLMRescorer
  • add an example with a GPT2 model (AutoModelForCausalLM)

20 beam size, alpha = 0.6 and beta = 0.25
7.30% test clean with TransformerLM from our sb hub
8.41% test clean with GPT2-medium
9.38% test clean with no lm

Note 1: this PR do not leverage #1596 as it is unnecessary
Note 2: this PR should be merged after the PR on pyctcdecode / CUDACTC / framectc so that I can integrate the n-best rescoring for them.

@Adel-Moumen Adel-Moumen changed the base branch from ctc-prefix-beamsearch to develop July 4, 2023 10:11
@Adel-Moumen Adel-Moumen changed the base branch from develop to ctc-prefix-beamsearch July 4, 2023 10:11
@Adel-Moumen Adel-Moumen changed the base branch from ctc-prefix-beamsearch to unstable-v0.6 July 4, 2023 11:37
@Adel-Moumen Adel-Moumen changed the base branch from unstable-v0.6 to ctc-prefix-beamsearch July 4, 2023 11:37
@Adel-Moumen Adel-Moumen changed the base branch from ctc-prefix-beamsearch to unstable-v0.6 July 4, 2023 11:38
@Adel-Moumen Adel-Moumen changed the base branch from unstable-v0.6 to ctc-prefix-beamsearch July 4, 2023 11:38
@Adel-Moumen Adel-Moumen mentioned this pull request Jul 6, 2023
@mravanelli mravanelli deleted the branch speechbrain:unstable-v0.6 July 31, 2023 18:25
@mravanelli mravanelli closed this Jul 31, 2023
@Adel-Moumen Adel-Moumen reopened this Aug 1, 2023
@Adel-Moumen Adel-Moumen changed the base branch from ctc-prefix-beamsearch to unstable-v0.6 August 2, 2023 10:02
@mravanelli mravanelli self-requested a review August 8, 2023 14:12
@mravanelli mravanelli added the enhancement New feature or request label Aug 8, 2023
@Adel-Moumen

Copy link
Copy Markdown
Collaborator Author

Closing this PR in favour of #2187

@Adel-Moumen Adel-Moumen closed this Oct 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants