Multi-Window Multi-Head Attention implementation for ASR transformer #2675
NikolaiKyhne wants to merge 27 commits into speechbrain:develop from
Conversation
Hey guys! Hope you are doing great --- this is a very nice PR! I have turned this PR into a draft for now; please mark it ready when you think it is ready to be reviewed. You can ping me as well so that I can have a closer look as soon as possible :) Thanks for your contribution :) Best,
Hey @Adel-Moumen! Thanks for your comment. We have now finished the draft and marked it ready for review :) Best,
## Transformer

| Language | CV version | hyperparams file | LM | Val. CER | Val. WER | Test CER | Test WER | Hugging Face link | Model link | GPUs |
| -------- |:----------:|:----------------:|:--:|:--------:|:--------:|:--------:|:--------:|:-----------------:|:----------:|:----:|
| English | 16.1 | mwmha_transformer_large.yaml | No | 4.72 | 10.97 | 6.68 | 13.69 | - | [model](https://1drv.ms/f/c/039f8ffe91e06416/Et7KEbSlWNdJhkjLIi7_vGQBMVhGwRRBzCSljh6aA4sJSw?e=dXeuiY) | 1xL40 48GB |
Why is the Val WER so high? I think you swapped CER and WER, right?
No, that's right, I just double checked; it is the same for Conformer English on CV 16.1 :)
@Adel-Moumen Val WER for MWMHA (10.97) follows the same trend and is quite close to that of the Conformer model (10.48), so it is reported correctly; CER and WER are not swapped.
We've been waiting for a review for some time now. Any chance you can take a look at it soon? :)
Added a Multi-Window Multi-Head Attention (MWMHA) module for Transformer ASR (https://openreview.net/forum?id=Q53QLftNkA).
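For intuition only, here is a minimal sketch of the multi-window idea: a self-attention layer in which each head is restricted to its own local window size (with one head left global). This is an illustrative assumption, not the PR's implementation or the paper's exact formulation; the class name, window sizes, and dimensions are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class WindowedMultiHeadAttention(nn.Module):
    """Illustrative multi-window self-attention: each head attends within
    its own local window (hypothetical sketch, not the actual PR code)."""

    def __init__(self, d_model=256, window_sizes=(4, 16, 64, None)):
        super().__init__()
        self.num_heads = len(window_sizes)
        assert d_model % self.num_heads == 0
        self.d_head = d_model // self.num_heads
        self.window_sizes = window_sizes  # None = full (global) attention
        self.qkv = nn.Linear(d_model, 3 * d_model)
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x):
        B, T, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)

        # Reshape to (B, heads, T, d_head)
        def split(t):
            return t.view(B, T, self.num_heads, self.d_head).transpose(1, 2)

        q, k, v = split(q), split(k), split(v)
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5  # (B, H, T, T)

        # Per-head band mask: head h only sees frames within its window size.
        idx = torch.arange(T, device=x.device)
        dist = (idx[None, :] - idx[:, None]).abs()  # (T, T) frame distances
        masks = []
        for w in self.window_sizes:
            if w is None:
                masks.append(torch.zeros(T, T, dtype=torch.bool, device=x.device))
            else:
                masks.append(dist > w)
        mask = torch.stack(masks)  # (H, T, T)
        scores = scores.masked_fill(mask[None], float("-inf"))

        attn = F.softmax(scores, dim=-1)
        ctx = attn @ v                              # (B, H, T, d_head)
        ctx = ctx.transpose(1, 2).reshape(B, T, -1)  # back to (B, T, d_model)
        return self.out(ctx)


if __name__ == "__main__":
    x = torch.randn(2, 50, 256)
    layer = WindowedMultiHeadAttention()
    print(layer(x).shape)  # torch.Size([2, 50, 256])
```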
In general, this contribution adds: