
Feed forward layer, frontend and encoder #53

Merged: 12 commits into main from mann_ffnn_encoder, May 24, 2024

Conversation

DanEnergetics (Contributor):

Implementation of a simple feed-forward encoder that serves to generate good alignments in full-sum HMM training. In addition to the layers, it implements a simple convolutional front-end that acts as a feed-forward layer over a window of feature time frames.
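
For illustration, a minimal sketch of what such a window-based convolutional front-end can look like, assuming a Conv1d over the time axis whose kernel spans the window (class and parameter names here are placeholders, not necessarily those used in this PR):

import torch
from torch import nn


class WindowConvFrontend(nn.Module):
    """Feed-forward layer over a window of feature frames, realized as a 1D convolution over time."""

    def __init__(self, in_features: int, out_features: int, window_size: int, stride: int = 1):
        super().__init__()
        # "same"-style padding so that with stride=1 the time length is preserved
        padding = (window_size - 1) // 2
        self.conv = nn.Conv1d(in_features, out_features, kernel_size=window_size, stride=stride, padding=padding)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: [batch, time, features]; Conv1d convolves over the last dim, so move time there
        x = x.transpose(1, 2)      # [batch, features, time]
        x = self.conv(x)           # each output frame mixes a window of input frames
        return x.transpose(1, 2)   # back to [batch, time', features]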

@DanEnergetics marked this pull request as draft May 14, 2024 12:53
@DanEnergetics marked this pull request as ready for review May 14, 2024 13:10
i6_models/parts/ffnn.py (review thread, outdated, resolved)
i6_models/parts/frontend/window_ffnn.py (review thread, outdated, resolved)
i6_models/parts/frontend/window_ffnn.py (review thread, outdated, resolved)
x = x.transpose(1, 2) # torch 1d convolution is over last dim but we want time conv
x = self.conv(x).transpose(1, 2)

# these settings apparently apply stride correctly to the masking whatever the kernel size
Contributor:

Suggested change:
- # these settings apparently apply stride correctly to the masking whatever the kernel size
+ # change masking according to stride value

do be confident about the implementation of others xP

DanEnergetics (Contributor, Author):

Actually I was not confident in my own implementation here. I had to choose kernel_size = 1 and padding = 0 to achieve correct masking, which is not how this function is supposed to be used I think.
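
(For context, a small sketch of the standard Conv1d length arithmetic that such masking helpers typically have to reproduce; this is the generic PyTorch formula, not the i6_models function discussed here:)

def conv1d_out_length(t: int, kernel_size: int, stride: int, padding: int, dilation: int = 1) -> int:
    # output length of torch.nn.Conv1d for an input of length t
    return (t + 2 * padding - dilation * (kernel_size - 1) - 1) // stride + 1

# kernel_size=1, padding=0 reduces to ceil(t / stride), e.g.:
assert conv1d_out_length(10, kernel_size=1, stride=3, padding=0) == 4
# an odd kernel with padding=(kernel_size - 1)//2 yields the same lengths,
assert conv1d_out_length(10, kernel_size=5, stride=3, padding=2) == 4
# so a mismatch presumably comes from how the mask itself is pooled, not from this length formula.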

Contributor:

And with kernel_size = cfg.window_size and padding = get_same_padding(cfg.window_size) (i.e. the values given to Conv1d) it does not work?

DanEnergetics (Contributor, Author):

Weirdly enough, yes. I came up with a failing test case in a new branch:

def test_masking():
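
(The test itself is cut off in this view. Purely as an illustration of what such a masking test might check, here is a sketch using the hypothetical WindowConvFrontend from above; the actual assertions live in the linked branch:)

import torch


def test_masking_sketch():
    # two sequences of different lengths, downsampled with window_size=3, stride=2
    batch, time, feat = 2, 10, 5
    seq_lens = torch.tensor([10, 7])
    x = torch.randn(batch, time, feat)

    frontend = WindowConvFrontend(in_features=feat, out_features=8, window_size=3, stride=2)
    out = frontend(x)

    # with "same"-style padding and stride s, the downsampled length should be ceil(len / s)
    expected_lens = (seq_lens - 1) // 2 + 1  # -> [5, 4]
    assert out.shape[1] == int(expected_lens.max())
    # a full test would also compare the downsampled sequence mask against expected_lens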

i6_models/parts/frontend/window_ffnn.py (review thread, outdated, resolved)
@Atticus1806 (Contributor) left a comment:

Only small comments regarding documentation; otherwise looks good to me.

i6_models/parts/ffnn.py (review thread, outdated, resolved)
i6_models/parts/ffnn.py (review thread, outdated, resolved)
i6_models/parts/frontend/window_ffnn.py (review thread, outdated, resolved)
i6_models/parts/frontend/window_ffnn.py (review thread, outdated, resolved)
@Atticus1806 (Contributor) left a comment:

Took the liberty to commit one last change :)

@DanEnergetics merged commit 83ff39e into main on May 24, 2024
2 checks passed
@DanEnergetics deleted the mann_ffnn_encoder branch May 24, 2024 10:23