Skip to content

Add DCA paper on skip connections#2

Open
WhymustIhaveaname wants to merge 1 commit intoharsh306:masterfrom
WhymustIhaveaname:add-dca-paper
Open

Add DCA paper on skip connections#2
WhymustIhaveaname wants to merge 1 commit intoharsh306:masterfrom
WhymustIhaveaname:add-dca-paper

Conversation

@WhymustIhaveaname
Copy link
Copy Markdown

We show that running DCA on a plain network without shortcuts gives you the same gradient updates as SGD on a ResNet. Also turns out SGD and PPA are both special cases of DCA. arXiv: 2412.09853

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant