Can we use the neural tangent kernel (NTK) [1] to get the benefits of new neural-net architectures while still using budgeted kernel machines? See also the paper on the path kernel [2].
The kernel requires the dot product of the network's parameter gradients evaluated at two different inputs. Perhaps this dot product can be computed efficiently?
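Concretely, the empirical NTK of a network f with parameters theta is k(x, x') = grad_theta f(theta, x) . grad_theta f(theta, x'). A minimal sketch in JAX, assuming a toy two-layer MLP (the architecture, sizes, and names are placeholders, not taken from [1] or [2]):

```python
# Sketch: empirical neural tangent kernel
#   k(x, x') = <grad_theta f(theta, x), grad_theta f(theta, x')>
# The MLP below is a stand-in; any differentiable model would do.
import jax
import jax.numpy as jnp
from jax.flatten_util import ravel_pytree

def mlp(params, x):
    """Scalar-output two-layer MLP (placeholder architecture)."""
    h = jnp.tanh(params["W1"] @ x + params["b1"])
    return jnp.dot(params["w2"], h) + params["b2"]

def empirical_ntk(params, x1, x2):
    """Dot product of parameter gradients at two inputs."""
    g1, _ = ravel_pytree(jax.grad(mlp)(params, x1))
    g2, _ = ravel_pytree(jax.grad(mlp)(params, x2))
    return jnp.dot(g1, g2)

key = jax.random.PRNGKey(0)
k1, k2, k3, k4 = jax.random.split(key, 4)
d, h = 8, 16  # arbitrary input and hidden sizes
params = {
    "W1": jax.random.normal(k1, (h, d)) / jnp.sqrt(d),
    "b1": jnp.zeros(h),
    "w2": jax.random.normal(k2, (h,)) / jnp.sqrt(h),
    "b2": jnp.zeros(()),
}
x1 = jax.random.normal(k3, (d,))
x2 = jax.random.normal(k4, (d,))
print(empirical_ntk(params, x1, x2))
```

On the efficiency question: the sketch above materializes the full per-parameter gradient for each input, which is wasteful for large models; one presumably could instead express the same dot product as a composition of forward-mode (JVP) and reverse-mode (VJP) products so the full gradient vectors never need to be stored.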
[1] Jacot, Arthur, Franck Gabriel, and Clément Hongler. "Neural tangent kernel: Convergence and generalization in neural networks." arXiv preprint arXiv:1806.07572 (2018).
[2] Domingos, Pedro. "Every Model Learned by Gradient Descent Is Approximately a Kernel Machine." arXiv preprint arXiv:2012.00152 (2020).