Part II - Toward NNGP and NTK

Neural Tangent Kernel(NTK) “In short, NTK represent the changes of the weights before and after the gradient descent update” Let’s start the journey of revealing the black-box neural networks. Setup a Neural Network First of all, we need to define a simple neural network with 2 hidden layers y(x,w) where y is the neural network with weights wRm and, x,y¯N is the dataset which is a set of the input data and the output data with N data points....

February 19, 2021 · 10 min · SY Chou

Part I - Toward NNGP and NTK
  [draft]

Neural Network Gaussian Process(NNGP) Model the neural network as GP, aka neural network Gaussian Process(NNGP). Intuitively, the kernel of NNGP compute the distance between the output vectors of 2 input data points. We define the following functions as neural networks with fully-conntected layers: zi1(x)=bi1+j=1N1 Wij1xj1(x),  xj1(x)=ϕ(bi0+k=1din Wik0xk(x)) where bi1 is the ith-bias of the second layer(the same as first hidden layer), Wij1 is the ith-weights of the first layer(the same as input layer) , function ϕ is the activation function, and x is the input data of the neural network....

March 15, 2021 · 1 min · SY Chou