This is an attempt to implement a neural network model in Julia. The traditional approach to updating model parameters in neural networks is stochastic gradient descent (SGD). However, SGD is a first-order method with a slow convergence rate, it suffers from the vanishing-gradient problem, and its computations are hard to parallelize. Here, we treat the training process as an optimization problem and propose a method that resembles the alternating direction method of multipliers (ADMM). Variables are set up so that the heavy computations can be parallelized well.
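To make the variable-splitting idea concrete, here is a minimal, self-contained sketch (not the code in this repository) of one possible alternating-update scheme for a single-hidden-layer network with ReLU activation and squared loss. Auxiliary variables `z1`, `a1`, `z2` decouple the layers, so each weight update becomes an independent least-squares problem. All names and hyperparameters (`admm_train`, `beta`, `gamma`, `lambda`) are illustrative assumptions, and the dual (Lagrange multiplier) updates are omitted for brevity, so this is really a penalty-method flavor of the splitting.

```julia
# Illustrative sketch only: ADMM-style variable splitting for a 1-hidden-layer net.
using LinearAlgebra, Random

relu(z) = max.(z, 0)

function admm_train(X, Y; hidden=32, iters=50, beta=1.0, gamma=10.0, lambda=1e-4)
    d, n = size(X)
    k = size(Y, 1)
    rng = MersenneTwister(0)
    W1 = 0.1 .* randn(rng, hidden, d)
    W2 = 0.1 .* randn(rng, k, hidden)
    z1 = W1 * X          # auxiliary pre-activations, layer 1
    a1 = relu(z1)        # auxiliary activations, layer 1
    z2 = W2 * a1         # auxiliary outputs

    for _ in 1:iters
        # Weight updates: ridge least squares, independent per layer (parallelizable).
        W1 = (z1 * X') / (X * X' + lambda * I)
        W2 = (z2 * a1') / (a1 * a1' + lambda * I)

        # Activation update: quadratic in a1, solved in closed form.
        a1 = (beta .* (W2' * W2) + gamma * I) \ (beta .* (W2' * z2) + gamma .* relu(z1))

        # Pre-activation update: elementwise 1-D problem
        #   min_z  gamma*(a - relu(z))^2 + beta*(z - w)^2,  with w the entry of W1*X.
        Wx = W1 * X
        z1 = map(a1, Wx) do a, w
            zpos = max(0, (gamma * a + beta * w) / (gamma + beta))  # branch z >= 0
            zneg = min(0, w)                                        # branch z <= 0
            fpos = gamma * (a - zpos)^2 + beta * (zpos - w)^2
            fneg = gamma * a^2 + beta * (zneg - w)^2
            fpos <= fneg ? zpos : zneg
        end

        # Output update for squared loss: min_z ||z - Y||^2 + beta*||z - W2*a1||^2.
        z2 = (Y .+ beta .* (W2 * a1)) ./ (1 + beta)
    end
    return W1, W2
end
```

Because the two weight updates (and, across layers in a deeper network, all weight updates) depend only on the fixed auxiliary variables, they can be solved simultaneously, which is where the parallelization claimed above comes from.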
Right now, we have coded only a functional version for feed-forward neural networks. Below are some sample images from applying our network to the encoder-decoder problem on the MNIST dataset.