
Least Square Solution Layer (for ELM) #2565

Closed · wants to merge 60 commits

Conversation

@Macbull commented Jun 6, 2015

This pull request is currently in progress. I need help deciding which library should be used.

  1. Adding Extreme Learning Machine (ELM) support to Caffe. ELMs have recently been considered alternatives to deep neural nets because of their fast training and comparable accuracy.
    Reference: http://www.extreme-learning-machines.org
  2. ELM does not need any solver to optimise its parameters.
  3. This implementation can be used to construct ELM auto-encoders easily using just the proto file (no additional layer will be required).

This PR includes two layers:

  1. LSLayer: a least-squares solution layer, to be used by ELM or Online Sequential ELM (see the sketch after this list).
  2. TransposeLayer: makes it possible for two layers to share weights in transposed form (Improve / Fix Weight Sharing #1211). It is a requirement for implementing ELM auto-encoders.
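
For clarity, here is a minimal NumPy sketch of the computation the LSLayer is meant to perform; this is an illustration only, not the PR's C++ code, and the shapes and activation are arbitrary:

```python
# ELM-style training: push inputs through a fixed random hidden layer,
# then fit the output weights with one closed-form least-squares solve
# instead of a gradient-based solver.
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((500, 20))      # inputs: 500 samples, 20 features
T = rng.standard_normal((500, 3))       # targets: 3 outputs

n_hidden = 100
W = rng.standard_normal((20, n_hidden)) # random input weights, never trained
b = rng.standard_normal(n_hidden)

H = np.tanh(X @ W + b)                  # hidden-layer activations
beta, *_ = np.linalg.lstsq(H, T, rcond=None)  # least-squares output weights

Y = H @ beta                            # predictions
print(np.mean((Y - T) ** 2))            # training MSE
```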

Issues left:

  1. Pseudo-inverse and transpose functions are not available in cuBLAS, so we need to choose between different libraries to provide those functions on the GPU.
    Libraries I am considering: CULA or MAGMA. Please suggest! (One possible workaround is sketched after this list.)
  2. Testing of the layers once the libraries are decided.
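
As a point of reference, the explicit pseudo-inverse can be avoided altogether by solving the ridge-regularized normal equations, which need only matrix multiplies and one dense solve. A NumPy sketch (illustrative only; on the GPU the same steps would map onto GEMM plus a dense solver):

```python
# Least-squares output weights without an explicit pseudo-inverse:
# solve (H^T H + lam * I) beta = H^T T, i.e. two GEMMs and one
# symmetric dense solve. NumPy stands in for a GPU BLAS here.
import numpy as np

def ls_solve(H, T, lam=1e-3):
    n = H.shape[1]
    A = H.T @ H + lam * np.eye(n)   # GEMM plus a diagonal shift
    B = H.T @ T                     # GEMM (transpose is just a flag)
    return np.linalg.solve(A, B)    # dense solve, no pinv required

rng = np.random.default_rng(1)
H = rng.standard_normal((200, 50))
T = rng.standard_normal((200, 2))
beta = ls_solve(H, T)
print(beta.shape)                   # (50, 2)
```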

Future plans: implementing other variants of ELM, such as Sparse ELM, Kernelized ELM, etc.

@Macbull Macbull changed the title Elm ELM Jun 6, 2015
@bhack (Contributor) commented Jun 6, 2015

@Macbull (Author) commented Jun 6, 2015

@bhack: I am aware of that discussion. Here is the reply from the inventor of ELM: http://www.ntu.edu.sg/home/egbhuang/pdf/ELM-Rosenblatt-Neumann.pdf

Moreover, whoever is correct, you cannot ignore the good performance of ELMs, or whatever they may be called, and that is what I think is the focus of Caffe. Maybe we can change the name of the method in the comments (the only place where I mentioned ELM).

@lunzueta commented Jun 6, 2015

Following up on bhack's comment, regarding the ELM method itself, also take this into account:
https://www.facebook.com/yann.lecun/posts/10152872571572143

And especially this:
http://theanonymousemail.com/view/?msg=ZHEZJ1AJ

I'm not sure whether this method should be integrated into Caffe yet...

@Macbull (Author) commented Jun 6, 2015

@lunzueta
I was not able to open the first link (the Facebook server is down right now), but I have read the second one, and I don't think it raises any issue with the performance of ELM. As I said earlier, this PR does not add any ELM layer; it adds a least-squares layer, which can be used to find least-squares solutions, as ELM does.

@lunzueta commented Jun 6, 2015

Here are more comments about this discussion, including Yann LeCun's recent remarks about ELM: http://www.reddit.com/r/MachineLearning/comments/34u0go/yann_lecun_whats_so_great_about_extreme_learning/

They also discuss the performance of ELM, among other things. This is quite a recent discussion, so I think we should be careful with this issue before integrating the method into Caffe.

@Macbull (Author) commented Jun 6, 2015

@lunzueta: yes, we should be careful; that's why we are discussing it, right? :D In the comments, they are discussing the performance of ELM, and most agree that it performs well considering its simplicity and low training time. I haven't gone much deeper into those conversations, but I want to raise some points:

  1. Regarding the complaint, read the publication above. It was an invited article and clearly addresses the complaint.
  2. Regarding Yann LeCun's comment: ELM outperforms SVM in most cases and is comparable in others (both in terms of time and accuracy).
  3. I will say it again: I have not implemented ELM. I have implemented a least-squares solution layer, which can be used anywhere you want, perhaps in LS-SVM. I will use it to construct an ELM for my own purposes.

@Macbull (Author) commented Jun 6, 2015

Moreover, the complaint was anonymous, whereas the paper I mentioned in the comments above is published, and I am sure it went through rigorous evaluation before acceptance, given the delicacy of the issue it addresses.

@Macbull Macbull changed the title ELM Least Square Layer (for ELM) Jun 6, 2015
@Macbull Macbull changed the title Least Square Layer (for ELM) Least Square Solution Layer (for ELM) Jun 6, 2015
@Macbull Macbull closed this Jun 8, 2015
@Macbull Macbull reopened this Jun 9, 2015
@zdx3578 commented Dec 30, 2015

A high-performance implementation of Extreme Learning Machines (fast randomized neural networks), with good code and an accompanying paper: https://github.com/akusok/hpelm.
Could a Caffe Python layer reference the hpelm code?
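
For context, basic hpelm usage looks roughly like this; the sketch is adapted from the hpelm README, and the exact API may differ between versions:

```python
# Minimal hpelm example (class and method names taken from the hpelm
# README; treat them as assumptions if your version differs).
import numpy as np
from hpelm import ELM

X = np.random.rand(1000, 10)       # inputs: 1000 samples, 10 features
T = np.random.rand(1000, 3)        # targets: 3 outputs

elm = ELM(X.shape[1], T.shape[1])  # 10 inputs, 3 outputs
elm.add_neurons(20, "sigm")        # 20 sigmoid hidden neurons
elm.train(X, T)                    # closed-form least-squares training
Y = elm.predict(X)
print(Y.shape)                     # (1000, 3)
```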

@akusok commented Jan 2, 2016

Hello guys, I am the author of HPELM. I would be glad to integrate my toolbox with your code if you tell me what you want :-)

About ELM in general:

  1. In performance, ELM = MLP, but training time is 10,000-100,000 times shorter.
  2. The scandal is only about authorship: ELM has started gaining a lot of citations recently, and everybody wants those citations for themselves.
  3. A pseudo-inverse is not computed in practice for ELM, and transpose is available in BLAS as a parameter when multiplying two matrices (see the sketch below). Generally, with fewer than 3000 neurons the CPU is fastest; beyond that, the GPU improves speed.
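
To illustrate point 3: BLAS GEMM computes op(A) @ op(B), where op can transpose either operand, so H^T never needs to be materialized. A small sketch using SciPy's low-level BLAS wrapper (illustrative only):

```python
# H^T @ T via the trans_a flag of dgemm: no explicit transposed copy.
import numpy as np
from scipy.linalg.blas import dgemm

rng = np.random.default_rng(2)
H = rng.standard_normal((200, 50))   # hidden activations
T = rng.standard_normal((200, 3))    # targets

HtT = dgemm(alpha=1.0, a=H, b=T, trans_a=True)  # computes H.T @ T
assert np.allclose(HtT, H.T @ T)
```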

@ducha-aiki (Contributor) commented

@akusok

> In performance, ELM = MLP, but training time is 10,000-100,000 times shorter.

What about SOTA on CIFAR-10/100? :)

@shelhamer (Member) commented

Closing as out-of-scope.

@shelhamer shelhamer closed this Apr 14, 2017