Use of default clip markers as [0, 1, 1, ..., 1] #14

Closed
aurotripathy opened this issue Jul 25, 2016 · 7 comments

aurotripathy commented Jul 25, 2016

This seems like the right place to get answers to Caffe LSTM questions :-), so I'm counting on an answer.

I'm comparing the LSTM layer implementation here with the officially merged one in Caffe. They are different.

Are they conceptually the same with respect to the clip_marker implementation?

My question is: if the sequence lengths in the input are all the same (i.e., they don't vary) and they match the number of time steps, do we still need to provide the clip_marker input (in the official Caffe version)?

Can the network assume it to be [0, 1, 1, ..., 1]?

I ask because I'm debugging the network, and my own markers may be in error and confusing it.

Thank you.

@junhyukoh (Owner)

Yes.
My code assumes that the input batch consists of complete sequences (from end to end).
If this is the case, you don't have to provide clip markers.
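
For concreteness, here is a minimal standalone C++ sketch of the pattern this implies (not Caffe code; make_default_clip_markers and the shapes are made up for illustration): one indicator per timestep per stream, 0 at the first timestep and 1 afterwards, flattened in the time-major T x N layout that Caffe's recurrent layers use.

#include <iostream>
#include <vector>

// Hypothetical helper: the default clip markers [0, 1, 1, ..., 1] for
// T timesteps and N parallel streams, flattened time-major (T x N).
std::vector<float> make_default_clip_markers(int T, int N) {
  std::vector<float> cont(T * N, 1.0f);
  for (int n = 0; n < N; ++n)
    cont[n] = 0.0f;  // t = 0 row: every sequence starts here
  return cont;
}

int main() {
  const int T = 4, N = 2;
  const std::vector<float> cont = make_default_clip_markers(T, N);
  for (int t = 0; t < T; ++t) {
    for (int n = 0; n < N; ++n)
      std::cout << cont[t * N + n] << ' ';
    std::cout << '\n';  // prints: 0 0 / 1 1 / 1 1 / 1 1
  }
  return 0;
}

In the degenerate case N = 1, this is exactly the [0, 1, 1, ..., 1] vector asked about above.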


aurotripathy commented Jul 27, 2016

Thank you. What about the official Caffe LSTM implementation (BVLC/caffe#2033)?
I'm asking here because it's unlikely I will get a response there.

@junhyukoh (Owner)

As far as I know, they have the same protocol.


aurotripathy commented Jul 27, 2016

Thank you. One last question.

The Caffe code below is the LSTM unit layer implementation. I'm unable to determine whether the cont variable has a default value of zero or whether it must always be supplied as a bottom. Can you please help?

// Forward pass for one LSTM timestep. Bottoms: previous cell state C_prev
// (bottom[0]), gate pre-activations X (bottom[1]), and the sequence
// continuation indicator cont (bottom[2]) -- one value per stream.
template <typename Dtype>
void LSTMUnitLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const int num = bottom[0]->shape(1);
  const int x_dim = hidden_dim_ * 4;  // X packs four gates: i, f, o, g
  const Dtype* C_prev = bottom[0]->cpu_data();
  const Dtype* X = bottom[1]->cpu_data();
  const Dtype* cont = bottom[2]->cpu_data();
  Dtype* C = top[0]->mutable_cpu_data();
  Dtype* H = top[1]->mutable_cpu_data();
  for (int n = 0; n < num; ++n) {
    for (int d = 0; d < hidden_dim_; ++d) {
      const Dtype i = sigmoid(X[d]);  // input gate
      // Forget gate: forced to 0 when cont == 0 (a sequence boundary),
      // which discards the previous cell state.
      const Dtype f = (*cont == 0) ? 0 :
          (*cont * sigmoid(X[1 * hidden_dim_ + d]));
      const Dtype o = sigmoid(X[2 * hidden_dim_ + d]);  // output gate
      const Dtype g = tanh(X[3 * hidden_dim_ + d]);     // candidate values
      const Dtype c_prev = C_prev[d];
      const Dtype c = f * c_prev + i * g;  // new cell state
      C[d] = c;
      const Dtype tanh_c = tanh(c);
      H[d] = o * tanh_c;  // new hidden state
    }
    // Advance to the next stream in the batch.
    C_prev += hidden_dim_;
    X += x_dim;
    C += hidden_dim_;
    H += hidden_dim_;
    ++cont;
  }
}
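
To see what cont does in the update above, here is a small standalone sketch (not Caffe code; the input values are made up) of the per-element cell update, evaluated once with cont = 1 and once with cont = 0. With cont = 0 the forget gate f is forced to 0, so c_prev is discarded and the new cell state reduces to i * g.

#include <cmath>
#include <iostream>

// Standalone mirror of the per-element cell update from LSTMUnitLayer.
// x[0], x[1], x[3] are the pre-activations of the i, f, g gates
// (the output gate o only affects H, not C, so it is omitted here).
float sigmoid(float v) { return 1.0f / (1.0f + std::exp(-v)); }

float cell_update(float c_prev, const float x[4], float cont) {
  const float i = sigmoid(x[0]);
  const float f = (cont == 0.0f) ? 0.0f : cont * sigmoid(x[1]);
  const float g = std::tanh(x[3]);
  return f * c_prev + i * g;
}

int main() {
  const float x[4] = {0.5f, 0.5f, 0.5f, 0.5f};
  const float c_prev = 2.0f;
  std::cout << cell_update(c_prev, x, 1.0f) << '\n';  // ~1.533: carries c_prev
  std::cout << cell_update(c_prev, x, 0.0f) << '\n';  // ~0.288: i * g only
  return 0;
}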

@junhyukoh (Owner)

It seems like there is no default value in this code unless they provide a virtual bottom[2].

@aurotripathy (Author)

OK, thank you very much.

From BVLC/caffe#2033, it appears that providing the clip_markers is "required":

"RecurrentLayer requires 2 input (bottom) Blobs."
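
So the marker blob has to be wired in explicitly as the second bottom. A minimal hedged prototxt sketch of what that could look like (the blob names data and clip_markers, the shapes, and num_output are hypothetical, not taken from this thread):

layer {
  name: "lstm"
  type: "LSTM"
  bottom: "data"          # shape: T x N x input_dim
  bottom: "clip_markers"  # shape: T x N; 0 at sequence starts, 1 elsewhere
  top: "lstm_out"
  recurrent_param { num_output: 256 }
}

The layer feeding clip_markers would then emit 0 at the first timestep of each sequence and 1 elsewhere, i.e. the default pattern discussed above.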

@ayushchopra96

Hi @junhyukoh @aurotripathy. Is there support for accessing the hidden state at each timestep?
I need it to simulate an attention mechanism. If not, I would need to implement it myself.

Thanks
