
Question about the crf model #101

Closed

HaoDreamlong opened this issue Jan 4, 2021 · 10 comments

@HaoDreamlong

In the CRF model of version v0.3.2, the encoder ends with a Tanh layer and a Scale layer. Why is it necessary to add these two layers?

@davidcpage
Contributor

This constrains the output scores to lie in a range given by the scale factor - e.g. for Scale(5.0) this is a soft clipping function to the range (-5.0, 5.0). Scores are in log space and this should allow plenty of dynamic range whilst improving training stability, but it's possible the Tanh layer could be removed.
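
A minimal sketch of this soft clipping, with a hypothetical `Scale` module standing in for the one in the repo:

```python
import torch
from torch import nn

class Scale(nn.Module):
    """Multiply activations by a fixed factor (hypothetical stand-in)."""
    def __init__(self, scale):
        super().__init__()
        self.scale = scale

    def forward(self, x):
        return self.scale * x

# Tanh squashes raw scores into (-1, 1); Scale(5.0) then stretches them
# to (-5.0, 5.0), acting as a soft clip on the log-space scores.
clip = nn.Sequential(nn.Tanh(), Scale(5.0))
print(clip(torch.tensor([-100.0, 0.0, 0.5, 100.0])))  # ~[-5.0, 0.0, 2.3, 5.0]
```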

@HaoDreamlong
Author

Thank you for your reply. I have one more question, about the GlobalNorm layer. I read the PyTorch version of the logZ calculation in seqdist.sparse, and I guess it is used for some sort of normalization. Does it have something to do with the Scale layer, and what exactly is the GlobalNorm layer for?

@HaoDreamlong
Author

And about the first question: if the outputs are log-space probabilities, shouldn't the probabilities be in the range (0, 1), and shouldn't log_p be less than 0?

@davidcpage
Contributor

The outputs of the network represent scores in a linear-chain CRF. You can use them to compute the log probability of a particular (aligned) output sequence by adding the log scores for the transitions at each timestep and subtracting the log of the global sum over (aligned) sequence scores, logZ. Scale() controls the dynamic range of the log scores, but these do not lie in (0,1) as they are not log probs.
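
To make this concrete, here is a toy linear-chain CRF in PyTorch (layout and shapes are illustrative, not the repo's). The score of an aligned path is the sum of its transition scores, logZ is the logsumexp over all path scores, and their difference is a log probability, so it is always at most 0 even though the raw scores themselves are unbounded:

```python
import torch

# Toy linear-chain CRF: scores[t, i, j] is the log score of the transition
# from state i to state j at timestep t (illustrative layout only).
T, S = 4, 3
scores = torch.randn(T, S, S)

# logZ via the forward algorithm: a logsumexp reduction over all paths.
alpha = torch.zeros(S)
for t in range(T):
    alpha = torch.logsumexp(alpha[:, None] + scores[t], dim=0)
logZ = torch.logsumexp(alpha, dim=0)

# Log probability of one particular aligned state sequence.
path = [0, 1, 2, 1, 0]  # length T + 1
path_score = sum(scores[t, path[t], path[t + 1]] for t in range(T))
log_p = path_score - logZ  # always <= 0: exp(log_p) is a probability
```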

@HaoDreamlong
Author

Oh, I get it. So since the outputs represent log scores, the loss of the model is the sum of the correct (aligned?) paths' scores, and the backward pass makes -loss smaller while pushing the correct paths toward the highest scores. And the decoder should work by finding the path with the highest score. Is that a proper description?

@iiSeymour
Member

Yes, that is right @HaoDreamlong

@HaoDreamlong
Author

Thank you very much. I have a little trouble understanding the variables named stay_indices/scores and move_indices/scores. As I understand it, stay_indices represents a 5-digit base-4 number, and move_indices is stay_indices plus the previous step's value. In some extreme situations, such as stay_indices=341 (1 1 1 1 1) and move_indices=342 (1 | 1 1 1 1 1), don't they represent the same situation?

@davidcpage
Contributor

We distinguish between being in state 1 1 1 1 1 and emitting a blank symbol (stay_index/score) and being in state 1 1 1 1 1 and emitting a 1 symbol (move_index/score). This leads to the same pair of before and after states, but a different emitted sequence. The inclusion of a blank symbol makes this a kind of CTC model, except that here the conditional independence assumption is replaced with a CRF.
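
As a sanity check on the indexing (assuming 5-mer states over 4 bases; the exact layout in the repo may differ), decoding 341 as a 5-digit base-4 number does give the state 1 1 1 1 1, and the stay/move distinction is about the emitted symbol, not the state pair:

```python
def decode_state(index, k=5, n_base=4):
    """Decode a flat state index into k base-4 digits, most significant first."""
    digits = []
    for _ in range(k):
        digits.append(index % n_base)
        index //= n_base
    return digits[::-1]

# A stay at this state emits a blank; a move ending in this state emits a 1.
# Same before/after state pair, different emitted sequence.
print(decode_state(341))  # [1, 1, 1, 1, 1]
```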

@HaoDreamlong
Author

The model's decode function simply calculates logZ twice (for different S) and obtains the gradient via autograd. It is hard to understand how this works as a Viterbi decoder. Could you tell me why such a delicate algorithm can produce the right answer?
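
One plausible reading of "logZ (for different S)" is that S is the semiring: computing logZ with max in place of logsumexp gives the score of the single best path, and because the gradient of a max is one-hot at its argmax, autograd recovers that path. A toy sketch of the identity, with made-up shapes:

```python
import torch

# Viterbi via autograd, in miniature: replace logsumexp with max in the
# forward recursion and "logZ" becomes the best path's score. The gradient
# of a max is one-hot at its argmax, so backprop through the whole
# recursion leaves a one-hot mask tracing the best path.
T, S = 4, 3
scores = torch.randn(T, S, S, requires_grad=True)

alpha = scores.new_zeros(S)
for t in range(T):
    alpha = torch.max(alpha[:, None] + scores[t], dim=0).values
best_score = alpha.max()

best_score.backward()
print(scores.grad.nonzero())  # one transition per timestep: the Viterbi path
```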

@HaoDreamlong
Author

@davidcpage The RNN model doesn't need chunk_lengths. Is that because the RNN can deal with the blank padding at the end of the input, or do I have to make the input entirely useful information?
