fast and faster R-CNN #35

Open
llcao opened this issue Dec 16, 2015 · 10 comments

Comments

@llcao

llcao commented Dec 16, 2015

Is it worth implementing Fast R-CNN and Faster R-CNN with Lasagne? Both are state-of-the-art object detection methods.

My current guess is that Faster R-CNN is easier, since it relies purely on two networks (one for proposals, another for detection), and I would like to work on it. Has anyone tried that?

@ebenolson
Member

It's been on my list to try but I probably won't get to it anytime soon. A PR would certainly be welcome!

@christopher-beckham

Has anyone written an implementation of one of these in Lasagne yet? Just wondering, since I might have to implement one of these later on this year for a project. It would be nice to know if this has already been done, to save me some work.

@Sentient07

I have an implementation of Fast R-CNN in Lasagne on top of the VGG16 model, though I am not sure it is correct.
Link to my implementation: https://gist.github.com/Sentient07/bade6167bc923a4f0bf4a14196065cf7
(In the next two days I'll add the other models, and an RPN for Faster R-CNN.)
The version of Theano you need: https://github.com/Sentient07/Theano/tree/ROIPool (I am keeping it up to date with the current master and will do so until it gets merged).
The version of Lasagne you need: https://github.com/Sentient07/Lasagne/tree/ROIPool (I am waiting to open a PR until my ops are merged into Theano).
@f0k I'd be glad if you could tell me whether my implementation has any mistakes :)
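
For readers unfamiliar with the RoI pooling step that the ROIPool branches refer to, here is a minimal pure-Python sketch of the idea: each region of interest is split into a fixed grid of bins and the maximum is taken in each bin. This is not the Theano op from the linked branch; the function names `ceil_div` and `roi_max_pool` are hypothetical, the sketch handles a single channel only, and it assumes the RoI is already given in feature-map coordinates (reference implementations also apply a spatial scale and may round bin edges differently).

```python
def ceil_div(a, b):
    """Integer ceiling division for non-negative a and positive b."""
    return -(-a // b)


def roi_max_pool(feature_map, roi, out_h, out_w):
    """Max-pool one region of interest to a fixed out_h x out_w grid.

    feature_map: list of rows (single channel).
    roi: (x1, y1, x2, y2), inclusive, in feature-map coordinates (assumed).
    """
    x1, y1, x2, y2 = roi
    roi_h = y2 - y1 + 1
    roi_w = x2 - x1 + 1
    pooled = []
    for i in range(out_h):
        # Floor for the bin start and ceil for the bin end, so that the
        # bins cover the whole RoI and are never empty.
        r0 = y1 + (i * roi_h) // out_h
        r1 = y1 + ceil_div((i + 1) * roi_h, out_h)
        row = []
        for j in range(out_w):
            c0 = x1 + (j * roi_w) // out_w
            c1 = x1 + ceil_div((j + 1) * roi_w, out_w)
            row.append(max(feature_map[r][c]
                           for r in range(r0, r1)
                           for c in range(c0, c1)))
        pooled.append(row)
    return pooled


# A 4x4 feature map with values 0..15, pooled to 2x2.
fmap = [[4 * r + c for c in range(4)] for r in range(4)]
whole = roi_max_pool(fmap, (0, 0, 3, 3), 2, 2)
corner = roi_max_pool(fmap, (1, 1, 3, 3), 2, 2)
```

Whatever the size of the RoI, the output always has the same shape, which is what lets the fully connected layers after it accept arbitrary proposals.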

@christopher-beckham

Thanks very much @Sentient07 :) Is there any possibility of seeing this as a Lasagne recipe at some point in the future?

@Sentient07

Theano is currently focused on a release, so I don't think my PR will get merged there until that is done. Once it's merged, I'll open a PR on Lasagne, and then a Lasagne recipe :) That would take approximately a month.

@Sentient07

Sentient07 commented Mar 15, 2017

There is a small detail I forgot to mention in my previous comment. On certain GPUs, you have to set the flag floatX=float32 for this model to work when using the new backend (float64 requires more than 4 GB of memory). The old backend switches to float32 automatically.
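
One way to set that flag from Python, as a rough sketch: Theano reads the comma-separated `THEANO_FLAGS` environment variable when it is first imported, so the variable has to be set before `import theano`. The helper name `with_theano_flag` is made up for this example; setting `floatX = float32` under `[global]` in `.theanorc` is an equivalent alternative.

```python
import os


def with_theano_flag(existing, key, value):
    """Return a THEANO_FLAGS string with key=value set.

    Any earlier setting of the same key is dropped; other flags are kept.
    """
    parts = [p for p in existing.split(",")
             if p and not p.startswith(key + "=")]
    parts.append(f"{key}={value}")
    return ",".join(parts)


# Must run before the first `import theano` in the process.
os.environ["THEANO_FLAGS"] = with_theano_flag(
    os.environ.get("THEANO_FLAGS", ""), "floatX", "float32")
```

Since float32 stores each value in 4 bytes instead of 8, this roughly halves the memory taken by parameters and activations compared with float64.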

@christopher-beckham

If I have some time, I'll be happy to look at the implementation too, but I think it's worth my re-reading the Fast R-CNN paper before I look at any code!

@f0k
Member

f0k commented Mar 16, 2017

> If I have some time, I'll be happy to look at the implementation too

That'd be welcome! I'm afraid I won't be able to check the implementation in the near future. If you're able to reproduce results from the paper, that's a good indicator that you've got the basics correct :)

@Sentient07

Sentient07 commented Mar 16, 2017 via email

@f0k
Member

f0k commented Mar 16, 2017

> How many epochs should I run? Is it fine if I compare the results with the Caffe implementation at some smaller epoch, or should I train it completely and compare the results?

If you can afford to run it long enough to compare/reproduce the final results, that'd probably be better. But you might want to compare to Caffe in between (to make sure your implementation is not way off). If full training is not feasible, a stripped-down demo that shows it's doing something useful would also be fine for a Lasagne Recipe.
