Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Full Labels in LaserMix #34

Open
matildecc opened this issue Oct 30, 2024 · 3 comments
Open

Full Labels in LaserMix #34

matildecc opened this issue Oct 30, 2024 · 3 comments

Comments

@matildecc
Copy link

In your paper, you present a comparison with Full Labels (Fig 1. right)
What does "full" mean in the case of LaserMix? Since it is a semi-supervised approach, what does training with the full train split mean? Is it just the Cylinder3D/FIDNet trained in a supervised way? If so, why the results don't match? Or is there any mixing involved like labeled + labeled?

@ldkong1205
Copy link
Owner

In your paper, you present a comparison with Full Labels (Fig 1. right)

What does "full" mean in the case of LaserMix? Since it is a semi-supervised approach, what does training with the full train split mean? Is it just the Cylinder3D/FIDNet trained in a supervised way? If so, why the results don't match? Or is there any mixing involved like labeled + labeled?

Hi @matildecc, thanks for your interest in our work.

For your question: The "full" setting means that the whole training set is considered as both the labeled set and unlabeled set in semi-supervised learning. The framework is LaserMix, which takes a pair of labeled and unlabeled data for feature learning. Therefore, the performance is higher than simply using Cylinder3D/FIDNet for training on the labeled set.

Should you have any questions, please let me know. Thanks again!

@matildecc
Copy link
Author

Hi @ldkong1205 !
Thank you for your quick answer. And congrats for the work. It's really interesting...

I don't think I understood your answer correctly.
When it's 10%, it means that 10% of the original train split is selected as labeled and 90% as unlabeled. The same idea applies to 20% and 50%. If you're referring to the "full" case, it would mean that 100% of the data is labeled? If yes, how do you perform the mixing part? labeled + labeled? Do the pseudo-labeling still exists ?

@ldkong1205
Copy link
Owner

Hi @ldkong1205 ! Thank you for your quick answer. And congrats for the work. It's really interesting...

I don't think I understood your answer correctly. When it's 10%, it means that 10% of the original train split is selected as labeled and 90% as unlabeled. The same idea applies to 20% and 50%. If you're referring to the "full" case, it would mean that 100% of the data is labeled? If yes, how do you perform the mixing part? labeled + labeled? Do the pseudo-labeling still exists ?

Hi @matildecc, sorry for the late reply!

For your question:

  • "When it's 10%, it means that 10% of the original train split is selected as labeled and 90% as unlabeled."
    • A: Your understanding is correct. This is the standard setting for semi-supervised learning.
  • If you're referring to the "full" case, it would mean that 100% of the data is labeled? If yes, how do you perform the mixing part? labeled + labeled? Does the pseudo-labeling still exist?
    • A: In terms of 100%, we assume that all training samples are both treated as "labeled" and "unlabeled". We perform the mixing just as semi-supervised learning. The pseudo-labels for the "unlabeled" (although they have labels under 100% setting) are still generated from the Teacher net. You can think of this as some sort of "data augmentation" of "self-training", where the training samples are supervised by both the ground truth labels, as well as the pseudo-labels generated by the network itself.

Please let us know if the above resolves your question. We are happy to provide more detailed explanations if you still have issues!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants