This paper has been accepted at Transactions on Machine Learning Research. The link to the paper is: https://openreview.net/pdf?id=R9CgBkeZ6Z.
If you use the code or the paper, please cite:
@article{
agarwal2023auxdrop,
title={Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts},
author={Rohit Agarwal and Deepak Gupta and Alexander Horsch and Dilip K. Prasad},
journal={Transactions on Machine Learning Research},
issn={2835-8856},
year={2023},
url={https://openreview.net/forum?id=R9CgBkeZ6Z},
note={Reproducibility Certification}
}
This repository contains datasets and implementation code for the paper, titled "Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts".
We use 7 different datasets for this project; links to all of them are given below. The datasets are also provided in their respective folders inside the Code/Datasets directory, except for HIGGS and SUSY, which are too large to include. To run those, download the HIGGS data and mask from the links given below and save them in the Code/Datasets/HIGGS/data/ and Code/Datasets/HIGGS/mask/ folders respectively. The same applies to the SUSY dataset.
https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data)
https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html
https://archive.ics.uci.edu/ml/datasets/magic+gamma+telescope
https://archive.ics.uci.edu/ml/datasets/adult
https://www.cs.ucr.edu/~eamonn/time_series_data_2018/
- Original Source: https://archive.ics.uci.edu/ml/datasets/HIGGS . It contains 11M datapoints and 28 features.
- We use only the first 1M data points and first 21 features.
- Data: The preprocessed dataset can be found at https://figshare.com/s/0cd0d6ad4d30a9e91e9a. Save it in the Code/Datasets/HIGGS/data/ folder.
- Mask: The masks for all the experiments with the HIGGS data can be found at https://figshare.com/s/644fe204eb591e104184. Save all the files from the link in the Code/Datasets/HIGGS/mask/ folder.
- Original Source: https://archive.ics.uci.edu/ml/datasets/SUSY . It contains 5M datapoints and 18 features.
- We use only the first 1M data points and first 8 features.
- Data: The preprocessed dataset can be found at https://figshare.com/s/f4098ce6635f702c89b2. Save it in the Code/Datasets/SUSY/data/ folder.
- Mask: The masks for all the experiments with the SUSY data can be found at https://figshare.com/s/87330bbbbc31b15d44e5. Save all the files from the link in the Code/Datasets/SUSY/mask/ folder.
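The preprocessed files linked above already contain the subsets used in the experiments. For readers who want to see what "first 1M data points and first N features" amounts to, here is a minimal sketch. It assumes a headerless CSV whose first column is the class label (the layout of the raw UCI files); the function name load_subset and the in-memory demo data are hypothetical and not part of this repository.

```python
import io

import pandas as pd


def load_subset(source, n_rows, n_features):
    """Return (labels, features) for the first n_rows rows.

    Assumption: `source` is a headerless CSV with the class label in
    column 0 and the features in the remaining columns.
    """
    frame = pd.read_csv(
        source,
        header=None,
        nrows=n_rows,                          # keep only the first rows
        usecols=list(range(n_features + 1)),   # label + first n_features
    )
    labels = frame.iloc[:, 0].to_numpy()
    features = frame.iloc[:, 1:].to_numpy()
    return labels, features


# Tiny in-memory stand-in for a raw file: label followed by 4 features.
demo = io.StringIO(
    "1,0.1,0.2,0.3,0.4\n"
    "0,0.5,0.6,0.7,0.8\n"
    "1,0.9,1.0,1.1,1.2\n"
)
labels, features = load_subset(demo, n_rows=2, n_features=3)
```

With a raw file on disk, the HIGGS setting would correspond to n_rows=1_000_000 and n_features=21, and the SUSY setting to n_rows=1_000_000 and n_features=8.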
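We varied the availability of each auxiliary input feature independently: at every time step, each auxiliary feature is available with probability p, drawn from a uniform distribution (p is set by aux_feat_prob in main.py).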
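As an illustration of this setting, the sketch below builds an availability mask in which every auxiliary feature is present independently with a fixed probability. It is a minimal example under stated assumptions, not the code used to create the mask files distributed with this repository; the function name variable_p_mask and the seed argument are hypothetical.

```python
import numpy as np


def variable_p_mask(n_instances, n_aux_features, p, seed=0):
    """Return a 0/1 mask of shape (n_instances, n_aux_features).

    Each entry is 1 (feature available) with probability p, decided
    independently by comparing a uniform draw in [0, 1) against p.
    """
    rng = np.random.default_rng(seed)
    draws = rng.random((n_instances, n_aux_features))
    return (draws < p).astype(int)


# Example: 1000 instances, 5 auxiliary features, each available 70% of the time.
mask = variable_p_mask(1000, 5, p=0.7)
```

An unavailable auxiliary feature at a given time step is then simply one whose mask entry is 0.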
The trapezoidal streams are simulated by splitting the data into 10 chunks. The number of features increases with each successive chunk of the data stream: the first chunk has the first 10% of the total features, the second chunk has the first 20% of the features, and so on. For more information, see the OLSF paper (https://ieeexplore.ieee.org/document/7465766).
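The chunking described above can be sketched as a mask over the feature matrix. This is an illustrative example only: the function name trapezoidal_mask is hypothetical, and rounding the number of visible features up with ceil is an assumption, since the text does not say how fractional feature counts are handled.

```python
import numpy as np


def trapezoidal_mask(n_instances, n_features, n_chunks=10):
    """Return a 0/1 mask where chunk k exposes the first (k+1)/n_chunks features."""
    mask = np.zeros((n_instances, n_features), dtype=int)
    # Row boundaries of the chunks, e.g. 0, 10, 20, ... for 100 instances.
    bounds = np.linspace(0, n_instances, n_chunks + 1).astype(int)
    for k in range(n_chunks):
        # Assumption: round the visible feature count up.
        n_visible = int(np.ceil((k + 1) * n_features / n_chunks))
        mask[bounds[k]:bounds[k + 1], :n_visible] = 1
    return mask


# Example: 100 instances with 20 features, split into 10 chunks.
mask = trapezoidal_mask(100, 20)
```

In this example the first 10 instances see 2 features, the next 10 see 4, and the last 10 see all 20.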
We demonstrate the effectiveness of Aux-Drop(ODL) in processing the extra information received from auxiliary features in both the SUSY and HIGGS datasets. Here, we design the data in such a way that all the auxiliary features are sudden features, i.e., there is no information about the existence of these features when the model is defined. The model learns about such a feature only at the time it first arrives.
We apply Aux-Drop on two base architectures, ODL and OGD (https://arxiv.org/abs/1711.03705). Moreover, we also evaluate the performance of Aux-Drop with a few of its variants (changes in the design).
- AuxDrop_ODL: Aux-Drop applied on the ODL architecture is called AuxDrop_ODL in the code.
- AuxDrop_OGD: Aux-Drop applied on the OGD architecture is called AuxDrop_OGD in the code.
- AuxDrop_ODL_DirectedInAuxLayer_RandomOtherLayer - On the ODL framework, Aux-Dropout is applied in the AuxLayer and random dropout in all the other layers.
- AuxDrop_ODL_RandomAllLayer - On the ODL framework, random dropout is applied in all the layers.
- AuxDrop_ODL_RandomInAuxLayer - On the ODL framework, random dropout is applied in the AuxLayer.
- AuxDrop_ODL_RandomInFirstLayer_AllFeatToFirst - On the ODL framework, random dropout is applied in the first layer and all the features (base + auxiliary) are passed to the first layer.
For the following comparison methods, all the metrics are taken directly from the respective papers, so we do not implement them here:
- https://link.springer.com/chapter/10.1007/978-3-031-30105-6_46
- https://ojs.aaai.org/index.php/AAAI/article/view/4192
- https://ieeexplore.ieee.org/document/7465766
We implement the ODL code and run it for two scenarios.
- Only Base Feature - First, we run it using only the base features. This gives us a lower limit of the performance.
- All Features - Then we run it using all the features (assuming all the features are available). This gives us an upper limit of the performance.
To run the models, see
- main.py: All the comparison models can be run from this.
- baseline.py: To run the Baseline model (ODL)
The class definition for each comparison model is given in
- AuxDrop.py
The class definition for ODL baseline is given in
- ODL.py
The dataloader for each dataset is given in
- dataset.py
For the main.py file,
- data_name: "german", "svmguide3", "magic04", "a8a", "ItalyPowerDemand", "SUSY", "HIGGS"
- type: "variable_p", "trapezoidal", "obsolete_sudden"
- model_to_run: "AuxDrop_ODL", "AuxDrop_OGD", "AuxDrop_ODL_DirectedInAuxLayer_RandomOtherLayer", "AuxDrop_ODL_RandomAllLayer", "AuxDrop_ODL_RandomInAuxLayer", "AuxDrop_ODL_RandomInFirstLayer_AllFeatToFirst"
- n: Learning rate
- aux_feat_prob: If type = "variable_p", then aux_feat_prob needs to be defined. Each auxiliary input feature is available independently, by a uniform distribution, with probability aux_feat_prob
- dropout_p: The dropout value of the AuxLayer
- max_num_hidden_layers: Number of hidden layers
- qtd_neuron_per_hidden_layer: Number of neurons in each hidden layer except the AuxLayer
- n_classes: The number of output classes
- aux_layer: The position of the AuxLayer in the architecture
- n_neuron_aux_layer: Number of neurons in the AuxLayer
- b: This is a parameter of the ODL framework. It represents the discount rate
- s: This is a parameter of the ODL framework. It represents the smoothing rate
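Since the control parameters are edited by hand in main.py, a small sanity check can catch typos before a long run. The sketch below only encodes the allowed values listed above; the helper check_config is hypothetical and does not exist in this repository, and the argument stream_type stands in for the parameter named type (renamed here to avoid shadowing the Python builtin).

```python
DATA_NAMES = {"german", "svmguide3", "magic04", "a8a",
              "ItalyPowerDemand", "SUSY", "HIGGS"}
STREAM_TYPES = {"variable_p", "trapezoidal", "obsolete_sudden"}
MODELS = {
    "AuxDrop_ODL",
    "AuxDrop_OGD",
    "AuxDrop_ODL_DirectedInAuxLayer_RandomOtherLayer",
    "AuxDrop_ODL_RandomAllLayer",
    "AuxDrop_ODL_RandomInAuxLayer",
    "AuxDrop_ODL_RandomInFirstLayer_AllFeatToFirst",
}


def check_config(data_name, stream_type, model_to_run, aux_feat_prob=None):
    """Raise ValueError if a control parameter is not one of the listed values."""
    if data_name not in DATA_NAMES:
        raise ValueError(f"unknown data_name: {data_name!r}")
    if stream_type not in STREAM_TYPES:
        raise ValueError(f"unknown type: {stream_type!r}")
    if model_to_run not in MODELS:
        raise ValueError(f"unknown model_to_run: {model_to_run!r}")
    # aux_feat_prob is only required for the "variable_p" setting.
    if stream_type == "variable_p":
        if aux_feat_prob is None:
            raise ValueError("aux_feat_prob must be defined for variable_p")
        if not 0.0 <= aux_feat_prob <= 1.0:
            raise ValueError("aux_feat_prob must be a probability in [0, 1]")


# Example: a valid combination passes silently.
check_config("german", "variable_p", "AuxDrop_ODL", aux_feat_prob=0.7)
```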
For the baseline.py file,
- data_name: "SUSY", "HIGGS"
- model_to_run: "ODL"
- data_type: "only_base", "all_feat"
- n: Learning rate
- max_num_hidden_layers: Number of hidden layers
- qtd_neuron_per_hidden_layer: Number of neurons in each hidden layer
- n_classes: The number of output classes
- b: It represents the discount rate
- s: It represents the smoothing rate
- numpy
- torch
- pandas
- random
- tqdm
- os
- pickle
To run the Aux-Drop model, change the control parameters accordingly in the main.py file and run
python Code/main.py
To run the baseline ODL model, change the control parameters accordingly in the baseline.py file and run
python Code/baseline.py