This repository contains the code for training and testing deep learning models for time series prediction using multiple network architectures, including Long Short-Term Memory (LSTM) and Transformer models.
This project focuses on predicting time series data, specifically 2D data, using deep learning models. The code includes data preprocessing, model definition, training, and testing steps.
- Python 3.x
- PyTorch
- Matplotlib
- NumPy
- tqdm
Install the required packages using:

```bash
pip install -r requirements.txt
```
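For reference, a minimal `requirements.txt` consistent with the list above might look like the following; the entries mirror the dependency list, and any version pins you add are your own choice rather than something taken from this repository:

```text
torch
matplotlib
numpy
tqdm
```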
The dataset used in this project contains time series data. The main feature for prediction is temperature, and additional features can easily be configured in the code. You can also use your own custom datasets by changing the `columns` variable in `run.py`. The dataset is loaded and processed by the `data_factory` module.
```python
# Example of customizing the dataset
url = ['data/eng_pred/processed.csv']  # path to your data file
columns = ['airTemperature', 'dewTemperature', 'windSpeed', 'hour', 'day_of_week', 'month', 'Power']  # must follow the column order of the data file
target = ['Power']  # target features
```
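The exact interface of the `data_factory` module is not reproduced here. Purely as a rough sketch, the snippet below shows one common way a CSV with the columns configured above could be turned into sliding-window (input, target) pairs for a sequence model; the function name, window lengths, and loading approach are all illustrative assumptions, not the repository's actual code:

```python
# Hypothetical sliding-window preparation (not the repo's data_factory API)
import numpy as np
import torch

def make_windows(csv_path, columns, target, seq_len=96, pred_len=24):
    # Read the CSV by column name and keep only the configured columns, in order
    data = np.genfromtxt(csv_path, delimiter=",", names=True)
    values = np.stack([data[c] for c in columns], axis=1).astype("float32")
    target_idx = [columns.index(t) for t in target]

    inputs, labels = [], []
    for start in range(len(values) - seq_len - pred_len + 1):
        # Past window with all features; future window with target features only
        inputs.append(values[start:start + seq_len])
        labels.append(values[start + seq_len:start + seq_len + pred_len, target_idx])
    return torch.tensor(np.array(inputs)), torch.tensor(np.array(labels))
```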
The implemented model architectures include an encoder-decoder LSTM-based Seq2Seq model, a Transformer, and plain recurrent networks. The `Encoder`, `Decoder`, `Seq2Seq`, and `RNN` components are defined in the `model` file. You can customize the architecture by adjusting the hyperparameters passed to the model classes.
```python
# Example of customizing the model architecture
encoder = Encoder(input_dim=len(columns), hidden_dim=16, num_layers=1, model_name='RNN', dropout=0.1)
decoder = Decoder(output_dim=len(columns), hidden_dim=16, num_layers=1, model_name='RNN', dropout=0.1)
decoder_attention = Decoder_Attention(output_dim=len(columns), embedding_dim=64, hidden_dim=64,
                                      num_layers=2, num_heads=4, model_name='LSTM')
seq2seq = Seq2Seq(encoder, decoder, device=device)
seq2seq_attention = Seq2Seq(encoder, decoder_attention, device=device)
rnn = RNN(input_dim=len(columns), hidden_dim=16, num_layers=1, output_dim=len(columns), model_name='RNN', dropout=0.1)
transformer = Seq2Seq_Transfomer(input_dim=len(columns), output_dim=len(columns), d_model=128,
                                 num_encoder_layers=2, num_decoder_layers=2,
                                 batch_first=True, dim_feedforward=256)
```
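For intuition only, here is a self-contained sketch of what an encoder-decoder (Seq2Seq) recurrent model does, written directly against PyTorch's `nn.LSTM` rather than the repository's `Encoder`/`Decoder`/`Seq2Seq` classes, whose exact signatures may differ:

```python
import torch
import torch.nn as nn

class TinySeq2Seq(nn.Module):
    """Conceptual encoder-decoder: encode the past window, then unroll predictions step by step."""
    def __init__(self, input_dim, hidden_dim=16, output_dim=1, num_layers=1):
        super().__init__()
        self.encoder = nn.LSTM(input_dim, hidden_dim, num_layers, batch_first=True)
        self.decoder = nn.LSTM(output_dim, hidden_dim, num_layers, batch_first=True)
        self.head = nn.Linear(hidden_dim, output_dim)

    def forward(self, src, pred_len):
        _, state = self.encoder(src)                  # summarize the input window into a hidden state
        step = torch.zeros(src.size(0), 1, self.head.out_features, device=src.device)
        outputs = []
        for _ in range(pred_len):                     # autoregressively generate the future steps
            out, state = self.decoder(step, state)
            step = self.head(out)                     # (batch, 1, output_dim)
            outputs.append(step)
        return torch.cat(outputs, dim=1)              # (batch, pred_len, output_dim)

# e.g. a (batch=8, seq_len=96, features=7) window used to predict 24 future steps of one target
y_hat = TinySeq2Seq(input_dim=7)(torch.randn(8, 96, 7), pred_len=24)
```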
To train the model, execute the `train` function in the `run.py` script. Hyperparameters such as batch size, learning rate, and optimizer type are configured in the `args` dictionary.
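The exact contents of the `args` dictionary are defined in `run.py`. Purely as an assumed illustration, it could look something like this; the key names and values below are not taken from the repository:

```python
# Illustrative only; check run.py for the real keys and defaults
args = {
    'batch_size': 32,
    'learning_rate': 1e-3,
    'optimizer': 'Adam',
    'epochs': 50,
    'seq_len': 96,      # length of the input window
    'pred_len': 24,     # number of future steps to predict
}
```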
Then launch training with:

```bash
python run.py
```
To assess the model's performance, use the `test` function in the `run.py` script. This function employs a pre-trained model to make predictions on the test dataset, providing evaluation metrics such as Mean Squared Error (MSE) and Mean Absolute Error (MAE).
```bash
python run.py
```
The model evaluation results, including key performance metrics and visualizations, are stored in the `./results/` directory after running the testing process. This section outlines the files generated during testing and how to interpret them.
- `metrics.npy`: Key performance metrics calculated during the testing phase: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and Mean Squared Percentage Error (MSPE).
- `pred.npy`: The predictions made by the model on the test dataset.
- `true.npy`: The actual values from the test dataset.

The metrics stored in `metrics.npy` are interpreted as follows:

- MAE (Mean Absolute Error): The average absolute difference between predicted and actual values. Lower values indicate better performance.
- MSE (Mean Squared Error): The average of the squared differences between predicted and actual values. Lower values signify better accuracy.
- RMSE (Root Mean Squared Error): The square root of MSE, giving the prediction error in the same units as the target.
- MAPE (Mean Absolute Percentage Error): The average absolute percentage difference between predicted and actual values.
- MSPE (Mean Squared Percentage Error): Similar to MAPE but uses squared percentage differences.

A short script for recomputing these metrics from the saved arrays is sketched after this list.
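Assuming `pred.npy` and `true.npy` hold arrays of the same shape, the metrics above can be recomputed roughly as follows; this is a sketch, not the repository's own evaluation code, and the file paths are assumptions:

```python
import numpy as np

pred = np.load('./results/pred.npy')
true = np.load('./results/true.npy')

mae  = np.mean(np.abs(pred - true))
mse  = np.mean((pred - true) ** 2)
rmse = np.sqrt(mse)
mape = np.mean(np.abs((pred - true) / true))   # assumes true contains no zeros
mspe = np.mean(((pred - true) / true) ** 2)

print(f'MAE={mae:.4f}  MSE={mse:.4f}  RMSE={rmse:.4f}  MAPE={mape:.4f}  MSPE={mspe:.4f}')
```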
In the `./results/` directory, visualizations such as prediction vs. ground truth plots are saved. These visualizations help to qualitatively assess the model's performance.
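If you want to draw a comparable plot yourself from the saved arrays, a minimal Matplotlib sketch is shown below; the `(samples, pred_len, features)` shape and the output filename are assumptions, so adjust the indexing to match your data:

```python
import numpy as np
import matplotlib.pyplot as plt

pred = np.load('./results/pred.npy')
true = np.load('./results/true.npy')

# Plot the first test window and the first target feature; change indices as needed
plt.plot(true[0, :, 0], label='ground truth')
plt.plot(pred[0, :, 0], label='prediction')
plt.legend()
plt.xlabel('time step')
plt.ylabel('target value')
plt.savefig('./results/pred_vs_true_example.png')
```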
Feel free to explore these results to understand how well the model is capturing patterns and making predictions on the test dataset.