Skip to content

Challenge in collaboration with LUISS and one of Italy's most important energy companies

Notifications You must be signed in to change notification settings

enricoromano/Cash-Flow-Forecasting

Repository files navigation

Cash-Flow-Forecasting

Challenge in collaboration with LUISS and one of Italy's most important energy companies

Affinity Analysis (Market Basket Analysis)

Market Basket Analysis (MBA) is an accidental transaction pattern that purchasing some products will affect the purchasing of other products. MBA is used to predict what products that customer interested in. It’s a kind of knowledge discovery in data (KDD) and this technique can be applied in various fields of work. Before performing the affinity analysis, it is strongly advised to handle with missing values, either drop them, or replace them according to a predefined methodology. We decided to drop them because some of them are variables for identification purpose. In our understanding of the datasets provided, we decided to consider only positive values of column Amount, because in EDA, we found out that most of the Amount were positive, which intuitively we associate them as cost/liability since our main goal is to estimate cash flows related to the account payable in the 1st and 2nd quarters of 2022.

immagine

Time Series Forecasting

Prophet

The first algorithm is Prophet. It is a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data. Prophet is robust to missing data and shifts in the trend, and typically handles outliers well.

LSTM

Long Short-Term Memory (LSTM). LSTM is a Recurrent neural network that learn to predict the future from sequences of varying lengths use LSTM cells. Recurrent neural networks, unlike ARIMA and Prophet, can deal with any type of sequential data and are not limited to time series. The fundamental goal of LSTM cells is to remember the critical bits of the sequence while forgetting the less significant ones. This is accomplished using so-called gates, which are functions with various learning objectives.

immagine

In conclusion, through R2, MSE and RMSE we evaluated the models and chose the one that, in our view, is most accurate in calculating cash flow .

Authors

  • Enrico Romano
  • Simone Lu
  • Leonardo Crescenzi

About

Challenge in collaboration with LUISS and one of Italy's most important energy companies

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published