Skip to content

Commit

Permalink
Merge pull request #1 from microsoft/high-freq-execution
Browse files Browse the repository at this point in the history
High freq execution
  • Loading branch information
Mingzhe-Han authored Sep 17, 2021
2 parents b389306 + 6a22136 commit 202bbea
Show file tree
Hide file tree
Showing 10 changed files with 8 additions and 475 deletions.
2 changes: 1 addition & 1 deletion examples/trade/README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# Universal Trading for Order Execution with Oracle Policy Distillation
This is the experiment code for our AAAI 2021 paper "[Universal Trading for Order Execution with Oracle Policy Distillation](https://seqml.github.io/opd/opd_aaai21.pdf)", including the implementations of all the compared methods in the paper and a general reinforcement learning framework for order execution in quantitative finance.
This is the experiment code for our AAAI 2021 paper "[Universal Trading for Order Execution with Oracle Policy Distillation](https://arxiv.org/abs/2103.10860)", including the implementations of all the compared methods in the paper and a general reinforcement learning framework for order execution in quantitative finance.

## Abstract
As a fundamental problem in algorithmic trading, order execution aims at fulfilling a specific trading order, either liquidation or acquirement, for a given instrument. Towards effective execution strategy, recent years have witnessed the shift from the analytical view with model-based market assumptions to model-free perspective, i.e., reinforcement learning, due to its nature of sequential decision optimization. However, the noisy and yet imperfect market information that can be leveraged by the policy has made it quite challenging to build up sample efficient reinforcement learning methods to achieve effective order execution. In this paper, we propose a novel universal trading policy optimization framework to bridge the gap between the noisy yet imperfect market states and the optimal action sequences for order execution. Particularly, this framework leverages a policy distillation method that can better guide the learning of the common policy towards practically optimal execution by an oracle teacher with perfect information to approximate the optimal trading strategy. The extensive experiments have shown significant improvements of our method over various strong baselines, with reasonable trading actions.
Expand Down
5 changes: 0 additions & 5 deletions examples/trade/model/__init__.py

This file was deleted.

74 changes: 0 additions & 74 deletions examples/trade/model/opd.py

This file was deleted.

79 changes: 0 additions & 79 deletions examples/trade/model/ppo.py

This file was deleted.

52 changes: 0 additions & 52 deletions examples/trade/model/qmodel.py

This file was deleted.

70 changes: 0 additions & 70 deletions examples/trade/model/teacher.py

This file was deleted.

Loading

0 comments on commit 202bbea

Please sign in to comment.