PaddlePaddle 1.0.0-rc0
Pre-release
Release Log
Major New Features and Improvements:
- Support training and inference on MacOS, and inference on Windows (Alpha).
- Speed up the While operator.
- Enhance support for sparse tensors.
- Enhance TensorRT integration.
- More fused operators for CPU inference: GRU, LSTM, etc.
- Some improvements to sequence operators (sequence_pool, sequence_concat, sequence_mask, sequence_enumerate, sequence_slice, etc.).
- Other operator improvements: stack_op, BatchAUC, prelu, crf, pad2d.
- decayed_adagrad support for distributed training.
- Python multi-process reader.
- API doc improvements. Avoid kwargs.
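For readers unfamiliar with the sequence operators listed above, here is a minimal pure-Python sketch of the masking idea behind an operator such as sequence_mask. It assumes the common convention that position j of a row is 1 when j is less than that sequence's length. It is not PaddlePaddle's implementation and does not call its API; the function signature and the `maxlen` parameter are illustrative assumptions.

```python
def sequence_mask(lengths, maxlen=None):
    """Build a 0/1 mask with one row per sequence.

    Entry j of a row is 1 when j < that sequence's length, else 0.
    Illustrative helper only; not the PaddlePaddle operator.
    """
    if maxlen is None:
        # Default to the longest sequence (0 when there are no sequences).
        maxlen = max(lengths, default=0)
    return [[1 if j < n else 0 for j in range(maxlen)] for n in lengths]


# Three sequences of lengths 2, 3 and 1, padded to the longest one.
for row in sequence_mask([2, 3, 1]):
    print(row)
```

A mask like this is typically multiplied into a padded batch so that padding positions do not contribute to a loss or a pooling result.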
Others:
- Tighten public APIs. Hide public APIs that are currently not widely used and unlikely to be used in the near future.
- Clean up some deprecated features.
Known Issues:
- Memory optimization still has room for improvement; further work is expected in the next release.
- Using memory optimization together with distributed training requires strictly following some counter-intuitive instructions.