Update the data of quick start. #573

qingqing01 · 2016-11-23T08:20:04Z

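`quick_start/data/get_data` now downloads data that has already been preprocessed.
Add a `quick_start/data/proc_from_raw_data` directory that provides scripts for processing the raw data.
Update the documentation accordingly.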

luotao1 · 2016-11-23T13:31:46Z

demo/quick_start/data/proc_from_raw_data/get_data.sh

+#https://github.com/moses-smt/mosesdecoder
+#wget https://github.com/moses-smt/mosesdecoder/archive/master.zip
+#unzip master.zip
+#rm master.zip


The commented-out lines can be removed now.

luotao1 · 2016-11-23T13:32:28Z

doc/demo/quick_start/index_en.md

@@ -59,12 +59,11 @@ To build your text classification system, your code will need to perform five st
 ## Preprocess data into standardized format
 In this example, you are going to use [Amazon electronic product review dataset](http://jmcauley.ucsd.edu/data/amazon/) to build a bunch of deep neural network models for text classification. Each text in this dataset is a product review. This dataset has two categories: “positive” and “negative”. Positive means the reviewer likes the product, while negative means the reviewer does not like the product.

-`demo/quick_start` in the [source code](https://github.com/baidu/Paddle) provides scripts for downloading data and preprocessing data as shown below. The data process takes several minutes (about 3 minutes in our machine).
+`demo/quick_start` in the [source code](https://github.com/baidu/Paddle) provides script for downloading the preprocessed data as shown below. (If you want to process the raw data, you can use the script `demo/quick_start/data/proc_from_raw_data/get_data.sh`).


https://github.com/PaddlePaddle/Paddle

luotao1 · 2016-11-23T13:32:59Z

doc_cn/demo/quick_start/index.md

@@ -32,13 +32,11 @@

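 ## Data Format Preparation (Data Preparation)
 In this problem, we use the [Amazon electronic product review data](http://jmcauley.ucsd.edu/data/amazon/),
-and classify the reviews into two classes: positive reviews (positive samples) and negative reviews (negative samples). `demo/quick_start` in the [source code](https://github.com/baidu/Paddle) provides a data download script
-and a preprocessing script.
+and classify the reviews into two classes: positive reviews (positive samples) and negative reviews (negative samples). `demo/quick_start` in the [source code](https://github.com/baidu/Paddle) provides a script for downloading the preprocessed data (if you want to process the raw data from scratch, you can use the script `./demo/quick_start/data/proc_from_raw_data/get_data.sh`).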


https://github.com/PaddlePaddle/Paddle


Update the data of quick start.

ebd7252

luotao1 reviewed Nov 23, 2016


qingqing01 added 2 commits November 24, 2016 10:04

Merge branch 'develop' of https://github.com/baidu/Paddle into quick_start

1edacf7

Update doc and proc_from_raw_data/get_data.sh

0561dd0

qingqing01 mentioned this pull request Nov 25, 2016

Running demo/quick_start/preprocess.sh reports a perl error #612

Closed

luotao1 approved these changes Nov 28, 2016


luotao1 merged commit 2f60248 into PaddlePaddle:develop Nov 28, 2016

qingqing01 mentioned this pull request Dec 5, 2016

demo/quick_start/preprocess.sh takes hours on a VM with 1 CPU #111

Closed

qingqing01 deleted the quick_start branch April 20, 2017 10:34

zhhsplendid pushed a commit to zhhsplendid/Paddle that referenced this pull request Sep 25, 2019

paddle_tensorrt_infer_en (PaddlePaddle#573)

65f013f

* paddle_tensorrt_infer_en * Update paddle_tensorrt_infer_en.md * Review

wangxicoding pushed a commit to wangxicoding/Paddle that referenced this pull request Dec 9, 2021

using new api cross_entropy. (PaddlePaddle#573)

840149d
