Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
223 changes: 0 additions & 223 deletions doc/source/data/preprocessors.rst

This file was deleted.

1 change: 0 additions & 1 deletion doc/source/data/user-guide.rst
Original file line number Diff line number Diff line change
Expand Up @@ -23,6 +23,5 @@ show you how achieve several tasks.
working-with-pytorch
batch_inference
performance-tips
preprocessors
monitoring-your-workload
custom-datasource-example
2 changes: 1 addition & 1 deletion doc/source/train/distributed-xgboost-lightgbm.rst
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ machines have 16 CPUs in addition to the 4 GPUs, each actor should have
How to preprocess data for training?
------------------------------------

Particularly for tabular data, Ray Data comes with out-of-the-box :ref:`preprocessors <data-preprocessors>` that implement common feature preprocessing operations.
Particularly for tabular data, Ray Data comes with out-of-the-box :ref:`preprocessors <preprocessor-ref>` that implement common feature preprocessing operations.
You can use this with Ray Train Trainers by applying them on the dataset before passing the dataset into a Trainer. For example:


Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -510,7 +510,7 @@ Preprocessing structured data
This section is for tabular/structured data. The recommended way for preprocessing unstructured data is to use
Ray Data operations such as `map_batches`. See the :ref:`Ray Data Working with Pytorch guide <working_with_pytorch>` for more details.

For tabular data, we recommend using Ray Data :ref:`preprocessors <data-preprocessors>`, which implement common data preprocessing operations.
For tabular data, we recommend using Ray Data :ref:`preprocessors <preprocessor-ref>`, which implement common data preprocessing operations.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For tabular data, use Ray Data :ref:preprocessors <preprocessor-ref>, which implement common data preprocessing operations.

You can use this with Ray Train Trainers by applying them on the dataset before passing the dataset into a Trainer. For example:

.. testcode::
Expand Down