@@ -652,59 +652,6 @@ Ray Data interoperates with distributed data processing frameworks like `Daft <h
     {'col1': 1, 'col2': '1'}
     {'col1': 2, 'col2': '2'}
 
-.. _loading_huggingface_datasets:
-
-Loading Hugging Face datasets
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-
-To read datasets from the Hugging Face Hub, use :func:`~ray.data.read_parquet` (or
-other read functions) with the ``HfFileSystem`` filesystem. This approach provides
-better performance and scalability than loading datasets into memory first.
-
-First, install the required dependencies:
-
-.. code-block:: console
-
-    pip install huggingface_hub
-
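-``HfFileSystem`` implements the standard ``fsspec`` interface, so you can browse a
-dataset repo before reading it. A minimal sketch (the repo name is just an example):
-
-.. testcode::
-    :skipif: True
-
-    from huggingface_hub import HfFileSystem
-
-    # List the files in a public dataset repo through the fsspec interface.
-    fs = HfFileSystem()
-    print(fs.ls("datasets/wikimedia/wikipedia", detail=False))
-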
-Set your Hugging Face token to authenticate. Public datasets are readable without a
-token, but Hugging Face enforces stricter rate limits on unauthenticated requests. To
-read without a token, pass ``filesystem=HfFileSystem()`` with no token argument, as in
-the sketch below.
-
-.. code-block:: console
-
-    export HF_TOKEN=<YOUR HUGGING FACE TOKEN>
-
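-For example, a minimal sketch of anonymous access, assuming you accept the lower
-unauthenticated rate limits:
-
-.. testcode::
-    :skipif: True
-
-    import ray
-    from huggingface_hub import HfFileSystem
-
-    # No token: fine for public datasets, but subject to stricter rate limits.
-    ds = ray.data.read_parquet(
-        "hf://datasets/wikimedia/wikipedia",
-        file_extensions=["parquet"],
-        filesystem=HfFileSystem(),
-    )
-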
-Most Hugging Face datasets store their data as Parquet files, so you can read directly
-from the dataset path:
-
-.. testcode::
-    :skipif: True
-
-    import os
-    import ray
-    from huggingface_hub import HfFileSystem
-
-    # Authenticate with the token exported above.
-    ds = ray.data.read_parquet(
-        "hf://datasets/wikimedia/wikipedia",
-        # Only read Parquet files, skipping other files in the repo.
-        file_extensions=["parquet"],
-        filesystem=HfFileSystem(token=os.environ["HF_TOKEN"]),
-    )
-
-    print(f"Dataset count: {ds.count()}")
-    print(ds.schema())
-
-.. testoutput::
-
-    Dataset count: 61614907
-    Column  Type
-    ------  ----
-    id      string
-    url     string
-    title   string
-    text    string
-
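-The full dataset is large, so you might read only part of it. A hedged sketch: the
-``wikimedia/wikipedia`` repo appears to store each configuration in its own directory
-(for example ``20231101.en``); confirm the layout in the repo's file tree before
-relying on the path below.
-
-.. testcode::
-    :skipif: True
-
-    import os
-    import ray
-    from huggingface_hub import HfFileSystem
-
-    # Read a single configuration directory instead of the whole repo.
-    # "20231101.en" is an assumed path; check the repo for actual names.
-    ds = ray.data.read_parquet(
-        "hf://datasets/wikimedia/wikipedia/20231101.en",
-        file_extensions=["parquet"],
-        filesystem=HfFileSystem(token=os.environ["HF_TOKEN"]),
-    )
-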
 .. _loading_datasets_from_ml_libraries:
 
 Loading data from ML libraries