Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
44 commits
Select commit Hold shift + click to select a range
f1eb981
pick-from-stephen's work
owenowenisme Nov 13, 2025
3221246
update
owenowenisme Nov 14, 2025
90312c3
update requirement_compiled.txt
owenowenisme Nov 14, 2025
be7bf8d
testcontainer
owenowenisme Nov 14, 2025
fad95b2
update test
owenowenisme Nov 14, 2025
bcc6727
add timeout test
owenowenisme Nov 14, 2025
3fc991a
remove python-dotenv
owenowenisme Nov 14, 2025
67040fa
Merge branch 'master' into data/add-kafka-as-datasource
owenowenisme Nov 14, 2025
a934dae
add doc entry
owenowenisme Nov 14, 2025
bfa465a
refine read_kafka
owenowenisme Nov 14, 2025
0e9d832
update req compile txt
owenowenisme Nov 14, 2025
63825e7
update lock
owenowenisme Nov 15, 2025
e08b3a9
make value bytes
owenowenisme Nov 15, 2025
6b9ffc7
make key from msg also bytes
owenowenisme Nov 15, 2025
f9acc46
make offsets only accept int, 'earlist' and 'latest'
owenowenisme Nov 16, 2025
f526946
handle error in _resolve_offsets and make offset non-optional
owenowenisme Nov 16, 2025
4120726
Merge branch 'master' into data/add-kafka-as-datasource
owenowenisme Nov 16, 2025
f28a2e5
nake kafka auth config a type
owenowenisme Nov 16, 2025
4d6aaa7
message header key is string
owenowenisme Nov 16, 2025
71c96d7
make kafkaAurhConfig dataclass
owenowenisme Nov 16, 2025
17421b7
remove parallelism param
owenowenisme Nov 16, 2025
e822674
remove max_records_per_task and make internal variable underscored
owenowenisme Nov 16, 2025
37000dd
gcm set _target_max_block_size in constructor
owenowenisme Nov 16, 2025
eb9cdb4
refine yield logic
owenowenisme Nov 16, 2025
673b4ed
move schema to the front
owenowenisme Nov 16, 2025
981a68d
make trigger err msg raw string
owenowenisme Nov 16, 2025
bb56032
remove redundant check
owenowenisme Nov 16, 2025
7286a39
remove redundant function _convert_headers_to_dict
owenowenisme Nov 16, 2025
772d432
remove unused records_read
owenowenisme Nov 16, 2025
e8d2b68
update description
owenowenisme Nov 16, 2025
69b42b3
make auth a proper dataclasses
owenowenisme Nov 16, 2025
6c35651
update comment
owenowenisme Nov 16, 2025
5f84893
fix target_max_block_size
owenowenisme Nov 16, 2025
496bf0e
update description
owenowenisme Nov 16, 2025
5be90eb
update remaining_timeout_ms
owenowenisme Nov 16, 2025
134104a
Update python/ray/data/read_api.py
owenowenisme Nov 16, 2025
6e60f8a
Update python/ray/data/_internal/datasource/kafka_datasource.py
owenowenisme Nov 16, 2025
956f063
Update python/ray/data/_internal/datasource/kafka_datasource.py
owenowenisme Nov 16, 2025
b3639c0
update comments
owenowenisme Nov 16, 2025
a5f04b9
update optional KafkaAuthConfig
owenowenisme Nov 16, 2025
3e628c4
Merge remote-tracking branch 'upstream/master' into data/add-kafka-as…
owenowenisme Nov 17, 2025
566f497
Merge branch 'master' into data/add-kafka-as-datasource
owenowenisme Nov 19, 2025
f375dab
add log update description
owenowenisme Nov 19, 2025
6d6b71b
update log
owenowenisme Nov 19, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 9 additions & 0 deletions doc/source/data/api/input_output.rst
Original file line number Diff line number Diff line change
Expand Up @@ -368,6 +368,15 @@ WebDataset

.. _data_source_api:

Kafka
-----

.. autosummary::
:nosignatures:
:toctree: doc/

read_kafka

Datasource API
--------------

Expand Down
14 changes: 14 additions & 0 deletions python/ray/data/BUILD.bazel
Original file line number Diff line number Diff line change
Expand Up @@ -1129,6 +1129,20 @@ py_test(
],
)

py_test(
name = "test_kafka",
size = "medium",
srcs = ["tests/test_kafka.py"],
tags = [
"exclusive",
"team:data",
],
deps = [
":conftest",
"//:ray_lib",
],
)

py_test(
name = "test_metadata_provider",
size = "small",
Expand Down
4 changes: 4 additions & 0 deletions python/ray/data/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -27,6 +27,7 @@
from ray.data.iterator import DataIterator, DatasetIterator
from ray.data.preprocessor import Preprocessor
from ray.data.read_api import ( # noqa: F401
KafkaAuthConfig,
from_arrow,
from_arrow_refs,
from_blocks,
Expand Down Expand Up @@ -55,6 +56,7 @@
read_datasource,
read_delta,
read_delta_sharing_tables,
read_kafka,
read_hudi,
read_iceberg,
read_images,
Expand Down Expand Up @@ -162,6 +164,8 @@
"read_datasource",
"read_delta",
"read_delta_sharing_tables",
"read_kafka",
"KafkaAuthConfig",
"read_hudi",
"read_iceberg",
"read_images",
Expand Down
Loading