Skip to content

Conversation

yanxi0830
Copy link
Contributor

@yanxi0830 yanxi0830 commented Mar 15, 2025

What does this PR do?

  • fix datasets api signature mis-match so that llama stack run can start

Test Plan

llama stack run
image

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Mar 15, 2025
@yanxi0830 yanxi0830 changed the base branch from main to pr1573 March 15, 2025 21:47
@yanxi0830 yanxi0830 marked this pull request as ready for review March 15, 2025 21:49
@yanxi0830 yanxi0830 merged commit 2c9d624 into pr1573 Mar 15, 2025
5 checks passed
@yanxi0830 yanxi0830 deleted the fix_resolver_issue branch March 15, 2025 21:56
provider_dataset_id = dataset_id

# infer provider from source
if source.type == DatasetType.rows:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Infering provider in this way is not scalable for additional dataset sources (for example, relational databases) - adding more dataset sources will become more and more challenging/

yanxi0830 added a commit that referenced this pull request Mar 17, 2025
## PR Stack
- #1573
- #1625
- #1656
- #1657
- #1658
- #1659
- #1660

**Client SDK**
- llamastack/llama-stack-client-python#203

**CI**
- https://github.com/meta-llama/llama-stack-ops/actions/runs/13911304888
<img width="1042" alt="image"
src="https://github.com/user-attachments/assets/69636067-376d-436b-9204-896e2dd490ca"
/>
-- the test_rag_agent_with_attachments is flaky and not related to this
PR

## Doc
<img width="789" alt="image"
src="https://github.com/user-attachments/assets/b88390f3-73d6-4483-b09a-a192064e32d9"
/>


## Client Usage
```python
client.datasets.register(
    source={
        "type": "uri",
        "uri": "lsfs://mydata.jsonl",
    },
    schema="jsonl_messages",
    # optional 
    dataset_id="my_first_train_data"
)

# quick prototype debugging
client.datasets.register(
    data_reference={
        "type": "rows",
        "rows": [
                "messages": [...],
        ],
    },
    schema="jsonl_messages",
)
```

## Test Plan
- CI:
https://github.com/meta-llama/llama-stack-ops/actions/runs/13878055453

```
LLAMA_STACK_CONFIG=fireworks pytest -v tests/integration/datasets/test_datasets.py
```

```
LLAMA_STACK_CONFIG=fireworks pytest -v tests/integration/scoring/test_scoring.py
```

```
pytest -v -s --nbval-lax ./docs/notebooks/Llama_Stack_Benchmark_Evals.ipynb
```
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants