You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
I'm always frustrated when I try to use the SAR model with userCol and itemCol as string types. Currently, the SAR model only accepts these columns as integer types, which requires additional data preprocessing steps to convert string IDs to integers. This limitation can be cumbersome and time-consuming, especially when dealing with large datasets where user and item IDs are naturally represented as strings.
Describe the solution you'd like
I would like the SAR model to support userCol and itemCol as string types. This would allow for more flexibility and ease of use, as many real-world datasets use string identifiers for users and items. By supporting string types, the SAR model would eliminate the need for additional preprocessing steps, making it more user-friendly and efficient.
Additional context
Add any other context or screenshots about the feature request here.
Example Code
Here is an example of how the feature could be used if implemented:
Fixes#2275
Add support for `userCol` and `itemCol` as string types in the SAR model.
* **Python Files:**
- Add `core/src/main/python/synapse/ml/recommendation/SAR.py` to handle string `userCol` and `itemCol`.
- Modify `core/src/main/python/synapse/ml/recommendation/SARModel.py` to handle string `userCol` and `itemCol` in the `recommendForUserSubset` function.
* **Scala Files:**
- Modify `core/src/main/scala/com/microsoft/azure/synapse/ml/recommendation/SAR.scala` to handle string `userCol` and `itemCol` in the `calculateUserItemAffinities` and `calculateItemItemSimilarity` functions.
- Modify `core/src/main/scala/com/microsoft/azure/synapse/ml/recommendation/SARModel.scala` to handle string `userCol` and `itemCol`.
* **Tests:**
- Update `core/src/test/python/synapsemltest/recommendation/test_ranking.py` to include tests for string `userCol` and `itemCol`.
- Update `core/src/test/scala/com/microsoft/azure/synapse/ml/recommendation/SARSpec.scala` to include tests for string `userCol` and `itemCol`.
* **Documentation:**
- Update `docs/Quick Examples/estimators/core/_Recommendation.md` to include examples with string `userCol` and `itemCol`.
---
For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/microsoft/SynapseML/issues/2275?shareId=XXXX-XXXX-XXXX-XXXX).
Is your feature request related to a problem? Please describe.
I'm always frustrated when I try to use the SAR model with
userCol
anditemCol
as string types. Currently, the SAR model only accepts these columns as integer types, which requires additional data preprocessing steps to convert string IDs to integers. This limitation can be cumbersome and time-consuming, especially when dealing with large datasets where user and item IDs are naturally represented as strings.Describe the solution you'd like
I would like the SAR model to support
userCol
anditemCol
as string types. This would allow for more flexibility and ease of use, as many real-world datasets use string identifiers for users and items. By supporting string types, the SAR model would eliminate the need for additional preprocessing steps, making it more user-friendly and efficient.Additional context
Add any other context or screenshots about the feature request here.
Example Code
Here is an example of how the feature could be used if implemented:
The text was updated successfully, but these errors were encountered: