Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SmallPond on S3 Object Store Integration #10

Open
soumilshah1995 opened this issue Feb 28, 2025 · 4 comments
Open

SmallPond on S3 Object Store Integration #10

soumilshah1995 opened this issue Feb 28, 2025 · 4 comments

Comments

@soumilshah1995
Copy link

SmallPond on S3 Object Store Integration

I’ve been experimenting with smallpond locally, and it works great for processing large datasets. I was able to read a Parquet file, run some SQL queries, and successfully convert the results to a pandas DataFrame. Now, I’m curious about the configuration or settings required to make it work with an S3 object store.

What are the necessary configurations to connect SmallPond to an S3 bucket, and do I need any specific credentials or access settings to enable seamless integration? Any guidance or best practices would be greatly appreciated.

@mike-luabase
Copy link

mike-luabase commented Mar 2, 2025

here's a fork with s3 support https://github.com/definite-app/smallpond. I don't think I'll have time to test and contribute this upstream as it doesn't seem smallpond is going to work for my use-case, but feel free to use it!

@soumilshah1995
Copy link
Author

@mike-luabase do you have any examples or any configuration we need to set or simply we can pass S3 URI ?

@mike-luabase
Copy link

mike-luabase commented Mar 2, 2025

@mike-luabase do you have any examples or any configuration we need to set or simply we can pass S3 URI ?

take a look at test_s3.py

@soumilshah1995
Copy link
Author

Works tried your fork thanks a lot

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants