Releases: trojblue/unibox
0.5.0
v0.5.0 - 2025-01-04
Features
- adding proper colorized logger (9e0d758 by yada).
- adding basic huggingface upload tools (f412a18 by yada).
- adding huggingface storage backend (4c93076 by yada).
- adding basic test suite and txt loader (bc16177 by yada).
- adding basic working loader and tests (aa65789 by yada).
- adding skeleton loader classes (a1e299f by yada).
Bug Fixes
- huggingface uploading a datasets object; s3 incorrect uri passed in (e5bb8d2 by yada).
- adding colorlog dependency (a746230 by yada).
- adding datasets dependency (071e5cc by yada).
- color control characters getting written to logs (ecf1781 by yada).
- image_loader: properly handling image loaders (cf5e535 by yada).
- double write penalty at ub.saves() (ec07b9b by yada).
- adding convert to rgb when using gallery (0e5826d by yada).
Code Refactoring
uniloader-rewrite-stage3
v0.5.0 - 2025-01-04
Features
- adding proper colorized logger (9e0d758 by yada).
- adding basic huggingface upload tools (f412a18 by yada).
- adding huggingface storage backend (4c93076 by yada).
- adding basic test suite and txt loader (bc16177 by yada).
- adding basic working loader and tests (aa65789 by yada).
- adding skeleton loader classes (a1e299f by yada).
Bug Fixes
- huggingface uploading a datasets object; s3 incorrect uri passed in (e5bb8d2 by yada).
- adding colorlog dependency (a746230 by yada).
- adding datasets dependency (071e5cc by yada).
- color control characters getting written to logs (ecf1781 by yada).
- image_loader: properly handling image loaders (cf5e535 by yada).
- double write penalty at ub.saves() (ec07b9b by yada).
- adding convert to rgb when using gallery (0e5826d by yada).
Code Refactoring
uniloader-rewrite-stage2
tests: ignoring boto3 warnings on python3.12
uniloader-rewrite-stage1
fix: image_loader: properly handling image loaders
v0.4.13: adding ub.label_gallery() tool for data labelling
feat: ub.label_gallery: view and label images within a Jupyter notebook:
import unibox as ub
uris = ["https://cdn.donmai.us/180x180/8e/ea/8eea944690c0c0b27e303420cb1e65bd.jpg"] * 9
labels = ['Image 1', 'Image 2', 'Image 3'] * 3
# label data interactively
ub.label_gallery(uris, labels)
# or: view images only
# ub.gallery(uris, labels)
v0.4.12: allow human-readable date in ub.presigns()
feat:
- allow human-readable date in ub.presigns():
import unibox as ub
uri = "s3://bucket-external/dataset/dataset_qft/moody_qft_danbooru.json"
signed = ub.presigns(uri, expiration="1y") # format: https://github.com/xolox/python-humanfriendly
signed
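To illustrate what a human-readable expiration such as "1y" amounts to, here is a minimal sketch that converts such strings into seconds. The helper name `parse_expiration` and the unit table are hypothetical and are not taken from unibox or humanfriendly; the real parsing rules may differ (for example, in how long a year is assumed to be).

```python
import re

# Hypothetical unit table in seconds; an assumption for illustration only.
_UNIT_SECONDS = {
    "s": 1,
    "m": 60,
    "h": 3600,
    "d": 86400,
    "w": 7 * 86400,
    "y": 365 * 86400,
}


def parse_expiration(text: str) -> int:
    """Convert a string such as '90s', '12h' or '1y' into whole seconds."""
    match = re.fullmatch(r"\s*(\d+)\s*([smhdwy])\s*", text.lower())
    if match is None:
        raise ValueError(f"unrecognized expiration: {text!r}")
    amount, unit = match.groups()
    return int(amount) * _UNIT_SECONDS[unit]


print(parse_expiration("1y"))   # 31536000
print(parse_expiration("12h"))  # 43200
```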
v0.4.11: adding s3 presigning tools
feat:
ub.presigns(s3_uri): presigns an S3 URI to create an accessible URL. Useful for working with S3 URIs in transformers pipelines:
import unibox as ub
from transformers import pipeline
# More models in the model hub.
model_name = "openai/clip-vit-large-patch14"
classifier = pipeline("zero-shot-image-classification", model = model_name, device="cuda")
# s3 uri to url
image_to_classify = 's3://bucket-external/dataset/dataset_qft/qft_v5c_twitter-logfav_9.6_60k/100006176_p0.webp'
image_url = ub.presigns(image_to_classify)
# get results
labels = ["a girl", "a boy"]
scores = classifier(image_url, candidate_labels = labels)
scores
# [{'score': 0.9802619218826294, 'label': 'a girl'},
# {'score': 0.0197380892932415, 'label': 'a boy'}]
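For readers curious what presigning involves, the following is a rough sketch of one way to do it with boto3. It is not the unibox implementation: `split_s3_uri` and `presign_sketch` are hypothetical names, and the one-hour default expiration is an assumption. Only `boto3.client("s3").generate_presigned_url` is an actual boto3 call.

```python
from typing import Tuple


def split_s3_uri(uri: str) -> Tuple[str, str]:
    """Split 's3://bucket/path/to/key' into (bucket, key)."""
    prefix = "s3://"
    if not uri.startswith(prefix):
        raise ValueError(f"not an S3 URI: {uri!r}")
    bucket, _, key = uri[len(prefix):].partition("/")
    if not bucket or not key:
        raise ValueError(f"S3 URI needs both a bucket and a key: {uri!r}")
    return bucket, key


def presign_sketch(uri: str, expires_in: int = 3600) -> str:
    """Return a time-limited HTTPS URL for the object (hypothetical helper)."""
    # Imported lazily so split_s3_uri works without boto3 installed.
    import boto3

    bucket, key = split_s3_uri(uri)
    client = boto3.client("s3")
    return client.generate_presigned_url(
        "get_object",
        Params={"Bucket": bucket, "Key": key},
        ExpiresIn=expires_in,  # lifetime of the URL in seconds
    )


print(split_s3_uri("s3://bucket-external/dataset/example.webp"))
```

Calling `presign_sketch` requires boto3 and valid AWS credentials; the URI-splitting helper runs on its own.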
tweak:
- removing unused methods in s3_client.py
v0.4.10: further ipython import fix
fix:
import unibox: will no longer require IPython.
v0.4.9 IPython import fix
fix:
ub.peeks(): gracefully handles a missing IPython dependency (for example, when using Python 3.8)
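A common pattern for this kind of fix is an optional import with a plain-text fallback. The sketch below shows the general idea only; the helper name `show` is hypothetical and this is not how unibox necessarily implements it.

```python
try:
    # Real IPython API; only available when IPython is installed.
    from IPython.display import display
except ImportError:
    display = None


def show(obj) -> str:
    """Render obj via IPython when available, otherwise fall back to print.

    Returns the name of the backend that was used.
    """
    if display is not None:
        display(obj)
        return "ipython"
    print(obj)
    return "print"


show({"rows": 3, "columns": ["a", "b"]})
```

Because the import failure is caught at module load time, importing the module never raises, whether or not IPython is present.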
v0.4.8 concurrent_loads() order fix
fix:
ub.concurrent_loads(): should now return the loaded files in the same order as the input list
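One way to keep results aligned with the input when tasks finish out of order is to remember each task's index. This is a minimal sketch under assumptions: `load_one` is a hypothetical stand-in loader and `concurrent_loads_sketch` is not the unibox function.

```python
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def load_one(path: str) -> str:
    """Hypothetical loader; sleeps so that tasks complete out of order."""
    time.sleep(0.01 * (len(path) % 3))
    return path.upper()


def concurrent_loads_sketch(paths, max_workers: int = 4):
    """Load all paths concurrently and return results in input order."""
    results = [None] * len(paths)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Map each future back to the position of its input path.
        futures = {pool.submit(load_one, p): i for i, p in enumerate(paths)}
        for future in as_completed(futures):
            results[futures[future]] = future.result()
    return results


print(concurrent_loads_sketch(["a.txt", "bb.txt", "ccc.txt"]))
# ['A.TXT', 'BB.TXT', 'CCC.TXT']
```

Appending results as they complete would instead yield completion order, which is the kind of mismatch this fix addresses.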