S3 guide

Unibox uses boto3 under the hood, so if boto3 can access your bucket with your current credentials, unibox can too.
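Before debugging unibox itself, it can help to confirm that boto3 reaches the bucket. The following is a minimal sketch, not part of unibox: `split_s3_uri` and `can_reach_bucket` are hypothetical helper names, and the check relies on boto3's `head_bucket` call, which raises an error when the bucket is missing or not accessible.

```python
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split 's3://bucket/key' into (bucket, key)."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


def can_reach_bucket(uri, client=None):
    """Return True if head_bucket succeeds for the bucket named in `uri`."""
    if client is None:
        # Imported lazily so the helper can also be exercised with a stub client.
        import boto3
        client = boto3.client("s3")
    bucket, _ = split_s3_uri(uri)
    try:
        client.head_bucket(Bucket=bucket)
        return True
    except Exception:
        # Broad on purpose: missing credentials, 403, and 404 all mean "not reachable".
        return False
```

Calling `can_reach_bucket("s3://my-bucket/data")` with no client uses whatever credentials boto3 resolves by default. Note that a successful `head_bucket` does not guarantee permission to read or write every object in the bucket.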

Load files from S3

import unibox as ub

# CSV to DataFrame
sales = ub.loads("s3://my-bucket/data/sales.csv")

# Parquet to DataFrame
events = ub.loads("s3://my-bucket/data/events.parquet")

# JSON to dict or list
cfg = ub.loads("s3://my-bucket/configs/app.json")
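Whether a JSON file comes back as a dict or a list depends on its top-level value. The sketch below illustrates this with the standard library's `json` module only; it does not call unibox, and it assumes unibox follows the usual JSON-to-Python mapping.

```python
import json

# A top-level JSON object parses to a dict.
as_dict = json.loads('{"debug": true, "workers": 4}')

# A top-level JSON array parses to a list.
as_list = json.loads('[{"id": 1}, {"id": 2}]')

print(type(as_dict).__name__, type(as_list).__name__)
```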

Save files to S3

import unibox as ub

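# `sales` and `cfg` are the objects loaded in the previous section.
# The output format follows the file extension of the target path.
ub.saves(sales, "s3://my-bucket/data/sales_clean.parquet")
ub.saves(cfg, "s3://my-bucket/configs/app_clean.json")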

List objects

import unibox as ub

# List only parquet files
files = ub.ls("s3://my-bucket/data", exts=[".parquet"])
print(files[:3])
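To make the effect of `exts` concrete, here is a rough local equivalent applied to a list of keys. This is an illustration under assumptions, not unibox's implementation: `filter_by_exts` is a hypothetical helper, and the case-insensitive suffix match is a guess at reasonable behavior that may differ from what `ub.ls` actually does.

```python
def filter_by_exts(keys, exts):
    """Keep only the keys whose name ends with one of the given extensions."""
    wanted = tuple(ext.lower() for ext in exts)
    # str.endswith accepts a tuple, so one call checks every extension.
    return [key for key in keys if key.lower().endswith(wanted)]


keys = [
    "s3://my-bucket/data/events.parquet",
    "s3://my-bucket/data/sales.csv",
    "s3://my-bucket/data/_SUCCESS",
]
parquet_only = filter_by_exts(keys, [".parquet"])
```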

Tips

Tip

Pass exts to ub.ls to return only the file types you need when a prefix holds many objects.

Warning

If you see AccessDenied errors, check that your IAM permissions and the bucket policy allow the operation you are attempting (reading, writing, or listing).
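When an S3 call fails, botocore's `ClientError` exposes the service error code under `exc.response["Error"]["Code"]`. The sketch below turns that code into a short hint. `explain_s3_error` is a hypothetical helper, and it is an assumption that unibox lets the underlying boto3 exception propagate unchanged.

```python
def explain_s3_error(exc):
    """Return a short hint for a botocore-style ClientError."""
    # ClientError stores the service error under response["Error"];
    # other exceptions have no such attribute and fall through to "unknown".
    error = getattr(exc, "response", {}).get("Error", {})
    code = str(error.get("Code", ""))
    if code in ("AccessDenied", "403"):
        return "access denied: check IAM permissions and the bucket policy"
    if code in ("NoSuchBucket", "NoSuchKey", "404"):
        return "not found: check the bucket name and key"
    return f"unhandled error code: {code or 'unknown'}"
```

A typical use is to wrap a load in `try`/`except Exception as exc` and print `explain_s3_error(exc)` before re-raising, so the original traceback is still available.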

Next steps