S3 Sync
Rewind your S3 buckets for guaranteed reproducibility
S3 is the industry standard for fast, flexible storage. If you’ve ever collaborated on S3, though, you know that it’s all too easy to overwrite or delete files when iterating on pipelines. S3 Sync lets you schedule regular backups of your S3 bucket to XetHub, preserving provenance with no changes to your existing workflow.
How it works
Create a XetHub repo from your S3 bucket, and watch the snapshots come in. Easily rewind to any previous version of your bucket.
Why XetHub?
Create and sync repos directly from an S3 bucket
With just a URL, you can create a repo directly from an S3 bucket and set up syncing as desired. Because XetHub is built on Git, every update is stored as a commit.
Efficiently store version history
XetHub intelligently reduces the storage needed for each version of your content so you never have to worry about ballooning repo sizes again. Store what you need without compromises.
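To get a feel for why storing many versions doesn't have to multiply storage, here is a minimal sketch of chunk-level deduplication. It is an illustration only and an assumption on our part, not a description of XetHub's actual implementation: the `ChunkStore` class, its fixed-size chunking, and the tiny `chunk_size` are all hypothetical and chosen for readability.

```python
import hashlib


class ChunkStore:
    """Toy content-addressed store (hypothetical, for illustration only)."""

    def __init__(self, chunk_size: int = 4) -> None:
        self.chunk_size = chunk_size
        self.chunks: dict[str, bytes] = {}  # SHA-256 digest -> chunk bytes

    def put(self, data: bytes) -> list[str]:
        """Store one version and return its manifest (ordered chunk digests)."""
        manifest = []
        for start in range(0, len(data), self.chunk_size):
            chunk = data[start:start + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # A chunk already seen in an earlier version is not stored again.
            self.chunks.setdefault(digest, chunk)
            manifest.append(digest)
        return manifest

    def get(self, manifest: list[str]) -> bytes:
        """Rebuild a version from its manifest."""
        return b"".join(self.chunks[digest] for digest in manifest)

    def stored_bytes(self) -> int:
        """Total bytes actually kept across all versions."""
        return sum(len(chunk) for chunk in self.chunks.values())


store = ChunkStore(chunk_size=4)
v1 = store.put(b"aaaabbbbcccc")  # first snapshot: three new chunks
v2 = store.put(b"aaaabbbbdddd")  # second snapshot: only the last chunk is new
```

Both versions can still be rebuilt in full from their manifests, yet the store holds 16 bytes rather than 24, because the two unchanged chunks are shared between snapshots.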
Visualize your data
Don't be stuck simply looking at rows of data or line-by-line diffs. XetHub allows you to visually understand your data at each point in time, making it easy to compare what changed.