From b0bae80b526043f61773f8288c3510836c72fca1 Mon Sep 17 00:00:00 2001
From: TimmyCarbone
Date: Fri, 9 Feb 2024 09:29:36 -0500
Subject: [PATCH 1/2] Update to 1.2.2

---
 .gitignore   |  1 +
 CHANGELOG.md | 17 +++++++++++++++++
 README.md    |  6 +++---
 3 files changed, 21 insertions(+), 3 deletions(-)

diff --git a/.gitignore b/.gitignore
index ef2362d..2676dc9 100644
--- a/.gitignore
+++ b/.gitignore
@@ -2,3 +2,4 @@
 full/
 lite/
 release/
+draft/
\ No newline at end of file
diff --git a/CHANGELOG.md b/CHANGELOG.md
index 9e109ed..7d686aa 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,5 +1,22 @@
 # CHANGELOG.md
 
+## 1.2.2 (2024-02-06)
+
+**Data:**
+
+ - Added 600k+ photos to the dataset (submitted up to `2024-01-01`)
+ - Removed unavailable photos (that are removed from the platform)
+ - The `conversions` period is now from `2023-01-01` to `2024-01-01`
+
+**Lite dataset link:**
+
+ - Version link: [Version 1.2.2](https://unsplash.com/data/lite/1.2.2)
+
+**Integrity checks (SHA-256):**
+
+ - Lite: `c187e0f24320bb32dc2acd3c387ca57c06cf5f52369816df4ab27f3cdf30e8de`
+ - Full: ``
+
 ## 1.2.1 (2023-05-02)
 
 **Fix:**
diff --git a/README.md b/README.md
index 41e1854..eb6368b 100644
--- a/README.md
+++ b/README.md
@@ -2,12 +2,12 @@
 
 ![](https://unsplash.com/blog/content/images/2020/08/dataheader.jpg)
 
-The Unsplash Dataset is made up of over 330,000+ contributing global photographers and data sourced from hundreds of millions of searches across a nearly unlimited number of uses and contexts. Due to the breadth of intent and semantics contained within the Unsplash dataset, it enables new opportunities for research and learning.
+The Unsplash Dataset is made up of over 350,000+ contributing global photographers and data sourced from hundreds of millions of searches across a nearly unlimited number of uses and contexts. Due to the breadth of intent and semantics contained within the Unsplash dataset, it enables new opportunities for research and learning.
 
 The Unsplash Dataset is offered in two datasets:
 
 - the Lite dataset: available for commercial and noncommercial usage, containing 25k nature-themed Unsplash photos, 25k keywords, and 1M searches
-- the Full dataset: available for noncommercial usage, containing 4.8M+ high-quality Unsplash photos, 5M keywords, and over 250M searches
+- the Full dataset: available for noncommercial usage, containing 5.4M+ high-quality Unsplash photos, 5M keywords, and over 250M searches
 
 As the Unsplash library continues to grow, we’ll release updates to the dataset with new fields and new images, with each subsequent release being [semantically versioned](https://semver.org/).
 
@@ -45,7 +45,7 @@ We're making this data open and available with the hopes of enabling researchers
 
 We'd love to see what you create, whether that's a research paper, a machine learning model, a blog post, or just an interesting discovery in the data. Send us an email at [data@unsplash.com](mailto:data@unsplash.com).
 
-If you're using the dataset in a research paper, you can attribute the dataset as `Unsplash Lite Dataset 1.2.1` or `Unsplash Full Dataset 1.2.1` and link to the permalink [`unsplash.com/data`](https://unsplash.com/data).
+If you're using the dataset in a research paper, you can attribute the dataset as `Unsplash Lite Dataset 1.2.2` or `Unsplash Full Dataset 1.2.2` and link to the permalink [`unsplash.com/data`](https://unsplash.com/data).
 
 ----
 

From 1d33bb5b6e9a8f7b685bdf01ec21f91c7467932c Mon Sep 17 00:00:00 2001
From: TimmyCarbone
Date: Fri, 9 Feb 2024 09:55:22 -0500
Subject: [PATCH 2/2] Add integrity check for Full dataset

---
 CHANGELOG.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/CHANGELOG.md b/CHANGELOG.md
index 7d686aa..c58ee53 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,6 +1,6 @@
 # CHANGELOG.md
 
-## 1.2.2 (2024-02-06)
+## 1.2.2 (2024-02-09)
 
 **Data:**
 
@@ -15,7 +15,7 @@
 **Integrity checks (SHA-256):**
 
  - Lite: `c187e0f24320bb32dc2acd3c387ca57c06cf5f52369816df4ab27f3cdf30e8de`
- - Full: ``
+ - Full: `7783f029dc12eb54289bbb7be98acaaa585dad9ddcec44641213eed94119918c`
 
 ## 1.2.1 (2023-05-02)
 
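
Reviewer note: the SHA-256 values added to the CHANGELOG are meant to be compared against a downloaded archive. A minimal sketch of that check follows, assuming the archive has already been saved locally. The two digests are copied from the patch; the function names, the `lite`/`full` keys, and the example file name are hypothetical and not part of the repository.

```python
import hashlib
from pathlib import Path

# Digests copied from the 1.2.2 CHANGELOG entry in this patch series.
# The dictionary keys are illustrative labels, not names used by the repository.
EXPECTED_SHA256 = {
    "lite": "c187e0f24320bb32dc2acd3c387ca57c06cf5f52369816df4ab27f3cdf30e8de",
    "full": "7783f029dc12eb54289bbb7be98acaaa585dad9ddcec44641213eed94119918c",
}


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, variant):
    """Return True if the file's digest matches the published one for `variant`."""
    return sha256_of_file(path) == EXPECTED_SHA256[variant]
```

Usage would look like `verify_archive("unsplash-lite.zip", "lite")`, where the file name is a placeholder for wherever the download was saved. Reading in chunks keeps memory use flat, which matters for the Full archive.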