• Home
  • Categories
  • Tags
  • Pricing
  • Submit
  1. Home
  2. Themed Directories
  3. Awesome Data - Yelp Dataset Challenge

Awesome Data - Yelp Dataset Challenge

An entry in the Awesome Data Project’s meta-collection that catalogs the Yelp Dataset Challenge, a public subset of Yelp’s business, review, and user data frequently used in data science and machine learning research. It serves as a curated pointer within the Awesome-style directory system to this specific data challenge resource.

🌐Visit Website

About this tool

Awesome Data – Yelp Dataset Challenge

Curated entry in the Awesome Data collection pointing to the Yelp Open Dataset, a large, real-world dataset commonly used for data science and machine learning projects involving local businesses, reviews, and user-generated content.

  • Source: Yelp Open Dataset
  • Brand: Yelp
  • Category: Themed directories
  • Tags: datasets, machine-learning, business

Features

Dataset scope

  • Educational / research focus: Intended for educational and research use (e.g., data science, machine learning, information retrieval, recommendation systems).
  • Real-world business data: Captures real Yelp business and review activity.

Contents & scale

  • Reviews: 6,990,280 reviews
  • Businesses: 150,346 businesses
  • Geographical coverage: 11 metropolitan areas
  • Photos / pictures: 200,100 pictures
  • Attributes & metadata (via JSON files), including for example:
    • Business hours
    • Parking availability
    • Ambience and similar business attributes
    • Check-ins

File structure & formats

  • Main JSON bundle (business/review data):
    • 1 compressed TAR archive (≈ 4.35 GB)
    • Uncompressed contents (≈ 8.65 GB):
      • 1 PDF (documentation)
      • 5 JSON files (core dataset files)
      • Documentation included
  • Photos bundle:
    • 1 compressed TAR archive (≈ 7.45 GB)
    • Uncompressed contents (≈ 7.11 GB):
      • 1 JSON file
      • 1 text file
      • 1 PDF (documentation)
      • 1 folder containing ≈ 200,000 photos
      • Documentation included

Access & downloads

  • Data download (JSON): Download JSON
  • Photos download: Download photos

Use Cases

  • Academic coursework and projects (data mining, statistics, ML)
  • Research on recommendations, NLP, sentiment analysis, and ranking
  • Analysis of local business ecosystems and user behavior

Pricing

  • Presented as an open, educational dataset; no pricing information is provided in the source content.
Surveys

Loading more......

Information

Websitewww.yelp.com
PublishedDec 30, 2025

Categories

1 Item
Themed Directories

Tags

3 Items
#datasets
#machine-learning
#business

Similar Products

3 result(s)
3.5B Web Pages from CommonCrawl 2012

Large-scale web crawl dataset containing 3.5 billion web pages from CommonCrawl (2012), suitable for web mining, search, and network analysis research. Listed as part of an awesome-style collection of computer networks datasets.

30 Seconds of Code

An Awesome-style collection of short, easy-to-understand JavaScript code snippets you can grasp in 30 seconds.

50projects50days

A GitHub repository by Brad Traversy containing 50+ small, focused web development mini projects built with HTML, CSS, and JavaScript, useful as a curated collection of example projects for learning or referencing in awesome-style directories.

Built with
Ever Works
Ever Works

Connect with us

Stay Updated

Get the latest updates and exclusive content delivered to your inbox.

Product

  • Categories
  • Tags
  • Pricing
  • Help

Clients

  • Sign In
  • Register
  • Forgot password?

Company

  • About Us
  • Admin
  • Sitemap

Resources

  • Blog
  • Submit
  • API Documentation
All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
Copyright © 2025 Ever. All rights reserved.·Terms of Service·Privacy Policy·Cookies