To catch a fake: Curbing deceptive Yelp ratings and venues

Abstract

The popularity and influence of reviews, make sites like Yelp ideal targets for malicious behaviors. We present Marco, a novel system that exploits the unique combination of social, spatial and temporal signals gleaned from Yelp, to detect venues whose ratings are impacted by fraudulent reviews. Marco increases the cost and complexity of attacks, by imposing a tradeoff on fraudsters, between their ability to impact venue ratings and their ability to remain undetected. We contribute a new dataset to the community, which consists of both ground truth and gold standard data. We show that Marco significantly outperforms state‐of‐the‐art approaches, by achieving 94% accuracy in classifying reviews as fraudulent or genuine, and 95.8% accuracy in classifying venues as deceptive or legitimate. Marco successfully flagged 244 deceptive venues from our large dataset with 7,435 venues, 270,121 reviews and 195,417 users. Furthermore, we use Marco to evaluate the impact of Yelp events, organized for elite reviewers, on the hosting venues. We collect data from 149 Yelp elite events throughout the US. We show that two weeks after an event, twice as many hosting venues experience a significant rating boost rather than a negative impact.

Document Details

Document Type
Pub Defense Publication
Publication Date
Apr 10, 2015
Source ID
10.1002/sam.11264

Entities

People

  • Bogdan Carbunar
  • Duen Horng (polo) Chau
  • Jaime Ballesteros
  • Mahmudur Rahman

Organizations

  • Florida International University
  • Georgia Tech
  • National Science Foundation
  • United States Department of Defense

Tags

Readers

  • Academic Conference Management
  • Military Logistics and Supply Chain Management
  • Sensor Fusion and Tracking Systems.