Skip to content

ahmedbax/shark

This branch is 272 commits behind amplab/shark:master.

Folders and files

NameName
Last commit message
Last commit date
Oct 17, 2013
Aug 27, 2013
May 20, 2013
Oct 14, 2012
Nov 13, 2013
Sep 18, 2013
Nov 24, 2013
Aug 6, 2013
Dec 6, 2012
Aug 28, 2013
Sep 18, 2013
Oct 21, 2013

Repository files navigation

Shark (Hive on Spark)

Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 100 times faster than Hive without modification to the existing data nor queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.

Shark 0.8.0 requires:

  • Scala 2.9.3
  • Hive 0.9
  • Spark 0.8.x

For current documentation, see the Shark Project Wiki