Name: Simba
Owner: InitialDLab
Description: Spatial In-Memory Big data Analytics
Created: 2016-04-18 20:58:15.0
Updated: 2018-05-14 14:35:01.0
Pushed: 2018-03-28 20:06:41.0
Homepage: null
Size: 116350
Language: Scala
Simba is now shipped as a standalone package outside Spark. The current version works with Spark 2.1.x. If you find any issues, please open a ticket in the issue tracking system.
Simba is a distributed in-memory spatial analytics engine based on Apache Spark. It extends the Spark SQL engine across the system stack to support rich spatial queries and analytics through both SQL and the DataFrame API. In addition, Simba introduces native indexing support over RDDs to enable efficient spatial operators. It also extends Spark SQL's query optimizer with spatial-aware and cost-based optimizations to make the best use of existing indexes and statistics.
Simba is open-sourced under the Apache License 2.0. Currently, it is developed based on Spark 1.6.0. For recent updates and further information, please refer to Simba's homepage.
IN RANGE
IN CIRCLERANGE
IN KNN
DISTANCE JOIN
KNN JOIN
Note: We are still cleaning up the source code for some of our features, which will be released to the master and develop branches later.
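To make the operator keywords above concrete, here is a minimal brute-force sketch of what each query computes, written over plain Scala collections. It is not Simba's API and involves no indexing or distribution. It assumes that the keywords denote a box range query, a circle range query, a k-nearest-neighbor query, a distance join, and a kNN join over 2D points with Euclidean distance. The names `SpatialOpsSketch`, `Point`, `boxRange`, `circleRange`, `knn`, `distanceJoin`, and `knnJoin` are hypothetical and exist only for this illustration.

```scala
// Reference semantics only: hypothetical names, brute force, NOT Simba's API.
object SpatialOpsSketch {
  final case class Point(x: Double, y: Double)

  // Euclidean distance between two 2D points (an assumption of this sketch).
  def dist(a: Point, b: Point): Double = math.hypot(a.x - b.x, a.y - b.y)

  // IN RANGE (assumed meaning): points inside the axis-aligned box [low, high].
  def boxRange(ps: Seq[Point], low: Point, high: Point): Seq[Point] =
    ps.filter(p => p.x >= low.x && p.x <= high.x && p.y >= low.y && p.y <= high.y)

  // IN CIRCLERANGE (assumed meaning): points within distance r of center.
  def circleRange(ps: Seq[Point], center: Point, r: Double): Seq[Point] =
    ps.filter(p => dist(p, center) <= r)

  // IN KNN (assumed meaning): the k points closest to the query point q.
  def knn(ps: Seq[Point], q: Point, k: Int): Seq[Point] =
    ps.sortWith((a, b) => dist(a, q) < dist(b, q)).take(k)

  // DISTANCE JOIN (assumed meaning): all pairs (l, r) with dist(l, r) <= radius.
  def distanceJoin(left: Seq[Point], right: Seq[Point], radius: Double): Seq[(Point, Point)] =
    for (l <- left; r <- right if dist(l, r) <= radius) yield (l, r)

  // KNN JOIN (assumed meaning): each left point paired with its k nearest right points.
  def knnJoin(left: Seq[Point], right: Seq[Point], k: Int): Seq[(Point, Point)] =
    left.flatMap(l => knn(right, l, k).map(r => (l, r)))

  def main(args: Array[String]): Unit = {
    val pts = Seq(Point(0.0, 0.0), Point(1.0, 1.0), Point(2.0, 2.0), Point(5.0, 5.0))
    println(boxRange(pts, Point(0.5, 0.5), Point(3.0, 3.0)))
    println(circleRange(pts, Point(0.0, 0.0), 1.5))
    println(knn(pts, Point(2.1, 2.1), 2))
    println(distanceJoin(pts.take(1), pts, 1.5))
    println(knnJoin(pts.take(2), pts, 1))
  }
}
```

Each function scans every point (or every pair of points), so the cost grows linearly for the single-relation queries and quadratically for the joins. That cost is the motivation for the native indexing and optimizer support described earlier.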
org.apache.spark.examples package for testing or debugging.
standalone branches are opened for maintaining the Simba standalone package, which aims at building Simba packages that stand outside the Spark SQL core. Currently, the master branch and the develop branch are built on top of Spark 2.1.x.
The master branch provides the latest stable version, while the develop branch is the main development branch, where new features are merged before they are ready for release. For legacy reasons, we also keep branches named simba-spark-x.x, which archive old versions of Simba built on earlier Spark versions. Note that we will only integrate the latest features into the master and develop branches. Please make sure you check out the correct branch before you start using it.