Name: solr-slider
Owner: Lucidworks
Description: Solr on YARN via Slider
Created: 2015-05-19 21:24:18.0
Updated: 2016-06-23 18:35:05.0
Pushed: 2015-05-28 15:38:35.0
Homepage: null
Size: 127
Language: Python
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
Solr on YARN - Slider package for deploying SolrCloud to a YARN cluster.
Follow the instructions for getting started with Slider: http://slider.incubator.apache.org/docs/getting_started.html
Be sure to add the $SLIDER_HOME/bin
directory to your path.
Also, make sure your conf/slider-client.xml
file sets the ResourceManager address so you don't have to
include the --manager
parameter with every slider command.
roperty>
<name>yarn.resourcemanager.address</name>
<value>localhost:8032</value>
property>
Throughout these instructions, $PROJECT_HOME
refers to the directory where you cloned this project.
1) Start ZooKeeper 3.4.6+ on your local workstation
Running Solr with embedded ZooKeeper is not supported. Thus, you need to deploy at least one standalone ZooKeeper process, but of course at least 3 servers are required to establish quorum for production environments. Take note of the ZooKeeper connection string as you'll need to include it in the Solr deployment metadata below (see step 5).
2) Download a Solr distribution archive (tgz)
Download the latest Solr distribution from: http://lucene.apache.org/solr/downloads.html
Once downloaded, move the distribution archive to $PROJECT_HOME/package/files/solr.tgz
The distribution archive must be named solr.tgz
as the metainfo.xml
file references this path.
3) Create the solr-on-yarn.zip deployment package
Create the Slider package using zip:
-r solr-on-yarn.zip metainfo.xml package/
4) Install the package on HDFS
er install-package --replacepkg --name solr --package $PROJECT_HOME/solr-on-yarn.zip
5) Configure environment specific settings
Edit the $PROJECT_HOME/appConfig-default.json
. At a minimum, you'll need to update the following settings
to match your environment:
"java_home": "/Library/Java/JavaVirtualMachines/jdk1.7.0_67.jdk/Contents/Home",
"site.global.app_root": "${AGENT_WORK_ROOT}/app/install/solr-5.2.0",
"site.global.zk_host": "localhost:2181",
Review the other settings in this file to verify they are correct for your environment.
6) Configure the number of Solr nodes to deploy
Edit yarn.component.instances
in resources-default.json
to set the number of Solr nodes to deploy across your cluster.
7) Deploy Solr on YARN
er create solr --template $PROJECT_HOME/appConfig-default.json \
resources $PROJECT_HOME/resources-default.json
where $YARN_MANAGER
is the host and port of the YARN resource manager, such as localhost:8032.
8) Navigate to the YARN ResourceManager Web UI
Also, you can get the URL of each Solr instance by consulting the Slider registry (so you can reach the Solr Web Admin UI) by doing:
er registry --name solr --getexp servers
This will also show the ZooKeeper connection string used by each Solr (of course it should be the same for all nodes).
NOTE: The registry command requires using Java 7 as there's a bug in Slider that prevents the registry command from working correctly, see: https://issues.apache.org/jira/browse/SLIDER-878