************* Cluster Setup ************* Fully Distributed Mode ====================== A fully distributed mode enables a Tajo instance to run on `Hadoop Distributed File System (HDFS) `_. In this mode, a number of Tajo workers run across a number of the physical nodes where HDFS data nodes run. In this section, we explain how to setup the cluster mode. Settings -------- Please add the following configs to tajo-site.xml file: .. code-block:: xml tajo.rootdir hdfs://nameservice/tajo tajo.master.umbilical-rpc.address hostname:26001 TajoMaster binding address between master and workers. tajo.master.client-rpc.address hostname:26002 TajoMaster binding address between master and remote clients. Workers ------- The file ``conf/workers`` lists all host names of workers, one per line. By default, this file contains the single entry ``localhost``. You can easily add host names of workers via your favorite text editor. For example: :: $ cat > conf/workers host1.domain.com host2.domain.com .... Make base directories and set permissions ----------------------------------------- If you want to know Tajo’s configuration in more detail, see Configuration page. Before launching the tajo, you should create the tajo root dir and set the permission as follows: :: $ $HADOOP_HOME/bin/hadoop fs -mkdir /tajo $ $HADOOP_HOME/bin/hadoop fs -chmod g+w /tajo Launch a Tajo cluster --------------------- Then, execute ``start-tajo.sh`` :: $ $TAJO_HOME/bin/start-tajo.sh .. note:: By default, each worker is set to very little resource capacity. In order to increase parallel degree, please read :doc:`/configuration/worker_configuration`. .. note:: By default, TajoMaster listens on localhost/127.0.0.1 for clients. To allow remote clients to access TajoMaster, please set tajo.master.client-rpc.address config to tajo-site.xml. In order to know how to change the listen port, please refer :doc:`/configuration/service_config_defaults`.