1
0
mirror of https://github.com/gluster/glusterdocs.git synced 2026-02-06 00:48:24 +01:00
Files

122 lines
6.6 KiB
Markdown
Raw Permalink Normal View History

### Purpose
This document will get you up to speed with some hands-on experience
with Gluster by guiding you through the steps of setting it up for the
first time. If you are looking to get right into things, you are in the
right place. If you want just the bare minimum steps, see the [Quick Start Guide](../Quick-Start-Guide/Quickstart.md).
If you want some in-depth information on each of the steps, you are in the right place.
Both the guides will get you
to a working Gluster cluster, so it depends on you how much time you
want to spend. The [Quick Start Guide](../Quick-Start-Guide/Quickstart.md) should have you up and running in ten minutes
2017-10-31 10:26:37 +05:30
or less. This guide can easily be done in a lunch break, and still, gives
you time to have a quick bite to eat. The Getting Started guide can be
done easily in a few hours, depending on how much testing you want to
do.
After you deploy Gluster by following these steps, we recommend that
2016-05-03 17:35:43 +05:30
you read the [Gluster Admin Guide](../Administrator Guide/) to learn how to administer Gluster and
how to select a volume type that fits your needs. Also, be sure to
enlist the help of the Gluster community via the IRC channel or Q&A
2017-10-31 10:26:37 +05:30
section. We want you to be successful in as short a time as possible.
2017-06-08 08:45:09 -04:00
2017-09-13 09:11:48 -04:00
### Overview
Before we begin, lets talk about what Gluster is, dispel a few myths
and misconceptions, and define a few terms. This will help you to avoid
some of the common issues that others encounter most frequently.
#### What is Gluster
2017-10-31 10:26:37 +05:30
Gluster is a distributed scale-out filesystem that allows rapid
provisioning of additional storage based on your storage consumption
needs. It incorporates automatic failover as a primary feature. All of
this is accomplished without a centralized metadata server.
#### What is Gluster without making me learn an extra glossary of terminology?
- Gluster is an easy way to provision your own storage backend NAS
using almost any hardware you choose.
- You can add as much as you want to start with, and if you need more
later, adding more takes just a few steps.
- You can configure failover automatically, so that if a server goes
down, you dont lose access to the data. No manual steps are
required for failover. When you fix the server that failed and bring
it back online, you dont have to do anything to get the data back
2017-10-31 10:26:37 +05:30
except wait. In the meantime, the most current copy of your data
keeps getting served from the node that was still running.
2017-06-08 08:45:09 -04:00
- You can build a clustered filesystem in a matter of minutes… it is
trivially easy for basic setups
- It takes advantage of what we refer to as “commodity hardware”,
which means, we run on just about any hardware you can think of,
from that stack of decomms and gigabit switches in the corner no
one can figure out what to do with (how many license servers do you
really need, after all?), to that dream array you were speccing out
online. Dont worry, I wont tell your boss.
- It takes advantage of commodity software too. No need to mess with
kernels or fine tune the OS to a tee. We run on top of most unix
filesystems, with XFS and ext4 being the most popular choices. We do
have some recommendations for more heavily utilized arrays, but
these are simple to implement and you probably have some of these
configured already anyway.
- Gluster data can be accessed from just about anywhere You can use
traditional NFS, SMB/CIFS for Windows clients, or our own native
GlusterFS (a few additional packages are needed on the client
machines for this, but as you will see, they are quite small).
- There are even more advanced features than this, but for now we will
focus on the basics.
2017-10-31 10:26:37 +05:30
- Its not just a toy. Gluster is enterprise-ready, and commercial
support is available if you need it. It is used in some of the most
taxing environments like media serving, natural resource
exploration, medical imaging, and even as a filesystem for Big Data.
#### Is Gluster going to work for me and what I need it to do?
Most likely, yes. People use Gluster for all sorts of things. You are
encouraged to ask around in our IRC channel or Q&A forums to see if
anyone has tried something similar. That being said, there are a few
places where Gluster is going to need more consideration than others. -
Accessing Gluster from SMB/CIFS is often going to be slow by most
peoples standards. If you only moderate access by users, then it most
likely wont be an issue for you. On the other hand, adding enough
Gluster servers into the mix, some people have seen better performance
with us than other solutions due to the scale out nature of the
technology - Gluster does not support so called “structured data”,
meaning live, SQL databases. Of course, using Gluster to backup and
restore the database would be fine - Gluster is traditionally better
when using file sizes at of least 16KB (with a sweet spot around 128KB
or so).
#### What is the cost and complexity required to set up cluster?
Question: How many billions of dollars is it going to cost to setup a cluster?
Dont I need redundant networking, super fast SSDs,
technology from Alpha Centauri delivered by men in black, etc…?
I have never seen anyone spend even close to a billion, unless they got
the rust proof coating on the servers. You dont seem like the type that
2017-06-08 08:45:09 -04:00
would get bamboozled like that, so have no fear. For the purpose of this
tutorial, if your laptop can run two VMs with 1GB of memory each, you
can get started testing and the only thing you are going to pay for is
coffee (assuming the coffee shop doesnt make you pay them back for the
electricity to power your laptop).
If you want to test on bare metal, since Gluster is built with commodity
hardware in mind, and because there is no centralized meta-data server,
a very simple cluster can be deployed with two basic servers (2 CPUs,
4GB of RAM each, 1 Gigabit network). This is sufficient to have a nice
file share or a place to put some nightly backups. Gluster is deployed
successfully on all kinds of disks, from the lowliest 5200 RPM SATA to
mightiest 1.21 gigawatt SSDs. The more performance you need, the more
consideration you will want to put into how much hardware to buy, but
the great thing about Gluster is that you can start small, and add on as
your needs grow.
#### OK, but if I add servers on later, dont they have to be exactly the same?
In a perfect world, sure. Having the hardware be the same means less
troubleshooting when the fires start popping up. But plenty of people
deploy Gluster on mix and match hardware, and successfully.
Get started by checking some [Common Criteria](./Common_criteria.md)