vSAN

A quick reference to vSAN content

Books

Posts

 

118 Replies to “vSAN”

  1. Can you give us some detail on calculating Disk Yield? If I have 3 nodes with 1TB each, will I see 3TB of storage? Does a VM that uses 50GB of storage take up 50GB, 100GB, or 150GB?

    1. There should be a sizing guide going live shortly, but all magnetic disks across all the hosts will contribute to the size of the VSAN datastore. The SSDs (or flash devices) do not contribute to capacity. So if you have 1TB of magnetic disk in each of 3 nodes, your VSAN datastore will be 3TB.

      The amount of disk consumed by your VM is based primarily on the failures to tolerate (FTT) setting in the VM Storage Policy. An FTT of 1 implies 2 replicas/mirrors of the VMDK. Therefore a 50GB VMDK created on a VM with an FTT=1 will consume 100GB. A 50GB VMDK created on a VM with an FTT=2 will make 3 replicas/mirrors and therefore consumes 150GB. Hope that makes sense. Lots of documentation coming around this.
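
      If it helps, here is a rough back-of-the-envelope sketch of that arithmetic (illustrative only; it ignores witness components, metadata overhead and thin provisioning):

        # Rough sketch of VSAN datastore capacity and per-VMDK consumption.
        # Assumptions: only magnetic (capacity) disks contribute capacity,
        # FTT=n means n+1 full replicas, witness/metadata overhead ignored.

        def vsan_datastore_tb(nodes, magnetic_tb_per_node):
            # SSDs/flash devices are cache only and add nothing to capacity.
            return nodes * magnetic_tb_per_node

        def consumed_gb(vmdk_gb, ftt):
            # FTT=1 -> 2 replicas, FTT=2 -> 3 replicas, and so on.
            return vmdk_gb * (ftt + 1)

        print(vsan_datastore_tb(3, 1))  # 3 nodes x 1TB magnetic disk each = 3 (TB)
        print(consumed_gb(50, 1))       # 50GB VMDK, FTT=1 -> 100 (GB)
        print(consumed_gb(50, 2))       # 50GB VMDK, FTT=2 -> 150 (GB)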

  2. Hi Cormac,

    I need to understand the “Note” in the VSAN Part 9 topic:

    On the vSphere HA interop:

    ….”Note however that if VSAN hosts also have access to shared storage, either VMFS or NFS, then these datastores may still be used for vSphere HA heartbeats”

    Questions:
    If, for example, all the VSAN hosts also have shared VMFS datastore(s) (say on an FC SAN), can I then have TWO kinds of HA protection: if the VM is located on the VSAN datastore it gets VSAN HA protection, and if the VM is located on the shared VMFS datastore it gets traditional HA protection?

    Thanks

  3. Just to clarify on the whole disk consumption based on the FTT setting…going back to your example of an FTT=1 for a 50GB VM….

    Are you saying that it will consume an additional 100GB of space due to the 2 replicas created?…or are you saying that the original VM (VMDK) that is created is counted as one of those replicas?

    [quote]
    “therefore a 50GB VMDK created on a VM with an FTT=1 will consume 100GB”

    In the interest of being completely clear, would it be better to say

    “will consume an extra 100GB in addition to the 50GB VM (VMDK)”?

    I’ve spent countless days researching this over the past ~6 months or so, but every time I hear that, it throws off my understanding of how FTT drives disk consumption.

    Thank you in advance for your time, if you choose to respond.

    *I read your book BTW, you and Duncan Epping are rockstars in the world of virtualization….really good read. Couldn’t have asked for more.

    -JamesM

    1. It means that 2 x 50GB replicas are created for that VMDK James, meaning 100GB in total is consumed on the VSAN datastore (not an additional 100GB). Note however that VMDKs are created as thin provisioned on the VSAN datastore, so it won’t consume all of that space immediately, but over time.

      Thanks for the kind words on the book – always nice to hear feedback like that.

      1. Thanks for the reply and clarification…so to make sure I get this right, there will be a single VMDK for the actual VM running in the environment BUT since VSAN is in use, if your FTT=1, then 100GB will be consumed by the 2 replicas that are created (over time with thin provisioning).

        I think my confusion is in the semantics of how everyone explains it.

        1. Yep – you got it. A single 50GB VMDK, made up of two 50GB mirrors/replicas, each replica sitting on a different disk (and host) but the same datastore, and eventually consuming 100GB in total on the VSAN datastore.

  4. I have a question for you regarding Part 13 in which you refer to “the VM swap file” and the “swap object”. How does the vmx-*.vswp file fit into all this? This file was introduced in 5.0. Does this file belong in the swap object? Is there a second swap object for it? Or does it simply belong to the VM namespace object?

    1. Yes – this is what we are referring to. This is now instantiated as its own object on the VSAN datastore, and does not consume space in the VM namespace object.

  5. Hi Cormac,
    A question about the “Virtual SAN 6.0 Design and Sizing Guide”. On page 46 it states ‘For hybrid configurations, this setting defines how much read flash capacity should be reserved for a storage object. It is specified as a percentage of the logical size of the virtual machine disk object.’ So a percentage of the logical size (used storage). The example on page 47 takes the flash read cache reservation as a percentage of the physical space (allocated storage). Which is correct?

    Thanks.

    1. These statements are meant to reflect the same thing Stevin. When I say that it is a “percentage of the logical size”, this is not the same as “used storage”.

      All VMDKs on VSAN are thin by default. They can be pre-allocated (made thick) through the use of the Object Space Reservation capability.

      However, whether you use that or not, you request a VMDK size during provisioning, e.g. 40GB. Now you may only use a portion of this, e.g. 20GB, as it is thin provisioned.

      But Read Cache is based on a % of the requested size (logical size/allocated storage), so 40GB. Hopefully that makes sense.
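
      As a quick illustration (a minimal sketch, assuming the reservation is simply a straight percentage of the provisioned size; replicas and any rounding are ignored):

        # Flash Read Cache Reservation is a percentage of the logical
        # (provisioned) size of the VMDK, not of the space actually written
        # to the thin-provisioned disk. Illustrative sketch only.

        def read_cache_reservation_gb(provisioned_gb, reservation_pct):
            return provisioned_gb * reservation_pct / 100.0

        # 40GB VMDK, only 20GB written so far, 10% read cache reservation:
        print(read_cache_reservation_gb(40, 10))  # 4.0 GB, based on the 40GB logical size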

      Cormac

  6. Hi Cormac,
    regarding your book Essential VSAN, excellent book btw. The book states: “In the initial version of VSAN, there is no proportional share mechanism for this resource when multiple VMs are consuming read cache, so every VM consuming read cache will share it equally.” How must I read this? Will the total flash read cache size be divided by the number of VMs consuming VSAN storage, and that is the amount of flash read cache each VM gets? (This would be a problem for read-intensive VMs with more storage than average.)
    What about the write cache? Every write has to go through the write cache, I presume? How is write cache shared between VMs?

    thanks again.

    1. Hi Cormac

      I would be very interested to know about read and write cache allocation to VMs when the reservation is set to 0 for VSAN 6.2.

      If I copy a large file from the C: to the D: drive in my Windows VM, I see very poor transfer rates by comparison to the same copy on a PC (less than half the speed). The transfer rate drops to zero for up to 7 seconds at times during the transfer. It’s almost like its cache allocation has filled up and it’s waiting for destaging to complete.

      Thanks

      1. Hi Karl,

        It is extremely difficult to figure this out without getting logs, etc. I would recommend opening a call with support.
        However, there were some significant bug fixes in the most recent patch – VMware ESXi 6.0, Patch Release ESXi600-201611001 (VMware ESXi 6.0 Patch 04). Are you running this?

  7. Hi Cormac
    A question about the ratio of SSD to HDD numbers. What’s the best number for the ratio? From a system-level view, if there is only one HDD I believe performance will not be good (your data will be gated by a single HDD interface), but if there are around 10 HDDs, the SSD couldn’t provide enough cache for them all. I just wonder if there’s a perfect ratio?

    1. It is completely dependent on the VMs that you deploy. If you have very I/O-intensive VMs, each with large working sets (data in a state of change), then you will need a large SSD:HDD capacity ratio. If you have very low I/O VMs with quite small working sets, you can get away with a smaller SSD:HDD capacity ratio. Since it is difficult to state which is best for every customer, we have used a 10% rule-of-thumb to cover most virtualized application workloads.
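
      As a very rough sketch of that rule of thumb (assuming, and this is my assumption rather than a definition, that the 10% is applied to the anticipated consumed capacity of the VMs before FTT replication; check the design and sizing guide for the exact definition in your version):

        # Rough 10% rule-of-thumb for sizing the flash (cache) tier.
        # Assumption: 10% of the anticipated consumed VM capacity, before FTT.

        def flash_tier_needed_gb(num_vms, consumed_gb_per_vm, rule_pct=10):
            return num_vms * consumed_gb_per_vm * rule_pct / 100.0

        # e.g. 100 VMs, each expected to consume ~50GB of data:
        print(flash_tier_needed_gb(100, 50))  # 500.0 GB of flash across the cluster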

      1. Appreciated, Cormac. I understand the ratio will determine the performance, and users configure it for their application use case; it provides flexibility to users.
        I may not have made it clear:
        the ratio I mentioned here is the number of physical devices, not the capacity.
        Or does performance have no relationship to the physical device count ratio, and is it only affected by the SSD:HDD capacity ratio?

        1. This is one of those “it depends” answers, Lyne.

          If all of your writes are hitting the cache layer, and all of your reads are also satisfied by the cache layer, and destaging from flash to disk is working well, then 1:1 ratio will work just fine.

          If however you have read cache misses that need to be serviced from HDD, or there is a large amount of writes in flash that need to be regularly destaged from flash to HDD, then you will find that a larger ratio, and the use of striping across multiple HDDs for your virtual machine objects can give better performance.

  8. Yes, Cormac.
    That’s my concern. We’re struggling with the performance difference between 1 SSD:4 HDD and 1 SSD:5 HDD.
    I think if we have a big SSD, there should be less chance of a cache miss.
    Even when the cache misses, 4 HDDs versus 5 HDDs shouldn’t make a big difference, right?
    Maybe we need to set up an environment and collect some test data. 🙂

  9. Hi Cormac,
    I run VSAN with 3 hosts
    and have also configured LACP between the servers and the switch.
    The SSDs are Samsung 850 Pro, 512GB in size.
    After setting this up I tested the copy speed between 2 VMs running on VSAN, and the speed is between 20MB/s and 60MB/s.
    What is my problem?
    Note that in Smart Storage Administrator I disabled caching on the SAS and SSD disks, then tested the speed and it was very bad.
    I then deleted the arrays and recreated them with caching enabled, and again the speed was very bad.
    Please help.

    1. Hi Morteza,

      I noticed you’re using the same Samsung consumer-grade SSDs that I thought would work. Are you using them as the caching tier or as the capacity drives? In my case, I used them as the caching tier and had all sorts of issues, even down to Permanent Disk Loss errors randomly appearing, requiring a host reboot. I’ve since moved them to the capacity tier and put in some enterprise SSDs and, so far, haven’t had any further issues.

      Thanks

      Andrew

  10. I posted this on the VM/Host affinity groups, but didn’t get a reply. I’m looking at setting up a VSAN stretched cluster. Can you help answer this?

    ——–

    How do VM/Host affinity groups work with fault domains? I’m looking at setting up a VSAN, and setting the fault domain for site A to be site B. As I understand it, by doing that, when I set FTT=1, the data will be replicated to site B instead of to another node at site A. This is to cover the case where we lose the entire rack at Site A. The VMs will be able to reboot at Site B off the replicated data at Site B.

    If I were to use VM/Host affinity groups, then wouldn’t I need to replicate to a second node at site A? Would that mean setting FTT=2, and it would replicate to a node at site A, and a node at site B? Maybe VM/Host affinity groups don’t work when using fault domains. Can you help me sort that out?

    1. First, VSAN Stretched Cluster only supports FTT=1. Fault Domains and FTT work together.

      If you have a failure on site A, the VM/Host Affinity rules will attempt to restart the VM on the same site, i.e. site A.

      If you have a complete site failure (e.g. lost power on site A), the VM/Host affinity rules will then attempt to restart the VM on the remote site, i.e. site B.

      You still need to use Fault Domains with Stretched Cluster, but simply as a way of grouping hosts on each site together.

      This should be well documented in the stretched cluster guide. There is also a PoC guide due to be released very soon which will provide you with further detail.

      1. Thanks for your reply.

        So if a stretched cluster has FTT=1, then doesn’t that mean it will only replicate data to another node at site B? If it only replicates to another node at site B, and a node at site A goes down, how will VM/HA rules be able to restart the VM on the same site A?

      2. Hi,

        Can you say if removing this limitation of Stretched Cluster (FTT=1 only) is on the roadmap? We are looking at implementing it but would like to have 2 copies on the primary site + 1 on the secondary (or maybe a 2+2 active-active configuration).

        Thanks, Vjeran

        1. We are definitely looking at this, and the plan is to improve upon it. But there are no dates for the feature that I can share with you I’m afraid.

          1. This didn’t make it into the 6.2 release? Sometimes those details don’t get advertised at launch, so I’m still hoping.. 😉

  11. About the Health Check plugin, any thoughts on why it triggers the ‘Site Latency Health’ alarm between host and witness at as low as 15ms, when less than or equal to 100ms is the recommended figure? Is there any way to tweak this?

  12. Hi Cormac, I read your book BTW. You and Duncan Epping are really good in the world of virtualization….really good read. Couldn’t have asked for more.

  13. Hi Cormac,

    I found that some posts have a reply box, but some do not.
    I have read the post about “VSAN 6.2 Part 1 – Deduplication and Compression” and wanted to leave a reply there, but it seems that there is no place to do so…

    How can I leave a reply to that post?

      1. OK, got it.

        So I will post my questions about “Deduplication and Compression” here as a last resort.

        My environment is as follows:
        1. 3 All-Flash ESXi hosts with Dedup and Compression enabled.
        2. Only the PSC, VCSA and 2 other VMs have been deployed, with less than 1TB in total.
        3. The object space reservation is 0% with the default VSAN storage policy.

        But what I see is:
        1. The deduplication and compression overhead is 6.5TB.
        2. The ‘used-total’ grows to about 2TB after enabling dedup and compression.

        Is that normal behaviour after enabling the feature? BTW, is there any formula that I can use to calculate the expected consumed capacity?

        Thanks.

          1. Yes, 5% of the total raw capacity is true. I also found the same description in the VMware documentation center.

            Thank you so much.
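
            As a minimal sketch of that estimate (assuming the overhead is simply ~5% of the total raw capacity of the disk groups where deduplication and compression is enabled; the exact figure can vary by version and configuration):

              # Estimate the deduplication and compression overhead as ~5% of
              # the total raw capacity (assumption based on the figure above).

              def dedup_compression_overhead_tb(raw_capacity_tb, overhead_pct=5):
                  return raw_capacity_tb * overhead_pct / 100.0

              # e.g. a cluster with 100TB of raw all-flash capacity:
              print(dedup_compression_overhead_tb(100))  # 5.0 TB of overhead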

  14. Hi Cormac,
    vSAN is leveraging the new vsanSparse snapshot technology. Does this new snapshot technology also reduce the stun time during removal of a large snapshot compared to traditional “redo log” snapshots? I didn’t find any comments about this in the vSAN snapshot performance white paper.

    1. I think the main difference is the in-memory cache and the granularity that vsanSparse uses – otherwise the techniques are quite similar. However I am not aware of any study to measure the differences. This might have further useful info: https://storagehub.vmware.com/#!/vmware-vsan/vsansparse-tech-note

  15. Hi Cormac,

    I wanted to run a vSAN maintenance scenario by you to see if there are any potential drawbacks, aside from a node failing while performing the maintenance. This is regarding ‘Ensure availability’ and ‘No data migration’ maintenance modes.

    Scenario:

    A single node in a 4-node vSAN cluster is placed into maintenance mode using the ‘No data migration’ method. Once in maintenance mode, software/firmware updates are applied to the node and it is unavailable for roughly 30-40 minutes. After the maintenance is completed the node is placed back into production and the administrator immediately moves on to the next node in the cluster to be patched. The admin again uses the same ‘No data migration’ maintenance mode on this node, applies updates for 30-40 minutes, and so on. These steps are repeated for the remaining nodes.

    Cluster details
    vSAN version: 6.2
    Hosts in Cluster: 4
    Storage Policy on all VMs: FTT=1
    Fault Domains: Single FD per host
    Disk Configuration: Hybrid

    Question:

    If the admin is performing maintenance this way without waiting for components to re-sync after each 30-40 minute window and is not using ‘Ensure availability’, would there be potential data issues or a chance of VMs becoming unavailable as a result? This is again without a node failing in the cluster during these maintenance windows. I understand this is not the preferred way of doing maintenance, but I was just curious what could happen and if there were any fail-safes when this occurs.

    1. You definitely need to be careful with this approach. First, you might like to increase the cmmds repair delay timeout value above the default 60-minute value (see KB 2075456). This gives you a bit more leeway, in case it takes a bit longer to apply the firmware and reboot the host. It will mean that a rebuild won’t start if the maintenance runs over 1 hour.

      Now there may well be some changes that need to be synced once the host has rebooted. You need to wait for this to complete before starting maintenance on the next host. I like to use RVC commands for this, such as vsan.resync_dashboard (you can also use the UI). Only commence work on the next host when you are sure that all objects are fully synced and active.

      HTH

      Cormac

  16. Hi Cormac,

    Can you please reply to my query: “What will happen when the whole environment goes down and powers back on again? Do we run some sort of integrity check?”

    Regards,
    Sai

  17. Hi Cormac,

    I have a question about vSAN; could you please explain in more detail:

    I have a vSAN cluster with 3 ESXi hosts (1 x 50GB SSD and 1 x 300GB HDD per host). The VM storage policy is: Number of Failures to Tolerate = 1, Number of Disk Stripes per Object = 1.
    If I have a VM with a virtual disk size of 400GB, what happens and how does vSAN store/distribute the VM? It cannot store 2 replicas on 2 hosts because there is only a 300GB HDD per host, is that correct?

    Thank you so much

    1. Correct – you will not be able to provision this VM with that policy, unless you override the policy with a ForceProvision entry. With ForceProvision, it means the VM will be provisioned as an FTT=0, so there will be no protection.

      1. Thanks Cormac for the information. So, if I set Number of Failures to Tolerate = 1 and Number of Disk Stripes per Object = 2, is it OK for this case?

        1. No – Disk Stripes require unique capacity devices, and you do not have enough devices to accommodate this, as you only have one capacity device per host. At most, you can get FTT=1 with SW=1. These requirements are called out in the product documentation, and in the vSAN deep dive book.
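
          As a quick sanity check of that constraint, a small sketch (an illustrative necessary-condition check only, not the full placement logic vSAN uses):

            # Each stripe needs its own capacity device, and each replica needs
            # its own host, so at minimum you need (FTT+1) * SW capacity devices
            # and 2*FTT+1 hosts. This ignores component size vs. device size.

            def policy_fits(hosts, capacity_devices_per_host, ftt, stripe_width):
                enough_hosts = hosts >= 2 * ftt + 1
                enough_devices = hosts * capacity_devices_per_host >= (ftt + 1) * stripe_width
                return enough_hosts and enough_devices

            print(policy_fits(3, 1, ftt=1, stripe_width=1))  # True  - FTT=1, SW=1 is possible
            print(policy_fits(3, 1, ftt=1, stripe_width=2))  # False - needs 4 capacity devices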

  18. Hello Cormac

    I’m reading the Stretched cluster guide and do not follow the component bifurcation in the BW calculation section.

    ____
    200 virtual machines with 500GB vmdks (12 components each) using pre-vSAN 6.6 policies would require 4.8Mbps of bandwidth to the Witness host
    3 for swap, 3 for VM home space, 6 for vmdks = 12
    12 components X 200 VMs = 2,400 components
    2Mbps for every 1000 is 2.4 X 2Mbps = 4.8Mbps
    ____

    In this example PFTT=1 and SFTT=0
    Component Calculations for VMDK –
    SiteA – 500GB – Component0-255GB and Component1-245GB – 2 components
    SiteB – 500GB – Component0-255GB and Component1-245GB – 2 components
    Either SiteA or SiteB will also have 2 additional Witnesses, one for component0 and the other for component1 – 2 components
    Witness site – 1 for component0 and other for component1 – 2components
    The above gives us a total of 8 components for a VMDK of 500GB – why do I get 2 additional in the count?

    ____
    The same 200 virtual machines with 500GB vmdks using vSAN 6.6 Policy
    Rules for Cross Site protection with local Mirroring would require
    3 for swap, 7 for VM home space, 14 for vmdks = 24
    24 components X 200 VMs = 4,800 components
    2Mbps for every 1000 is 4.8 X 2Mbps = 9.6Mbps
    ____

    In this example PFTT=1 and SFTT=1
    My calculation gives me a total of 7 SWAP components (is the article not taking SFTT into account?)
    Component0 at SiteA – 1C
    Component0 at SiteA – 1C – Mirror/SFTT-1
    A witness component at SiteA – 1C
    Component0 at SiteB – 1C
    Component0 at SiteB – 1C – Mirror/SFTT-1
    A witness component at SiteB – 1C
    A witness at Witness site – 1C

    Similarly I get 7 for VMHome (which is in accordance with the guide). Why is SWAP 3 in the guide?

    I get the no. of components for a 500GB vDisk to be 14 –
    SiteA – 500GB – Component0-255GB and Component1-245GB – 2 components
    SiteA – 500GB – Component0-255GB and Component1-245GB – 2 components – Mirror/SFTT-1
    SiteA – 1 Witness for component0 and 1 Witness for Component1. This is because of SFTT. – 2 components
    SiteB – 500GB – Component0-255GB and Component1-245GB – 2 components
    SiteB – 500GB – Component0-255GB and Component1-245GB – 2 components – Mirror/SFTT-1
    SiteB – 1 Witness for component0 and 1 Witness for Component1. This is because of SFTT. – 2 components
    Witness site – 1 for component0 and other for component1 – 2 components
    Is my understanding correct?

    1. OK – that is a lot of information to take on-board. Let me see if we can simplify it a little bit.

      Let’s take your first example: “3 for swap, 3 for VM home space, 6 for vmdks = 12”. In this, we are stating 3 for swap and home since this is Stretched Cluster, so RAID-1 mirroring with a witness for swap and home, giving us 3 components: 1 x SiteA, 1 x SiteB, 1 x WitnessSite. Thus this is PFTT=1, SFTT=0.

      I don’t understand your next example which states “Either SiteA or SiteB will also have 2 additional Witnesses, one for component0 and the other for component1 – 2 components”. Where are you deriving these additional components from? If this is PFTT=1, SFTT=0, then there are no additional witnesses at either site. To the best of my knowledge, the only time we would have witnesses at SiteA or SiteB is when SFTT>0 and we are protecting VMs in the same site as well as across sites.

      Are you using https://storagehub.vmware.com/t/vmware-vsan/vsan-stretched-cluster-guide/bandwidth-calculation-5/ as your reference?
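
      To sanity-check the guide’s arithmetic, here is a quick sketch of the “2Mbps for every 1000 components” rule used in those examples:

        # Witness bandwidth rule of thumb from the stretched cluster guide:
        # roughly 2Mbps for every 1000 components.

        def witness_bandwidth_mbps(components_per_vm, num_vms, mbps_per_1000=2.0):
            total_components = components_per_vm * num_vms
            return total_components / 1000.0 * mbps_per_1000

        # Pre-vSAN 6.6 policy: 3 swap + 3 home + 6 vmdk = 12 components per VM
        print(witness_bandwidth_mbps(12, 200))  # 2,400 components -> 4.8 Mbps
        # Cross-site protection with local mirroring: 24 components per VM
        print(witness_bandwidth_mbps(24, 200))  # 4,800 components -> 9.6 Mbps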

  19. Thanks for your reply. My bad, I missed the email notification.

    I have taken these examples from vSAN Stretched Cluster Guide, Pg. no. 28.

    Example-1
    When PFTT=1 and SFTT=0, VMDK1=500GB (Component0=255GB and Component1=245GB)
    The Witness component will only exist in the Witness site as the local site policy isn’t in place, which is why the first example gives us 6 components for VMDK1.
    This I understand now.

    Example-2
    When PFTT=1 and SFTT=1 (RAID-1), VMDK1=500GB (Component0=255GB and Component1=245GB)
    The guide gets the below nos. –
    3 for swap, 7 for VM home space, 14 for vmdks = 24

    The no. of components for VMDK1 is 14 [5 at SiteA(4 data and 1 Witness component, SFTT=1), 5 at SiteB(4 data and 1 Witness component, SFTT=1) and 4 at Witness site].
    I understand the component bifurcation for VMDK1. But the example has just 3 components for VMSWAP, whereas 7 for VMHome.
    Shouldn’t VMHome and VMSwap follow the same PFTT and SFTT, therefore have the same no. of components?

    Example-3
    When PFTT=1 and SFTT=1 (Erasure coding), VMDK1=500GB (Component0=255GB and Component1=245GB)
    The guide gets the below nos. –
    3 for swap, 9 for VM home space, 18 for vmdks = 30
    For the VMDK I also get 18, and below is my calculation –
    4 for Component0 in SiteA – 3 data and 1 parity
    4 for Component1 in SiteA – 3 data and 1 parity
    4 for Component0 in SiteB – 3 data and 1 parity
    4 for Component1 in SiteB – 3 data and 1 parity
    2 Witness components for Component0 and Component1 at Witness site. Thus giving us a total of 18 components.

    For VMHome I also get 9 and below is my calculation –
    4 for Component0 in SiteA – 3 data and 1 parity
    4 for Component0 in SiteB – 3 data and 1 parity
    1 witness component at the Witness site. Thus giving us a total of 9 components.

    In this example too, I do not understand why VMSwap is 3?

    1. I see – so the issue is that the number of components reported for VM swap is incorrect. Swap should also use the same policy assigned to the VM, so this looks like an error in the calculation. It may be that the guide is using older calculations. In the past, swap was only ever FTT=1 (3 components) and did not inherit the VM policy. However that has changed in more recent vSAN versions, and now swap does indeed use the same policy as the rest of the objects that make up the VM. I’ll inform the document maintainers. Thanks for bringing this to our attention.

  20. Hi Cormac, what is your recommendation for the following situation:

    We have a 3-node vSAN cluster with every node in a separate rack – configured as 3 fault domains.
    Now we would like to bring a 4th node into the cluster and must place it in one of the 3 existing racks. So if the wrong rack fails we lose 50% of our nodes and have a split-brain situation. Regards, Thomas

    1. I don’t see any way of avoiding such a situation if you are only introducing a single node Thomas. Ideally, you would introduce a new node to each FD to maintain availability, but I guess you know this already.

      1. Thanks Cormac for your quick answer. I already suspected that there is no solution for this situation. Is there any chance of working around this problem with a witness appliance? Regards, Thomas

        1. I guess you could implement a 2+2+1 vSAN stretched cluster where 2 data hosts for Fault Domain A are in one rack, 2 data hosts for Fault Domain B are in another rack, and then the witness appliance is deployed on another ESXi host which is in the third rack. There is a bit of work involved in doing something like this, and I’m not sure if I would be comfortable converting a standard vSAN already in production to a stretched vSAN. I think further research would be needed here to see if it is even feasible.
