- Timestamp:
- Jan 15, 2010, 8:21:06 AM (16 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
branches/samba-3.3.x/docs/htmldocs/Samba3-HOWTO/SambaHA.html
r342 r368 1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 32. High Availability</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.7 4.0"><link rel="home" href="index.html" title="The Official Samba 3.3.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="Backup.html" title="Chapter 31. Backup Techniques"><link rel="next" href="largefile.html" title="Chapter 33. Handling Large Directories"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 32. High Availability</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="Backup.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="largefile.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="SambaHA"></a>Chapter 32. High Availability</h2></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="orgname">Samba Team</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">Jeremy</span> <span class="orgname">Samba Team</span> <span class="surname">Allison</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jra@samba.org">jra@samba.org</a>></code></p></div></div></div></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="SambaHA.html#id2672036">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="SambaHA.html#id2672157">Technical Discussion</a></span></dt><dd><dl><dt><span class="sect2"><a href="SambaHA.html#id2672191">The Ultimate Goal</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2672321">Why Is This So Hard?</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2673034">A Simple Solution</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2673115">High-Availability Server Products</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2673255">MS-DFS: The Poor Man's Cluster</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2673292">Conclusions</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2672036"></a>Features and Benefits</h2></div></div></div><p>2 <a class="indexterm" name="id267 2044"></a>3 <a class="indexterm" name="id267 2051"></a>4 <a class="indexterm" name="id267 2058"></a>1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 32. High Availability</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.75.2"><link rel="home" href="index.html" title="The Official Samba 3.3.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="Backup.html" title="Chapter 31. Backup Techniques"><link rel="next" href="largefile.html" title="Chapter 33. Handling Large Directories"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 32. High Availability</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="Backup.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="largefile.html">Next</a></td></tr></table><hr></div><div class="chapter" title="Chapter 32. High Availability"><div class="titlepage"><div><div><h2 class="title"><a name="SambaHA"></a>Chapter 32. High Availability</h2></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">Jeremy</span> <span class="surname">Allison</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jra@samba.org">jra@samba.org</a>></code></p></div></div></div></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="SambaHA.html#id2678091">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="SambaHA.html#id2678212">Technical Discussion</a></span></dt><dd><dl><dt><span class="sect2"><a href="SambaHA.html#id2678246">The Ultimate Goal</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2678376">Why Is This So Hard?</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2679095">A Simple Solution</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2679176">High-Availability Server Products</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2679316">MS-DFS: The Poor Man's Cluster</a></span></dt><dt><span class="sect2"><a href="SambaHA.html#id2679353">Conclusions</a></span></dt></dl></dd></dl></div><div class="sect1" title="Features and Benefits"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2678091"></a>Features and Benefits</h2></div></div></div><p> 2 <a class="indexterm" name="id2678099"></a> 3 <a class="indexterm" name="id2678106"></a> 4 <a class="indexterm" name="id2678113"></a> 5 5 Network administrators are often concerned about the availability of file and print 6 6 services. Network users are inclined toward intolerance of the services they depend … … 9 9 A sign in a computer room served to remind staff of their responsibilities. It read: 10 10 </p><div class="blockquote"><blockquote class="blockquote"><p> 11 <a class="indexterm" name="id267 2079"></a>12 <a class="indexterm" name="id267 2085"></a>13 <a class="indexterm" name="id267 2092"></a>14 <a class="indexterm" name="id267 2099"></a>11 <a class="indexterm" name="id2678134"></a> 12 <a class="indexterm" name="id2678140"></a> 13 <a class="indexterm" name="id2678147"></a> 14 <a class="indexterm" name="id2678154"></a> 15 15 All humans fail, in both great and small ways we fail continually. Machines fail too. 16 16 Computers are machines that are managed by humans, the fallout from failure … … 22 22 understand the problem. That is the purpose of this chapter. 23 23 </p><p> 24 <a class="indexterm" name="id267 2123"></a>25 <a class="indexterm" name="id267 2130"></a>26 <a class="indexterm" name="id267 2137"></a>24 <a class="indexterm" name="id2678178"></a> 25 <a class="indexterm" name="id2678185"></a> 26 <a class="indexterm" name="id2678192"></a> 27 27 Parenthetically, in the following discussion there are seeds of information on how to 28 28 provision a network infrastructure against failure. Our purpose here is not to provide … … 33 33 presentation of the current state of knowledge and practice in high availability as it 34 34 applies to the deployment of Samba and other CIFS/SMB technologies. 35 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2672157"></a>Technical Discussion</h2></div></div></div><p>36 <a class="indexterm" name="id267 2165"></a>37 <a class="indexterm" name="id267 2172"></a>38 <a class="indexterm" name="id267 2179"></a>35 </p></div><div class="sect1" title="Technical Discussion"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2678212"></a>Technical Discussion</h2></div></div></div><p> 36 <a class="indexterm" name="id2678220"></a> 37 <a class="indexterm" name="id2678227"></a> 38 <a class="indexterm" name="id2678234"></a> 39 39 The following summary was part of a presentation by Jeremy Allison at the SambaXP 2003 40 40 conference that was held at Goettingen, Germany, in April 2003. Material has been added 41 41 from other sources, but it was Jeremy who inspired the structure that follows. 42 </p><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2672191"></a>The Ultimate Goal</h3></div></div></div><p>43 <a class="indexterm" name="id267 2199"></a>44 <a class="indexterm" name="id267 2206"></a>45 <a class="indexterm" name="id267 2213"></a>42 </p><div class="sect2" title="The Ultimate Goal"><div class="titlepage"><div><div><h3 class="title"><a name="id2678246"></a>The Ultimate Goal</h3></div></div></div><p> 43 <a class="indexterm" name="id2678254"></a> 44 <a class="indexterm" name="id2678261"></a> 45 <a class="indexterm" name="id2678268"></a> 46 46 All clustering technologies aim to achieve one or more of the following: 47 </p><div class="itemizedlist"><ul type="disc"><li><p>Obtain the maximum affordable computational power.</p></li><li><p>Obtain faster program execution.</p></li><li><p>Deliver unstoppable services.</p></li><li><p>Avert points of failure.</p></li><li><p>Exact most effective utilization of resources.</p></li></ul></div><p>47 </p><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p>Obtain the maximum affordable computational power.</p></li><li class="listitem"><p>Obtain faster program execution.</p></li><li class="listitem"><p>Deliver unstoppable services.</p></li><li class="listitem"><p>Avert points of failure.</p></li><li class="listitem"><p>Exact most effective utilization of resources.</p></li></ul></div><p> 48 48 A clustered file server ideally has the following properties: 49 <a class="indexterm" name="id267 2254"></a>50 <a class="indexterm" name="id267 2261"></a>51 <a class="indexterm" name="id267 2268"></a>52 <a class="indexterm" name="id267 2275"></a>53 </p><div class="itemizedlist"><ul type="disc"><li><p>All clients can connect transparently to any server.</p></li><li><p>A server can fail and clients are transparently reconnected to another server.</p></li><li><p>All servers serve out the same set of files.</p></li><li><p>All file changes are immediately seen on all servers.</p><div class="itemizedlist"><ul type="circle"><li><p>Requires a distributed file system.</p></li></ul></div></li><li><p>Infinite ability to scale by adding more servers or disks.</p></li></ul></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2672321"></a>Why Is This So Hard?</h3></div></div></div><p>49 <a class="indexterm" name="id2678309"></a> 50 <a class="indexterm" name="id2678316"></a> 51 <a class="indexterm" name="id2678323"></a> 52 <a class="indexterm" name="id2678330"></a> 53 </p><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p>All clients can connect transparently to any server.</p></li><li class="listitem"><p>A server can fail and clients are transparently reconnected to another server.</p></li><li class="listitem"><p>All servers serve out the same set of files.</p></li><li class="listitem"><p>All file changes are immediately seen on all servers.</p><div class="itemizedlist"><ul class="itemizedlist" type="circle"><li class="listitem"><p>Requires a distributed file system.</p></li></ul></div></li><li class="listitem"><p>Infinite ability to scale by adding more servers or disks.</p></li></ul></div></div><div class="sect2" title="Why Is This So Hard?"><div class="titlepage"><div><div><h3 class="title"><a name="id2678376"></a>Why Is This So Hard?</h3></div></div></div><p> 54 54 In short, the problem is one of <span class="emphasis"><em>state</em></span>. 55 </p><div class="itemizedlist"><ul type="disc"><li><p>56 <a class="indexterm" name="id267 2340"></a>55 </p><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p> 56 <a class="indexterm" name="id2678395"></a> 57 57 All TCP/IP connections are dependent on state information. 58 58 </p><p> 59 <a class="indexterm" name="id267 2352"></a>59 <a class="indexterm" name="id2678406"></a> 60 60 The TCP connection involves a packet sequence number. This 61 61 sequence number would need to be dynamically updated on all 62 62 machines in the cluster to effect seamless TCP failover. 63 </p></li><li ><p>64 <a class="indexterm" name="id267 2368"></a>65 <a class="indexterm" name="id267 2375"></a>63 </p></li><li class="listitem"><p> 64 <a class="indexterm" name="id2678423"></a> 65 <a class="indexterm" name="id2678430"></a> 66 66 CIFS/SMB (the Windows networking protocols) uses TCP connections. 67 67 </p><p> 68 68 This means that from a basic design perspective, failover is not 69 69 seriously considered. 70 </p><div class="itemizedlist"><ul type="circle"><li><p>70 </p><div class="itemizedlist"><ul class="itemizedlist" type="circle"><li class="listitem"><p> 71 71 All current SMB clusters are failover solutions 72 72 they rely on the clients to reconnect. They provide server 73 73 failover, but clients can lose information due to a server failure. 74 <a class="indexterm" name="id267 2400"></a>74 <a class="indexterm" name="id2678455"></a> 75 75 </p></li></ul></div><p> 76 </p></li><li ><p>76 </p></li><li class="listitem"><p> 77 77 Servers keep state information about client connections. 78 </p><div class="itemizedlist"><a class="indexterm" name="id267 2418"></a><ul type="circle"><li><p>CIFS/SMB involves a lot of state.</p></li><li><p>Every file open must be compared with other open files78 </p><div class="itemizedlist"><a class="indexterm" name="id2678473"></a><ul class="itemizedlist" type="circle"><li class="listitem"><p>CIFS/SMB involves a lot of state.</p></li><li class="listitem"><p>Every file open must be compared with other open files 79 79 to check share modes.</p></li></ul></div><p> 80 </p></li></ul></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672440"></a>The Front-End Challenge</h4></div></div></div><p>81 <a class="indexterm" name="id267 2448"></a>82 <a class="indexterm" name="id267 2454"></a>83 <a class="indexterm" name="id267 2461"></a>84 <a class="indexterm" name="id267 2468"></a>85 <a class="indexterm" name="id267 2475"></a>86 <a class="indexterm" name="id267 2482"></a>87 <a class="indexterm" name="id267 2489"></a>80 </p></li></ul></div><div class="sect3" title="The Front-End Challenge"><div class="titlepage"><div><div><h4 class="title"><a name="id2678494"></a>The Front-End Challenge</h4></div></div></div><p> 81 <a class="indexterm" name="id2678502"></a> 82 <a class="indexterm" name="id2678509"></a> 83 <a class="indexterm" name="id2678516"></a> 84 <a class="indexterm" name="id2678523"></a> 85 <a class="indexterm" name="id2678530"></a> 86 <a class="indexterm" name="id2678537"></a> 87 <a class="indexterm" name="id2678544"></a> 88 88 To make it possible for a cluster of file servers to appear as a single server that has one 89 89 name and one IP address, the incoming TCP data streams from clients must be processed by the … … 91 91 layer level and then feed the SMB packet to different servers in the cluster. 92 92 </p><p> 93 <a class="indexterm" name="id267 2505"></a>94 <a class="indexterm" name="id267 2512"></a>93 <a class="indexterm" name="id2678560"></a> 94 <a class="indexterm" name="id2678567"></a> 95 95 One could split all IPC$ connections and RPC calls to one server to handle printing and user 96 96 lookup requirements. RPC printing handles are shared between different IPC4 sessions it is … … 99 99 Conceptually speaking, all other servers would then provide only file services. This is a simpler 100 100 problem to concentrate on. 101 </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672534"></a>Demultiplexing SMB Requests</h4></div></div></div><p>102 <a class="indexterm" name="id267 2542"></a>103 <a class="indexterm" name="id267 2548"></a>104 <a class="indexterm" name="id267 2556"></a>105 <a class="indexterm" name="id267 2563"></a>101 </p></div><div class="sect3" title="Demultiplexing SMB Requests"><div class="titlepage"><div><div><h4 class="title"><a name="id2678589"></a>Demultiplexing SMB Requests</h4></div></div></div><p> 102 <a class="indexterm" name="id2678596"></a> 103 <a class="indexterm" name="id2678603"></a> 104 <a class="indexterm" name="id2678610"></a> 105 <a class="indexterm" name="id2678618"></a> 106 106 De-multiplexing of SMB requests requires knowledge of SMB state information, 107 107 all of which must be held by the front-end <span class="emphasis"><em>virtual</em></span> server. 108 108 This is a perplexing and complicated problem to solve. 109 109 </p><p> 110 <a class="indexterm" name="id267 2580"></a>111 <a class="indexterm" name="id267 2587"></a>112 <a class="indexterm" name="id267 2593"></a>110 <a class="indexterm" name="id2678635"></a> 111 <a class="indexterm" name="id2678642"></a> 112 <a class="indexterm" name="id2678648"></a> 113 113 Windows XP and later have changed semantics so state information (vuid, tid, fid) 114 114 must match for a successful operation. This makes things simpler than before and is a 115 115 positive step forward. 116 116 </p><p> 117 <a class="indexterm" name="id267 2607"></a>118 <a class="indexterm" name="id267 2614"></a>117 <a class="indexterm" name="id2678662"></a> 118 <a class="indexterm" name="id2678669"></a> 119 119 SMB requests are sent by vuid to their associated server. No code exists today to 120 120 effect this solution. This problem is conceptually similar to the problem of … … 122 122 Terminal Server in Samba. 123 123 </p><p> 124 <a class="indexterm" name="id267 2628"></a>124 <a class="indexterm" name="id2678683"></a> 125 125 One possibility is to start by exposing the server pool to clients directly. 126 126 This could eliminate the de-multiplexing step. 127 </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672640"></a>The Distributed File System Challenge</h4></div></div></div><p>128 <a class="indexterm" name="id267 2648"></a>127 </p></div><div class="sect3" title="The Distributed File System Challenge"><div class="titlepage"><div><div><h4 class="title"><a name="id2678695"></a>The Distributed File System Challenge</h4></div></div></div><p> 128 <a class="indexterm" name="id2678703"></a> 129 129 There exists many distributed file systems for UNIX and Linux. 130 130 </p><p> 131 <a class="indexterm" name="id267 2660"></a>132 <a class="indexterm" name="id267 2666"></a>133 <a class="indexterm" name="id267 2673"></a>134 <a class="indexterm" name="id267 2680"></a>135 <a class="indexterm" name="id267 2687"></a>136 <a class="indexterm" name="id267 2694"></a>131 <a class="indexterm" name="id2678714"></a> 132 <a class="indexterm" name="id2678721"></a> 133 <a class="indexterm" name="id2678728"></a> 134 <a class="indexterm" name="id2678735"></a> 135 <a class="indexterm" name="id2678742"></a> 136 <a class="indexterm" name="id2678749"></a> 137 137 Many could be adopted to backend our cluster, so long as awareness of SMB 138 138 semantics is kept in mind (share modes, locking, and oplock issues in particular). 139 139 Common free distributed file systems include: 140 <a class="indexterm" name="id267 2704"></a>141 <a class="indexterm" name="id267 2711"></a>142 <a class="indexterm" name="id267 2718"></a>143 <a class="indexterm" name="id267 2725"></a>144 </p><div class="itemizedlist"><ul type="disc"><li><p>NFS</p></li><li><p>AFS</p></li><li><p>OpenGFS</p></li><li><p>Lustre</p></li></ul></div><p>145 <a class="indexterm" name="id267 2755"></a>140 <a class="indexterm" name="id2678759"></a> 141 <a class="indexterm" name="id2678766"></a> 142 <a class="indexterm" name="id2678773"></a> 143 <a class="indexterm" name="id2678780"></a> 144 </p><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p>NFS</p></li><li class="listitem"><p>AFS</p></li><li class="listitem"><p>OpenGFS</p></li><li class="listitem"><p>Lustre</p></li></ul></div><p> 145 <a class="indexterm" name="id2678810"></a> 146 146 The server pool (cluster) can use any distributed file system backend if all SMB 147 147 semantics are performed within this pool. 148 </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672767"></a>Restrictive Constraints on Distributed File Systems</h4></div></div></div><p>149 <a class="indexterm" name="id267 2775"></a>150 <a class="indexterm" name="id267 2782"></a>151 <a class="indexterm" name="id267 2789"></a>152 <a class="indexterm" name="id267 2796"></a>148 </p></div><div class="sect3" title="Restrictive Constraints on Distributed File Systems"><div class="titlepage"><div><div><h4 class="title"><a name="id2678822"></a>Restrictive Constraints on Distributed File Systems</h4></div></div></div><p> 149 <a class="indexterm" name="id2678830"></a> 150 <a class="indexterm" name="id2678837"></a> 151 <a class="indexterm" name="id2678844"></a> 152 <a class="indexterm" name="id2678851"></a> 153 153 Where a clustered server provides purely SMB services, oplock handling 154 154 may be done within the server pool without imposing a need for this to 155 155 be passed to the backend file system pool. 156 156 </p><p> 157 <a class="indexterm" name="id267 2809"></a>158 <a class="indexterm" name="id267 2816"></a>157 <a class="indexterm" name="id2678864"></a> 158 <a class="indexterm" name="id2678871"></a> 159 159 On the other hand, where the server pool also provides NFS or other file services, 160 160 it will be essential that the implementation be oplock-aware so it can … … 164 164 </p><p> 165 165 Last, all state information must be shared across the server pool. 166 </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672836"></a>Server Pool Communications</h4></div></div></div><p>167 <a class="indexterm" name="id267 2844"></a>168 <a class="indexterm" name="id267 2851"></a>169 <a class="indexterm" name="id267 2858"></a>170 <a class="indexterm" name="id267 2864"></a>166 </p></div><div class="sect3" title="Server Pool Communications"><div class="titlepage"><div><div><h4 class="title"><a name="id2678898"></a>Server Pool Communications</h4></div></div></div><p> 167 <a class="indexterm" name="id2678906"></a> 168 <a class="indexterm" name="id2678913"></a> 169 <a class="indexterm" name="id2678919"></a> 170 <a class="indexterm" name="id2678926"></a> 171 171 Most backend file systems support POSIX file semantics. This makes it difficult 172 172 to push SMB semantics back into the file system. POSIX locks have different properties 173 173 and semantics from SMB locks. 174 174 </p><p> 175 <a class="indexterm" name="id267 2878"></a>176 <a class="indexterm" name="id267 2885"></a>177 <a class="indexterm" name="id267 2891"></a>175 <a class="indexterm" name="id2678940"></a> 176 <a class="indexterm" name="id2678946"></a> 177 <a class="indexterm" name="id2678953"></a> 178 178 All <code class="literal">smbd</code> processes in the server pool must of necessity communicate 179 179 very quickly. For this, the current <em class="parameter"><code>tdb</code></em> file structure that Samba 180 180 uses is not suitable for use across a network. Clustered <code class="literal">smbd</code>s must use something else. 181 </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672922"></a>Server Pool Communications Demands</h4></div></div></div><p>181 </p></div><div class="sect3" title="Server Pool Communications Demands"><div class="titlepage"><div><div><h4 class="title"><a name="id2678983"></a>Server Pool Communications Demands</h4></div></div></div><p> 182 182 High-speed interserver communications in the server pool is a design prerequisite 183 183 for a fully functional system. Possibilities for this include: 184 </p><div class="itemizedlist"><a class="indexterm" name="id267 2936"></a><a class="indexterm" name="id2672942"></a><ul type="disc"><li><p>184 </p><div class="itemizedlist"><a class="indexterm" name="id2678997"></a><a class="indexterm" name="id2679004"></a><ul class="itemizedlist" type="disc"><li class="listitem"><p> 185 185 Proprietary shared memory bus (example: Myrinet or SCI [scalable coherent interface]). 186 186 These are high-cost items. 187 </p></li><li ><p>187 </p></li><li class="listitem"><p> 188 188 Gigabit Ethernet (now quite affordable). 189 </p></li><li ><p>189 </p></li><li class="listitem"><p> 190 190 Raw Ethernet framing (to bypass TCP and UDP overheads). 191 191 </p></li></ul></div><p> 192 192 We have yet to identify metrics for performance demands to enable this to happen 193 193 effectively. 194 </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="id2672976"></a>Required Modifications to Samba</h4></div></div></div><p>194 </p></div><div class="sect3" title="Required Modifications to Samba"><div class="titlepage"><div><div><h4 class="title"><a name="id2679038"></a>Required Modifications to Samba</h4></div></div></div><p> 195 195 Samba needs to be significantly modified to work with a high-speed server interconnect 196 196 system to permit transparent failover clustering. 197 197 </p><p> 198 198 Particular functions inside Samba that will be affected include: 199 </p><div class="itemizedlist"><ul type="disc"><li><p>199 </p><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p> 200 200 The locking database, oplock notifications, 201 201 and the share mode database. 202 </p></li><li ><p>203 <a class="indexterm" name="id267 3004"></a>204 <a class="indexterm" name="id267 3011"></a>202 </p></li><li class="listitem"><p> 203 <a class="indexterm" name="id2679065"></a> 204 <a class="indexterm" name="id2679072"></a> 205 205 Failure semantics need to be defined. Samba behaves the same way as Windows. 206 206 When oplock messages fail, a file open request is allowed, but this is 207 207 potentially dangerous in a clustered environment. So how should interserver 208 208 pool failure semantics function, and how should such functionality be implemented? 209 </p></li><li ><p>209 </p></li><li class="listitem"><p> 210 210 Should this be implemented using a point-to-point lock manager, or can this 211 211 be done using multicast techniques? 212 </p></li></ul></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2673034"></a>A Simple Solution</h3></div></div></div><p>213 <a class="indexterm" name="id267 3042"></a>214 <a class="indexterm" name="id267 3049"></a>215 <a class="indexterm" name="id267 3056"></a>212 </p></li></ul></div></div></div><div class="sect2" title="A Simple Solution"><div class="titlepage"><div><div><h3 class="title"><a name="id2679095"></a>A Simple Solution</h3></div></div></div><p> 213 <a class="indexterm" name="id2679103"></a> 214 <a class="indexterm" name="id2679110"></a> 215 <a class="indexterm" name="id2679117"></a> 216 216 Allowing failover servers to handle different functions within the exported file system 217 217 removes the problem of requiring a distributed locking protocol. 218 218 </p><p> 219 <a class="indexterm" name="id267 3069"></a>220 <a class="indexterm" name="id267 3076"></a>219 <a class="indexterm" name="id2679130"></a> 220 <a class="indexterm" name="id2679138"></a> 221 221 If only one server is active in a pair, the need for high-speed server interconnect is avoided. 222 222 This allows the use of existing high-availability solutions, instead of inventing a new one. … … 225 225 must remember where all services are located a complexity not easily dealt with. 226 226 </p><p> 227 <a class="indexterm" name="id267 3100"></a>227 <a class="indexterm" name="id2679162"></a> 228 228 The <span class="emphasis"><em>virtual server</em></span> is still needed to redirect requests to backend 229 229 servers. Backend file space integrity is the responsibility of the administrator. 230 </p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2673115"></a>High-Availability Server Products</h3></div></div></div><p>231 <a class="indexterm" name="id267 3123"></a>232 <a class="indexterm" name="id267 3130"></a>233 <a class="indexterm" name="id267 3137"></a>234 <a class="indexterm" name="id267 3144"></a>235 <a class="indexterm" name="id267 3150"></a>230 </p></div><div class="sect2" title="High-Availability Server Products"><div class="titlepage"><div><div><h3 class="title"><a name="id2679176"></a>High-Availability Server Products</h3></div></div></div><p> 231 <a class="indexterm" name="id2679184"></a> 232 <a class="indexterm" name="id2679191"></a> 233 <a class="indexterm" name="id2679198"></a> 234 <a class="indexterm" name="id2679205"></a> 235 <a class="indexterm" name="id2679212"></a> 236 236 Failover servers must communicate in order to handle resource failover. This is essential 237 237 for high-availability services. The use of a dedicated heartbeat is a common technique to … … 239 239 link (LAN or serial). 240 240 </p><p> 241 <a class="indexterm" name="id267 3165"></a>242 <a class="indexterm" name="id267 3172"></a>243 <a class="indexterm" name="id267 3179"></a>244 <a class="indexterm" name="id267 3186"></a>245 <a class="indexterm" name="id267 3193"></a>241 <a class="indexterm" name="id2679227"></a> 242 <a class="indexterm" name="id2679234"></a> 243 <a class="indexterm" name="id2679241"></a> 244 <a class="indexterm" name="id2679248"></a> 245 <a class="indexterm" name="id2679255"></a> 246 246 Many failover solutions (like Red Hat Cluster Manager and Microsoft Wolfpack) 247 247 can use a shared SCSI of Fiber Channel disk storage array for failover communication. … … 249 249 <a class="ulink" href="http://www.redhat.com/docs/manuals/enterprise/RHEL-AS-2.1-Manual/cluster-manager/s1-service-samba.html" target="_top">www.redhat.com</a>. 250 250 </p><p> 251 <a class="indexterm" name="id267 3215"></a>251 <a class="indexterm" name="id2679277"></a> 252 252 The Linux High Availability project is a resource worthy of consultation if your desire is 253 253 to build a highly available Samba file server solution. Please consult the home page at 254 254 <a class="ulink" href="http://www.linux-ha.org/" target="_top">www.linux-ha.org/</a>. 255 255 </p><p> 256 <a class="indexterm" name="id267 3235"></a>257 <a class="indexterm" name="id267 3242"></a>256 <a class="indexterm" name="id2679297"></a> 257 <a class="indexterm" name="id2679304"></a> 258 258 Front-end server complexity remains a challenge for high availability because it must deal 259 259 gracefully with backend failures, while at the same time providing continuity of service 260 260 to all network clients. 261 </p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2673255"></a>MS-DFS: The Poor Man's Cluster</h3></div></div></div><p>262 <a class="indexterm" name="id267 3263"></a>263 <a class="indexterm" name="id267 3270"></a>261 </p></div><div class="sect2" title="MS-DFS: The Poor Man's Cluster"><div class="titlepage"><div><div><h3 class="title"><a name="id2679316"></a>MS-DFS: The Poor Man's Cluster</h3></div></div></div><p> 262 <a class="indexterm" name="id2679324"></a> 263 <a class="indexterm" name="id2679331"></a> 264 264 MS-DFS links can be used to redirect clients to disparate backend servers. This pushes 265 265 complexity back to the network client, something already included by Microsoft. … … 269 269 Above all, at the cost of complexity of management, a distributed system (pseudo-cluster) can 270 270 be created using existing Samba functionality. 271 </p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2673292"></a>Conclusions</h3></div></div></div><div class="itemizedlist"><ul type="disc"><li><p>Transparent SMB clustering is hard to do!</p></li><li><p>Client failover is the best we can do today.</p></li><li><p>Much more work is needed before a practical and manageable high-availability transparent cluster solution will be possible.</p></li><li><p>MS-DFS can be used to create the illusion of a single transparent cluster.</p></li></ul></div></div></div></div><div class="navfooter"><hr><table width="100%" summary="Navigation footer"><tr><td width="40%" align="left"><a accesskey="p" href="Backup.html">Prev</a> </td><td width="20%" align="center"><a accesskey="u" href="optional.html">Up</a></td><td width="40%" align="right"> <a accesskey="n" href="largefile.html">Next</a></td></tr><tr><td width="40%" align="left" valign="top">Chapter 31. Backup Techniques </td><td width="20%" align="center"><a accesskey="h" href="index.html">Home</a></td><td width="40%" align="right" valign="top"> Chapter 33. Handling Large Directories</td></tr></table></div></body></html>271 </p></div><div class="sect2" title="Conclusions"><div class="titlepage"><div><div><h3 class="title"><a name="id2679353"></a>Conclusions</h3></div></div></div><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p>Transparent SMB clustering is hard to do!</p></li><li class="listitem"><p>Client failover is the best we can do today.</p></li><li class="listitem"><p>Much more work is needed before a practical and manageable high-availability transparent cluster solution will be possible.</p></li><li class="listitem"><p>MS-DFS can be used to create the illusion of a single transparent cluster.</p></li></ul></div></div></div></div><div class="navfooter"><hr><table width="100%" summary="Navigation footer"><tr><td width="40%" align="left"><a accesskey="p" href="Backup.html">Prev</a> </td><td width="20%" align="center"><a accesskey="u" href="optional.html">Up</a></td><td width="40%" align="right"> <a accesskey="n" href="largefile.html">Next</a></td></tr><tr><td width="40%" align="left" valign="top">Chapter 31. Backup Techniques </td><td width="20%" align="center"><a accesskey="h" href="index.html">Home</a></td><td width="40%" align="right" valign="top"> Chapter 33. Handling Large Directories</td></tr></table></div></body></html>
Note:
See TracChangeset
for help on using the changeset viewer.