source: branches/samba-3.3.x/docs-xml/Samba3-Developers-Guide/architecture.xml@ 575

Last change on this file since 575 was 217, checked in by Herwig Bauernfeind, 16 years ago

Import Samba 3.3 branch at 3.3.0 level docs-xml (psmedley's port)

File size: 6.3 KB
Line 
1<?xml version="1.0" encoding="iso-8859-1"?>
2<!DOCTYPE chapter PUBLIC "-//Samba-Team//DTD DocBook V4.2-Based Variant V1.0//EN" "http://www.samba.org/samba/DTD/samba-doc">
3<chapter id="architecture">
4<chapterinfo>
5 <author>
6 <firstname>Dan</firstname><surname>Shearer</surname>
7 </author>
8 <pubdate> November 1997</pubdate>
9</chapterinfo>
10
11<title>Samba Architecture</title>
12
13<sect1>
14<title>Introduction</title>
15
16<para>
17This document gives a general overview of how Samba works
18internally. The Samba Team has tried to come up with a model which is
19the best possible compromise between elegance, portability, security
20and the constraints imposed by the very messy SMB and CIFS
21protocol.
22</para>
23
24<para>
25It also tries to answer some of the frequently asked questions such as:
26</para>
27
28<orderedlist>
29<listitem><para>
30 Is Samba secure when running on Unix? The xyz platform?
31 What about the root priveliges issue?
32</para></listitem>
33
34<listitem><para>Pros and cons of multithreading in various parts of Samba</para></listitem>
35
36<listitem><para>Why not have a separate process for name resolution, WINS, and browsing?</para></listitem>
37
38</orderedlist>
39
40</sect1>
41
42<sect1>
43<title>Multithreading and Samba</title>
44
45<para>
46People sometimes tout threads as a uniformly good thing. They are very
47nice in their place but are quite inappropriate for smbd. nmbd is
48another matter, and multi-threading it would be very nice.
49</para>
50
51<para>
52The short version is that smbd is not multithreaded, and alternative
53servers that take this approach under Unix (such as Syntax, at the
54time of writing) suffer tremendous performance penalties and are less
55robust. nmbd is not threaded either, but this is because it is not
56possible to do it while keeping code consistent and portable across 35
57or more platforms. (This drawback also applies to threading smbd.)
58</para>
59
60<para>
61The longer versions is that there are very good reasons for not making
62smbd multi-threaded. Multi-threading would actually make Samba much
63slower, less scalable, less portable and much less robust. The fact
64that we use a separate process for each connection is one of Samba's
65biggest advantages.
66</para>
67
68</sect1>
69
70<sect1>
71<title>Threading smbd</title>
72
73<para>
74A few problems that would arise from a threaded smbd are:
75</para>
76
77<orderedlist>
78<listitem><para>
79 It's not only to create threads instead of processes, but you
80 must care about all variables if they have to be thread specific
81 (currently they would be global).
82</para></listitem>
83
84<listitem><para>
85 if one thread dies (eg. a seg fault) then all threads die. We can
86 immediately throw robustness out the window.
87</para></listitem>
88
89<listitem><para>
90 many of the system calls we make are blocking. Non-blocking
91 equivalents of many calls are either not available or are awkward (and
92 slow) to use. So while we block in one thread all clients are
93 waiting. Imagine if one share is a slow NFS filesystem and the others
94 are fast, we will end up slowing all clients to the speed of NFS.
95</para></listitem>
96
97<listitem><para>
98 you can't run as a different uid in different threads. This means
99 we would have to switch uid/gid on _every_ SMB packet. It would be
100 horrendously slow.
101</para></listitem>
102
103<listitem><para>
104 the per process file descriptor limit would mean that we could only
105 support a limited number of clients.
106</para></listitem>
107
108<listitem><para>
109 we couldn't use the system locking calls as the locking context of
110 fcntl() is a process, not a thread.
111</para></listitem>
112
113</orderedlist>
114
115</sect1>
116
117<sect1>
118<title>Threading nmbd</title>
119
120<para>
121This would be ideal, but gets sunk by portability requirements.
122</para>
123
124<para>
125Andrew tried to write a test threads library for nmbd that used only
126ansi-C constructs (using setjmp and longjmp). Unfortunately some OSes
127defeat this by restricting longjmp to calling addresses that are
128shallower than the current address on the stack (apparently AIX does
129this). This makes a truly portable threads library impossible. So to
130support all our current platforms we would have to code nmbd both with
131and without threads, and as the real aim of threads is to make the
132code clearer we would not have gained anything. (it is a myth that
133threads make things faster. threading is like recursion, it can make
134things clear but the same thing can always be done faster by some
135other method)
136</para>
137
138<para>
139Chris tried to spec out a general design that would abstract threading
140vs separate processes (vs other methods?) and make them accessible
141through some general API. This doesn't work because of the data
142sharing requirements of the protocol (packets in the future depending
143on packets now, etc.) At least, the code would work but would be very
144clumsy, and besides the fork() type model would never work on Unix. (Is there an OS that it would work on, for nmbd?)
145</para>
146
147<para>
148A fork() is cheap, but not nearly cheap enough to do on every UDP
149packet that arrives. Having a pool of processes is possible but is
150nasty to program cleanly due to the enormous amount of shared data (in
151complex structures) between the processes. We can't rely on each
152platform having a shared memory system.
153</para>
154
155</sect1>
156
157<sect1>
158<title>nbmd Design</title>
159
160<para>
161Originally Andrew used recursion to simulate a multi-threaded
162environment, which use the stack enormously and made for really
163confusing debugging sessions. Luke Leighton rewrote it to use a
164queuing system that keeps state information on each packet. The
165first version used a single structure which was used by all the
166pending states. As the initialisation of this structure was
167done by adding arguments, as the functionality developed, it got
168pretty messy. So, it was replaced with a higher-order function
169and a pointer to a user-defined memory block. This suddenly
170made things much simpler: large numbers of functions could be
171made static, and modularised. This is the same principle as used
172in NT's kernel, and achieves the same effect as threads, but in
173a single process.
174</para>
175
176<para>
177Then Jeremy rewrote nmbd. The packet data in nmbd isn't what's on the
178wire. It's a nice format that is very amenable to processing but still
179keeps the idea of a distinct packet. See "struct packet_struct" in
180nameserv.h. It has all the detail but none of the on-the-wire
181mess. This makes it ideal for using in disk or memory-based databases
182for browsing and WINS support.
183</para>
184
185</sect1>
186</chapter>
Note: See TracBrowser for help on using the repository browser.