- Timestamp:
- Nov 12, 2012, 5:38:52 PM
- File:
-
- 1 edited
branches/samba-3.5.x/docs/htmldocs/Samba3-HOWTO/unicode.html
r599 r736 1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.75.2"><link rel="home" href="index.html" title="The Official Samba 3.5.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" title="Chapter 30. Unicode/Charsets"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. 
Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id43254 2">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432588">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432707">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432832">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432862">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id432977">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id433554">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id433668">Migration from 
Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id433807">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id433812">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" title="Features and Benefits"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id432542"></a>Features and Benefits</h2></div></div></div><p>2 <a class="indexterm" name="id4325 50"></a>1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.75.2"><link rel="home" href="index.html" title="The Official Samba 3.5.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" title="Chapter 30. Unicode/Charsets"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. 
Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id432540">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432585">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432704">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432830">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id432859">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id432975">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id433552">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id433665">Migration from 
Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id433804">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id433810">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" title="Features and Benefits"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id432540"></a>Features and Benefits</h2></div></div></div><p> 2 <a class="indexterm" name="id432548"></a> 3 3 Every industry eventually matures. One of the great areas of maturation is in 4 4 the focus that has been given over the past decade to make it possible for anyone … … 12 12 is deserving of special mention. 13 13 </p><p> 14 <a class="indexterm" name="id43257 4"></a>14 <a class="indexterm" name="id432571"></a> 15 15 Samba-2.x supported a single locale through a mechanism called 16 16 <span class="emphasis"><em>codepages</em></span>. Samba-3 is destined to become a truly transglobal 17 17 file- and printer-sharing platform. 18 </p></div><div class="sect1" title="What Are Charsets and Unicode?"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id43258 8"></a>What Are Charsets and Unicode?</h2></div></div></div><p>19 <a class="indexterm" name="id43259 5"></a>18 </p></div><div class="sect1" title="What Are Charsets and Unicode?"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id432585"></a>What Are Charsets and Unicode?</h2></div></div></div><p> 19 <a class="indexterm" name="id432593"></a> 20 20 Computers communicate in numbers. In texts, each number is 21 21 translated to a corresponding letter. The meaning that will be assigned … … 23 23 </em></span> that is used. 
24 24 </p><p> 25 <a class="indexterm" name="id4326 11"></a>26 <a class="indexterm" name="id43261 8"></a>25 <a class="indexterm" name="id432609"></a> 26 <a class="indexterm" name="id432615"></a> 27 27 A charset can be seen as a table that is used to translate numbers to 28 28 letters. Not all computers use the same charset (there are charsets … … 32 32 256 characters. Using this mode of encoding, each character takes exactly one byte. 33 33 </p><p> 34 <a class="indexterm" name="id43263 2"></a>35 <a class="indexterm" name="id43263 9"></a>34 <a class="indexterm" name="id432630"></a> 35 <a class="indexterm" name="id432636"></a> 36 36 There are also charsets that support extended characters, but those need at least 37 37 twice as much storage space as does ASCII encoding. Such charsets can contain … … 40 40 more then one byte to store one character. 41 41 </p><p> 42 <a class="indexterm" name="id43265 7"></a>42 <a class="indexterm" name="id432655"></a> 43 43 One standardized multibyte charset encoding scheme is known as 44 44 <a class="ulink" href="http://www.unicode.org/" target="_top">unicode</a>. A big advantage of using a … … 46 46 computers use the same charset when they are communicating. 47 47 </p><p> 48 <a class="indexterm" name="id43267 5"></a>49 <a class="indexterm" name="id43268 2"></a>50 <a class="indexterm" name="id43268 9"></a>48 <a class="indexterm" name="id432673"></a> 49 <a class="indexterm" name="id432680"></a> 50 <a class="indexterm" name="id432687"></a> 51 51 Old Windows clients use single-byte charsets, named 52 52 <em class="parameter"><code>codepages</code></em>, by Microsoft. However, there is no support for … … 54 54 have to make sure you are using the same charset when talking to an older client. 55 55 Newer clients (Windows NT, 200x, XP) talk Unicode over the wire. 
56 </p></div><div class="sect1" title="Samba and Charsets"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id43270 7"></a>Samba and Charsets</h2></div></div></div><p>57 <a class="indexterm" name="id43271 4"></a>58 <a class="indexterm" name="id4327 21"></a>56 </p></div><div class="sect1" title="Samba and Charsets"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id432704"></a>Samba and Charsets</h2></div></div></div><p> 57 <a class="indexterm" name="id432712"></a> 58 <a class="indexterm" name="id432719"></a> 59 59 As of Samba-3, Samba can (and will) talk Unicode over the wire. Internally, 60 60 Samba knows of three kinds of character sets: 61 61 </p><div class="variablelist"><dl><dt><span class="term"><a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a></span></dt><dd><p> 62 <a class="indexterm" name="id4327 51"></a>63 <a class="indexterm" name="id43275 8"></a>62 <a class="indexterm" name="id432749"></a> 63 <a class="indexterm" name="id432755"></a> 64 64 This is the charset used internally by your operating system. 65 65 The default is <code class="constant">UTF-8</code>, which is fine for most … … 74 74 Run <code class="literal">testparm -v | grep "dos charset"</code> to see 75 75 what the default is on your system. 
76 </p></dd></dl></div></div><div class="sect1" title="Conversion from Old Names"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id43283 2"></a>Conversion from Old Names</h2></div></div></div><p>77 <a class="indexterm" name="id4328 40"></a>76 </p></dd></dl></div></div><div class="sect1" title="Conversion from Old Names"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id432830"></a>Conversion from Old Names</h2></div></div></div><p> 77 <a class="indexterm" name="id432838"></a> 78 78 Because previous Samba versions did not do any charset conversion, 79 79 characters in filenames are usually not correct in the UNIX charset but only … … 81 81 </p><p>Bjoern Jacke has written a utility named <a class="ulink" href="http://j3e.de/linux/convmv/" target="_top">convmv</a> 82 82 that can convert whole directory structures to different charsets with one single command. 83 </p></div><div class="sect1" title="Japanese Charsets"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4328 62"></a>Japanese Charsets</h2></div></div></div><p>83 </p></div><div class="sect1" title="Japanese Charsets"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id432859"></a>Japanese Charsets</h2></div></div></div><p> 84 84 Setting up Japanese charsets is quite difficult. This is mainly because: 85 85 </p><div class="itemizedlist"><ul class="itemizedlist" type="disc"><li class="listitem"><p> 86 <a class="indexterm" name="id43287 6"></a>86 <a class="indexterm" name="id432874"></a> 87 87 The Windows character set is extended from the original legacy Japanese 88 88 standard (JIS X 0208) and is not standardized. This means that the strictly 89 89 standardized implementation cannot support the full Windows character set. 
90 90 </p></li><li class="listitem"><p> 91 <a class="indexterm" name="id4328 90"></a>92 <a class="indexterm" name="id43289 6"></a>93 <a class="indexterm" name="id43290 3"></a>94 <a class="indexterm" name="id4329 10"></a>95 <a class="indexterm" name="id43291 7"></a>91 <a class="indexterm" name="id432887"></a> 92 <a class="indexterm" name="id432894"></a> 93 <a class="indexterm" name="id432901"></a> 94 <a class="indexterm" name="id432908"></a> 95 <a class="indexterm" name="id432914"></a> 96 96 Mainly for historical reasons, there are several encoding methods in 97 97 Japanese, which are not fully compatible with each other. There are … … 113 113 the charset parameters depends on the implementation of iconv() you are using. 114 114 </p><p> 115 <a class="indexterm" name="id43294 6"></a>116 <a class="indexterm" name="id43295 3"></a>117 <a class="indexterm" name="id4329 60"></a>118 <a class="indexterm" name="id43296 6"></a>115 <a class="indexterm" name="id432944"></a> 116 <a class="indexterm" name="id432950"></a> 117 <a class="indexterm" name="id432957"></a> 118 <a class="indexterm" name="id432964"></a> 119 119 Though 2-byte fixed UCS-2 encoding is used in Windows internally, 120 120 Shift_JIS series encoding is usually used in Japanese environments 121 121 as ASCII encoding is in English environments. 
122 </p></li></ul></div><div class="sect2" title="Basic Parameter Setting"><div class="titlepage"><div><div><h3 class="title"><a name="id43297 7"></a>Basic Parameter Setting</h3></div></div></div><p>123 <a class="indexterm" name="id43298 4"></a>122 </p></li></ul></div><div class="sect2" title="Basic Parameter Setting"><div class="titlepage"><div><div><h3 class="title"><a name="id432975"></a>Basic Parameter Setting</h3></div></div></div><p> 123 <a class="indexterm" name="id432981"></a> 124 124 The <a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a> and 125 125 <a class="link" href="smb.conf.5.html#DISPLAYCHARSET" target="_top">display charset</a> … … 128 128 but sometimes has a different name. 129 129 </p><p> 130 <a class="indexterm" name="id43301 7"></a>131 <a class="indexterm" name="id43302 4"></a>132 <a class="indexterm" name="id4330 31"></a>130 <a class="indexterm" name="id433015"></a> 131 <a class="indexterm" name="id433022"></a> 132 <a class="indexterm" name="id433028"></a> 133 133 The <a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a> can be either Shift_JIS series, 134 134 EUC-JP series, or UTF-8. UTF-8 is always available, but the availability of other locales … … 167 167 with Shift_JIS. 168 168 </p></dd><dt><span class="term">EUC-JP series</span></dt><dd><p> 169 <a class="indexterm" name="id43314 7"></a>170 <a class="indexterm" name="id43315 4"></a>169 <a class="indexterm" name="id433145"></a> 170 <a class="indexterm" name="id433152"></a> 171 171 EUC-JP series means a locale that is equivalent to the industry 172 172 standard called EUC-JP, widely used in Japanese UNIX (although EUC … … 177 177 <span class="quote">“<span class="quote">.txt</span>”</span> (an 8-byte BINARY string). 
178 178 </p><p> 179 <a class="indexterm" name="id43317 5"></a>180 <a class="indexterm" name="id43318 2"></a>181 <a class="indexterm" name="id43318 9"></a>182 <a class="indexterm" name="id43319 6"></a>183 <a class="indexterm" name="id43320 2"></a>184 <a class="indexterm" name="id43320 9"></a>185 <a class="indexterm" name="id43321 6"></a>186 <a class="indexterm" name="id43322 3"></a>187 <a class="indexterm" name="id4332 30"></a>188 <a class="indexterm" name="id43323 6"></a>179 <a class="indexterm" name="id433173"></a> 180 <a class="indexterm" name="id433180"></a> 181 <a class="indexterm" name="id433186"></a> 182 <a class="indexterm" name="id433193"></a> 183 <a class="indexterm" name="id433200"></a> 184 <a class="indexterm" name="id433207"></a> 185 <a class="indexterm" name="id433214"></a> 186 <a class="indexterm" name="id433220"></a> 187 <a class="indexterm" name="id433227"></a> 188 <a class="indexterm" name="id433234"></a> 189 189 Since EUC-JP is usually used on open source UNIX, Linux, and FreeBSD, and on commercial-based UNIX, Solaris, 190 190 IRIX, and Tru64 UNIX as Japanese locale (however, it is also possible on Solaris to use Shift_JIS and UTF-8, … … 199 199 during parsing filenames. 200 200 </p><p> 201 <a class="indexterm" name="id43326 3"></a>201 <a class="indexterm" name="id433261"></a> 202 202 Moreover, if you built Samba using differently installed libiconv, 203 203 the eucJP-ms locale included in libiconv and EUC-JP series locale … … 224 224 written from Windows on UNIX. 
225 225 </p><p> 226 <a class="indexterm" name="id43332 4"></a>227 <a class="indexterm" name="id4333 30"></a>228 <a class="indexterm" name="id43333 7"></a>226 <a class="indexterm" name="id433321"></a> 227 <a class="indexterm" name="id433328"></a> 228 <a class="indexterm" name="id433335"></a> 229 229 In addition, although it is not directly concerned with Samba, since 230 230 there is a delicate difference between the iconv() function, which is … … 234 234 of the limitations involved in the process. 235 235 </p><p> 236 <a class="indexterm" name="id4333 51"></a>236 <a class="indexterm" name="id433348"></a> 237 237 Although Mac OS X uses UTF-8 as its encoding method for filenames, 238 238 it uses an extended UTF-8 specification that Samba cannot handle, so 239 239 UTF-8 locale is not available for Mac OS X. 240 240 </p></dd><dt><span class="term">Shift_JIS series + vfs_cap (CAP encoding)</span></dt><dd><p> 241 <a class="indexterm" name="id4333 71"></a>242 <a class="indexterm" name="id43337 7"></a>243 <a class="indexterm" name="id43338 4"></a>241 <a class="indexterm" name="id433368"></a> 242 <a class="indexterm" name="id433375"></a> 243 <a class="indexterm" name="id433382"></a> 244 244 CAP encoding means a specification used in CAP and NetAtalk, file 245 245 server software for Macintosh. In the case of CAP encoding, for … … 270 270 To use CAP encoding on Samba-3, you should use the unix charset parameter and VFS 271 271 as in <a class="link" href="unicode.html#vfscap-intl" title="Example 30.1. VFS CAP">the VFS CAP smb.conf file</a>. 272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. 
VFS CAP</b></p><div class="example-contents"><table border="0" summary="Simple list" class="simplelist"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id4334 70"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id433482"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id433502"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p>273 <a class="indexterm" name="id43351 7"></a>274 <a class="indexterm" name="id43352 4"></a>275 <a class="indexterm" name="id4335 30"></a>276 <a class="indexterm" name="id43353 7"></a>272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. 
VFS CAP</b></p><div class="example-contents"><table border="0" summary="Simple list" class="simplelist"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id433468"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id433479"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id433500"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p> 273 <a class="indexterm" name="id433514"></a> 274 <a class="indexterm" name="id433521"></a> 275 <a class="indexterm" name="id433528"></a> 276 <a class="indexterm" name="id433535"></a> 277 277 You should set CP932 if using GNU libiconv for unix charset. With this setting, 278 278 filenames in the <span class="quote">“<span class="quote">cap-share</span>”</span> share are written with CAP encoding. 
279 </p></dd></dl></div></div><div class="sect2" title="Individual Implementations"><div class="titlepage"><div><div><h3 class="title"><a name="id43355 4"></a>Individual Implementations</h3></div></div></div><p>279 </p></dd></dl></div></div><div class="sect2" title="Individual Implementations"><div class="titlepage"><div><div><h3 class="title"><a name="id433552"></a>Individual Implementations</h3></div></div></div><p> 280 280 Here is some additional information regarding individual implementations: 281 281 </p><div class="variablelist"><dl><dt><span class="term">GNU libiconv</span></dt><dd><p> … … 300 300 </p><p> 301 301 Using the above glibc, these setting are available: 302 </p><table border="0" summary="Simple list" class="simplelist"><tr><td><a class="indexterm" name="id43362 3"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id433635"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id433646"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p>302 </p><table border="0" summary="Simple list" class="simplelist"><tr><td><a class="indexterm" name="id433621"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id433632"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id433644"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p> 303 303 </p><p> 304 304 Other Japanese locales (for example, Shift_JIS and EUC-JP) should not 305 305 be used because of the lack of the compatibility with Windows. 
306 </p></dd></dl></div></div><div class="sect2" title="Migration from Samba-2.2 Series"><div class="titlepage"><div><div><h3 class="title"><a name="id43366 8"></a>Migration from Samba-2.2 Series</h3></div></div></div><p>306 </p></dd></dl></div></div><div class="sect2" title="Migration from Samba-2.2 Series"><div class="titlepage"><div><div><h3 class="title"><a name="id433665"></a>Migration from Samba-2.2 Series</h3></div></div></div><p> 307 307 Prior to Samba-2.2 series, the <span class="quote">“<span class="quote">coding system</span>”</span> parameter was used. The default codepage in Samba 308 308 2.x was code page 850. In the Samba-3 series this has been replaced with the <a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a> parameter. <a class="link" href="unicode.html#japancharsets" title="Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3">Japanese Character Sets in Samba-2.2 and Samba-3</a> 309 309 shows the mapping table when migrating from the Samba-2.2 series to Samba-3. 310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. 
Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id43375 7" href="#ftn.id433757" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id433788" href="#ftn.id433788" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id433757" href="#id433757" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id433788" href="#id433788" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" title="Common Errors"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id433807"></a>Common Errors</h2></div></div></div><div class="sect2" title="CP850.so Can't Be Found"><div class="titlepage"><div><div><h3 class="title"><a name="id433812"></a>CP850.so Can't Be Found</h3></div></div></div><p><span class="quote">“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</span></p><p>310 
</p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id433754" href="#ftn.id433754" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id433785" href="#ftn.id433785" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id433754" href="#id433754" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id433785" href="#id433785" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" title="Common Errors"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id433804"></a>Common Errors</h2></div></div></div><div class="sect2" title="CP850.so Can't Be Found"><div class="titlepage"><div><div><h3 class="title"><a name="id433810"></a>CP850.so Can't Be Found</h3></div></div></div><p><span class="quote">“<span class="quote">Samba is complaining about a 
missing <code class="filename">CP850.so</code> file.</span>”</span></p><p> 311 311 CP850 is the default <a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a>. 312 312 The <a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a> is used to convert data to the codepage used by your DOS clients.
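The chapter text shown in this changeset describes a charset as a table that maps numbers to letters, and notes that multibyte charsets need more storage per character than single-byte codepages. The following is a minimal sketch of that idea, not part of the changeset itself. It uses Python's built-in codecs as stand-ins for the charsets named in the chapter; the function name `encoded_sizes` is hypothetical, and Python's `cp932` and `euc_jp` tables may differ in edge cases from the iconv() implementation a given Samba build uses.

```python
# Illustration only: compare how many bytes the same text occupies
# under several charsets mentioned in the chapter.
# Assumption: Python's codec names stand in for the iconv locale names.

def encoded_sizes(text, charsets=("cp850", "cp932", "euc_jp", "utf-8")):
    """Return {charset: byte length}, or None where the charset
    has no representation for some character in the text."""
    sizes = {}
    for name in charsets:
        try:
            sizes[name] = len(text.encode(name))
        except UnicodeEncodeError:
            # The charset's table has no entry for this character.
            sizes[name] = None
    return sizes


if __name__ == "__main__":
    for sample in ("abc", "日本語"):
        print(sample, encoded_sizes(sample))
```

Plain ASCII text takes one byte per character in every charset listed, while the Japanese sample cannot be represented in CP850 at all and takes a different number of bytes in CP932, EUC-JP, and UTF-8. This is why the chapter stresses that `dos charset` and `unix charset` must match what the clients and the filesystem actually use.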