- Timestamp:
- May 23, 2008, 6:56:41 AM (17 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
branches/samba-3.0/docs/htmldocs/Samba3-HOWTO/unicode.html
r44 r134 1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href=" samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.71.0"><link rel="start" href="index.html" title="The Official Samba-3 HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span><div class="affiliation"><div class="address"><p><code class="email"><<a href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id426339">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id426385">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id426504">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id426620">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id426649">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id426764">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id427327">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id427443">Migration from Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id427579">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id427584">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id426339"></a>Features and Benefits</h2></div></div></div><p>2 <a class="indexterm" name="id4 26347"></a>1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.73.2"><link rel="start" href="index.html" title="The Official Samba 3.2.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id450204">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450250">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450371">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450497">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450526">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id450646">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id451226">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id451340">Migration from Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id451479">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id451484">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id450204"></a>Features and Benefits</h2></div></div></div><p> 2 <a class="indexterm" name="id450212"></a> 3 3 Every industry eventually matures. One of the great areas of maturation is in 4 4 the focus that has been given over the past decade to make it possible for anyone … … 9 9 Of all the effort that has been brought to bear on providing native 10 10 language support for all computer users, the efforts of the 11 <a href="http://www.openi18n.org/" target="_top">Openi18n organization</a>11 <a class="ulink" href="http://www.openi18n.org/" target="_top">Openi18n organization</a> 12 12 is deserving of special mention. 13 13 </p><p> 14 <a class="indexterm" name="id4 26371"></a>14 <a class="indexterm" name="id450236"></a> 15 15 Samba-2.x supported a single locale through a mechanism called 16 16 <span class="emphasis"><em>codepages</em></span>. Samba-3 is destined to become a truly transglobal 17 17 file- and printer-sharing platform. 18 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 26385"></a>What Are Charsets and Unicode?</h2></div></div></div><p>19 <a class="indexterm" name="id4 26393"></a>18 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id450250"></a>What Are Charsets and Unicode?</h2></div></div></div><p> 19 <a class="indexterm" name="id450257"></a> 20 20 Computers communicate in numbers. In texts, each number is 21 21 translated to a corresponding letter. The meaning that will be assigned … … 23 23 </em></span> that is used. 24 24 </p><p> 25 <a class="indexterm" name="id4 26408"></a>26 <a class="indexterm" name="id4 26415"></a>25 <a class="indexterm" name="id450273"></a> 26 <a class="indexterm" name="id450280"></a> 27 27 A charset can be seen as a table that is used to translate numbers to 28 28 letters. Not all computers use the same charset (there are charsets … … 32 32 256 characters. Using this mode of encoding, each character takes exactly one byte. 33 33 </p><p> 34 <a class="indexterm" name="id4 26429"></a>35 <a class="indexterm" name="id4 26436"></a>34 <a class="indexterm" name="id450294"></a> 35 <a class="indexterm" name="id450301"></a> 36 36 There are also charsets that support extended characters, but those need at least 37 37 twice as much storage space as does ASCII encoding. Such charsets can contain … … 40 40 more then one byte to store one character. 41 41 </p><p> 42 <a class="indexterm" name="id4 26455"></a>42 <a class="indexterm" name="id450319"></a> 43 43 One standardized multibyte charset encoding scheme is known as 44 <a href="http://www.unicode.org/" target="_top">unicode</a>. A big advantage of using a44 <a class="ulink" href="http://www.unicode.org/" target="_top">unicode</a>. A big advantage of using a 45 45 multibyte charset is that you only need one. There is no need to make sure two 46 46 computers use the same charset when they are communicating. 47 47 </p><p> 48 <a class="indexterm" name="id4 26473"></a>49 <a class="indexterm" name="id4 26479"></a>50 <a class="indexterm" name="id4 26486"></a>48 <a class="indexterm" name="id450337"></a> 49 <a class="indexterm" name="id450344"></a> 50 <a class="indexterm" name="id450351"></a> 51 51 Old Windows clients use single-byte charsets, named 52 52 <em class="parameter"><code>codepages</code></em>, by Microsoft. However, there is no support for … … 54 54 have to make sure you are using the same charset when talking to an older client. 55 55 Newer clients (Windows NT, 200x, XP) talk Unicode over the wire. 56 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 26504"></a>Samba and Charsets</h2></div></div></div><p>57 <a class="indexterm" name="id4 26512"></a>58 <a class="indexterm" name="id4 26518"></a>56 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id450371"></a>Samba and Charsets</h2></div></div></div><p> 57 <a class="indexterm" name="id450379"></a> 58 <a class="indexterm" name="id450385"></a> 59 59 As of Samba-3, Samba can (and will) talk Unicode over the wire. Internally, 60 60 Samba knows of three kinds of character sets: 61 </p><div class="variablelist"><dl><dt><span class="term"><a class=" indexterm" name="id426533"></a>unix charset</span></dt><dd><p>62 <a class="indexterm" name="id4 26544"></a>63 <a class="indexterm" name="id4 26551"></a>61 </p><div class="variablelist"><dl><dt><span class="term"><a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a></span></dt><dd><p> 62 <a class="indexterm" name="id450415"></a> 63 <a class="indexterm" name="id450421"></a> 64 64 This is the charset used internally by your operating system. 65 65 The default is <code class="constant">UTF-8</code>, which is fine for most … … 67 67 in previous Samba releases was to save filenames in the encoding of the 68 68 clients for example, CP850 for Western European countries. 69 </p></dd><dt><span class="term"><a class=" indexterm" name="id426572"></a>display charset</span></dt><dd><p>This is the charset Samba uses to print messages69 </p></dd><dt><span class="term"><a class="link" href="smb.conf.5.html#DISPLAYCHARSET">display charset</a></span></dt><dd><p>This is the charset Samba uses to print messages 70 70 on your screen. It should generally be the same as the <em class="parameter"><code>unix charset</code></em>. 71 </p></dd><dt><span class="term"><a class=" indexterm" name="id426596"></a>dos charset</span></dt><dd><p>This is the charset Samba uses when communicating with71 </p></dd><dt><span class="term"><a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a></span></dt><dd><p>This is the charset Samba uses when communicating with 72 72 DOS and Windows 9x/Me clients. It will talk Unicode to all newer clients. 73 73 The default depends on the charsets you have installed on your system. 74 74 Run <code class="literal">testparm -v | grep "dos charset"</code> to see 75 75 what the default is on your system. 76 </p></dd></dl></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 26620"></a>Conversion from Old Names</h2></div></div></div><p>77 <a class="indexterm" name="id4 26627"></a>76 </p></dd></dl></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id450497"></a>Conversion from Old Names</h2></div></div></div><p> 77 <a class="indexterm" name="id450505"></a> 78 78 Because previous Samba versions did not do any charset conversion, 79 79 characters in filenames are usually not correct in the UNIX charset but only 80 80 for the local charset used by the DOS/Windows clients. 81 </p><p>Bjoern Jacke has written a utility named <a href="http://j3e.de/linux/convmv/" target="_top">convmv</a>81 </p><p>Bjoern Jacke has written a utility named <a class="ulink" href="http://j3e.de/linux/convmv/" target="_top">convmv</a> 82 82 that can convert whole directory structures to different charsets with one single command. 83 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 26649"></a>Japanese Charsets</h2></div></div></div><p>83 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id450526"></a>Japanese Charsets</h2></div></div></div><p> 84 84 Setting up Japanese charsets is quite difficult. This is mainly because: 85 85 </p><div class="itemizedlist"><ul type="disc"><li><p> 86 <a class="indexterm" name="id4 26664"></a>86 <a class="indexterm" name="id450541"></a> 87 87 The Windows character set is extended from the original legacy Japanese 88 88 standard (JIS X 0208) and is not standardized. This means that the strictly 89 89 standardized implementation cannot support the full Windows character set. 90 90 </p></li><li><p> 91 <a class="indexterm" name="id4 26677"></a>92 <a class="indexterm" name="id4 26684"></a>93 <a class="indexterm" name="id4 26690"></a>94 <a class="indexterm" name="id4 26697"></a>95 <a class="indexterm" name="id4 26703"></a>91 <a class="indexterm" name="id450554"></a> 92 <a class="indexterm" name="id450561"></a> 93 <a class="indexterm" name="id450568"></a> 94 <a class="indexterm" name="id450574"></a> 95 <a class="indexterm" name="id450581"></a> 96 96 Mainly for historical reasons, there are several encoding methods in 97 97 Japanese, which are not fully compatible with each other. There are … … 113 113 the charset parameters depends on the implementation of iconv() you are using. 114 114 </p><p> 115 <a class="indexterm" name="id4 26733"></a>116 <a class="indexterm" name="id4 26740"></a>117 <a class="indexterm" name="id4 26746"></a>118 <a class="indexterm" name="id4 26753"></a>115 <a class="indexterm" name="id450615"></a> 116 <a class="indexterm" name="id450622"></a> 117 <a class="indexterm" name="id450629"></a> 118 <a class="indexterm" name="id450635"></a> 119 119 Though 2-byte fixed UCS-2 encoding is used in Windows internally, 120 120 Shift_JIS series encoding is usually used in Japanese environments 121 121 as ASCII encoding is in English environments. 122 </p></li></ul></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id4 26764"></a>Basic Parameter Setting</h3></div></div></div><p>123 <a class="indexterm" name="id4 26771"></a>124 The <a class=" indexterm" name="id426778"></a>dos charsetand125 <a class=" indexterm" name="id426785"></a>display charset122 </p></li></ul></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id450646"></a>Basic Parameter Setting</h3></div></div></div><p> 123 <a class="indexterm" name="id450653"></a> 124 The <a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a> and 125 <a class="link" href="smb.conf.5.html#DISPLAYCHARSET">display charset</a> 126 126 should be set to the locale compatible with the character set 127 127 and encoding method used on Windows. This is usually CP932 128 128 but sometimes has a different name. 129 129 </p><p> 130 <a class="indexterm" name="id4 26796"></a>131 <a class="indexterm" name="id4 26803"></a>132 <a class="indexterm" name="id4 26810"></a>133 The <a class=" indexterm" name="id426817"></a>unix charsetcan be either Shift_JIS series,130 <a class="indexterm" name="id450686"></a> 131 <a class="indexterm" name="id450692"></a> 132 <a class="indexterm" name="id450699"></a> 133 The <a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a> can be either Shift_JIS series, 134 134 EUC-JP series, or UTF-8. UTF-8 is always available, but the availability of other locales 135 135 and the name itself depends on the system. 136 136 </p><p> 137 137 Additionally, you can consider using the Shift_JIS series as the 138 value of the <a class=" indexterm" name="id426829"></a>unix charset138 value of the <a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a> 139 139 parameter by using the vfs_cap module, which does the same thing as 140 140 setting “<span class="quote">coding system = CAP</span>” in the Samba 2.2 series. 141 141 </p><p> 142 Where to set <a class=" indexterm" name="id426844"></a>unix charset142 Where to set <a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a> 143 143 to is a difficult question. Here is a list of details, advantages, and 144 144 disadvantages of using a certain value. … … 167 167 with Shift_JIS. 168 168 </p></dd><dt><span class="term">EUC-JP series</span></dt><dd><p> 169 <a class="indexterm" name="id4 26917"></a>170 <a class="indexterm" name="id4 26924"></a>169 <a class="indexterm" name="id450816"></a> 170 <a class="indexterm" name="id450823"></a> 171 171 EUC-JP series means a locale that is equivalent to the industry 172 172 standard called EUC-JP, widely used in Japanese UNIX (although EUC … … 177 177 “<span class="quote">.txt</span>” (an 8-byte BINARY string). 178 178 </p><p> 179 <a class="indexterm" name="id4 26945"></a>180 <a class="indexterm" name="id4 26952"></a>181 <a class="indexterm" name="id4 26959"></a>182 <a class="indexterm" name="id4 26965"></a>183 <a class="indexterm" name="id4 26972"></a>184 <a class="indexterm" name="id4 26979"></a>185 <a class="indexterm" name="id4 26986"></a>186 <a class="indexterm" name="id4 26993"></a>187 <a class="indexterm" name="id4 26999"></a>188 <a class="indexterm" name="id4 27006"></a>179 <a class="indexterm" name="id450844"></a> 180 <a class="indexterm" name="id450851"></a> 181 <a class="indexterm" name="id450858"></a> 182 <a class="indexterm" name="id450864"></a> 183 <a class="indexterm" name="id450871"></a> 184 <a class="indexterm" name="id450878"></a> 185 <a class="indexterm" name="id450885"></a> 186 <a class="indexterm" name="id450892"></a> 187 <a class="indexterm" name="id450898"></a> 188 <a class="indexterm" name="id450905"></a> 189 189 Since EUC-JP is usually used on open source UNIX, Linux, and FreeBSD, and on commercial-based UNIX, Solaris, 190 190 IRIX, and Tru64 UNIX as Japanese locale (however, it is also possible on Solaris to use Shift_JIS and UTF-8, … … 199 199 during parsing filenames. 200 200 </p><p> 201 <a class="indexterm" name="id4 27033"></a>201 <a class="indexterm" name="id450932"></a> 202 202 Moreover, if you built Samba using differently installed libiconv, 203 203 the eucJP-ms locale included in libiconv and EUC-JP series locale … … 224 224 written from Windows on UNIX. 225 225 </p><p> 226 <a class="indexterm" name="id4 27094"></a>227 <a class="indexterm" name="id4 27100"></a>228 <a class="indexterm" name="id4 27107"></a>226 <a class="indexterm" name="id450996"></a> 227 <a class="indexterm" name="id451003"></a> 228 <a class="indexterm" name="id451010"></a> 229 229 In addition, although it is not directly concerned with Samba, since 230 230 there is a delicate difference between the iconv() function, which is … … 234 234 of the limitations involved in the process. 235 235 </p><p> 236 <a class="indexterm" name="id4 27121"></a>236 <a class="indexterm" name="id451024"></a> 237 237 Although Mac OS X uses UTF-8 as its encoding method for filenames, 238 238 it uses an extended UTF-8 specification that Samba cannot handle, so 239 239 UTF-8 locale is not available for Mac OS X. 240 240 </p></dd><dt><span class="term">Shift_JIS series + vfs_cap (CAP encoding)</span></dt><dd><p> 241 <a class="indexterm" name="id4 27140"></a>242 <a class="indexterm" name="id4 27147"></a>243 <a class="indexterm" name="id4 27154"></a>241 <a class="indexterm" name="id451043"></a> 242 <a class="indexterm" name="id451050"></a> 243 <a class="indexterm" name="id451057"></a> 244 244 CAP encoding means a specification used in CAP and NetAtalk, file 245 245 server software for Macintosh. In the case of CAP encoding, for … … 269 269 </p><p> 270 270 To use CAP encoding on Samba-3, you should use the unix charset parameter and VFS 271 as in <a href="unicode.html#vfscap-intl" title="Example 30.1. VFS CAP">the VFS CAP smb.conf file</a>.272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. VFS CAP</b></p><div class="example-contents"><table class="simplelist" border="0" summary="Simple list"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id4 27240"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id427252"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id427274"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p>273 <a class="indexterm" name="id4 27289"></a>274 <a class="indexterm" name="id4 27296"></a>275 <a class="indexterm" name="id4 27303"></a>276 <a class="indexterm" name="id4 27310"></a>271 as in <a class="link" href="unicode.html#vfscap-intl" title="Example 30.1. VFS CAP">the VFS CAP smb.conf file</a>. 272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. VFS CAP</b></p><div class="example-contents"><table class="simplelist" border="0" summary="Simple list"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id451142"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id451154"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id451174"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p> 273 <a class="indexterm" name="id451189"></a> 274 <a class="indexterm" name="id451196"></a> 275 <a class="indexterm" name="id451202"></a> 276 <a class="indexterm" name="id451209"></a> 277 277 You should set CP932 if using GNU libiconv for unix charset. With this setting, 278 278 filenames in the “<span class="quote">cap-share</span>” share are written with CAP encoding. 279 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id4 27327"></a>Individual Implementations</h3></div></div></div><p>279 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id451226"></a>Individual Implementations</h3></div></div></div><p> 280 280 Here is some additional information regarding individual implementations: 281 281 </p><div class="variablelist"><dl><dt><span class="term">GNU libiconv</span></dt><dd><p> 282 282 To handle Japanese correctly, you should apply the patch 283 <a href="http://www2d.biglobe.ne.jp/~msyk/software/libiconv-patch.html" target="_top">libiconv-1.8-cp932-patch.diff.gz</a>283 <a class="ulink" href="http://www2d.biglobe.ne.jp/~msyk/software/libiconv-patch.html" target="_top">libiconv-1.8-cp932-patch.diff.gz</a> 284 284 to libiconv-1.8. 285 285 </p><p> … … 296 296 be used because of the lack of the compatibility with Windows. 297 297 </p></dd><dt><span class="term">GNU glibc</span></dt><dd><p> 298 To handle Japanese correctly, you should apply a <a href="http://www2d.biglobe.ne.jp/~msyk/software/glibc/" target="_top">patch</a>298 To handle Japanese correctly, you should apply a <a class="ulink" href="http://www2d.biglobe.ne.jp/~msyk/software/glibc/" target="_top">patch</a> 299 299 to glibc-2.2.5/2.3.1/2.3.2 or should use the patch-merged versions, glibc-2.3.3 or later. 300 300 </p><p> 301 301 Using the above glibc, these setting are available: 302 </p><table class="simplelist" border="0" summary="Simple list"><tr><td><a class="indexterm" name="id4 27396"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id427408"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id427421"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p>302 </p><table class="simplelist" border="0" summary="Simple list"><tr><td><a class="indexterm" name="id451295"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id451307"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id451318"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p> 303 303 </p><p> 304 304 Other Japanese locales (for example, Shift_JIS and EUC-JP) should not 305 305 be used because of the lack of the compatibility with Windows. 306 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id4 27443"></a>Migration from Samba-2.2 Series</h3></div></div></div><p>306 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id451340"></a>Migration from Samba-2.2 Series</h3></div></div></div><p> 307 307 Prior to Samba-2.2 series, the “<span class="quote">coding system</span>” parameter was used. The default codepage in Samba 308 2.x was code page 850. In the Samba-3 series this has been replaced with the <a class=" indexterm" name="id427456"></a>unix charset parameter. <ahref="unicode.html#japancharsets" title="Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3">Japanese Character Sets in Samba-2.2 and Samba-3</a>308 2.x was code page 850. In the Samba-3 series this has been replaced with the <a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a> parameter. <a class="link" href="unicode.html#japancharsets" title="Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3">Japanese Character Sets in Samba-2.2 and Samba-3</a> 309 309 shows the mapping table when migrating from the Samba-2.2 series to Samba-3. 310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id4 27528" href="#ftn.id427528">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id427559" href="#ftn.id427559">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id427528" href="#id427528">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id427559" href="#id427559">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id427579"></a>Common Errors</h2></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id427584"></a>CP850.so Can't Be Found</h3></div></div></div><p>“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</p><p>311 CP850 is the default <a class=" indexterm" name="id427603"></a>dos charset.312 The <a class=" indexterm" name="id427610"></a>dos charsetis used to convert data to the codepage used by your DOS clients.310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id451429" href="#ftn.id451429" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id451460" href="#ftn.id451460" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id451429" href="#id451429" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id451460" href="#id451460" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id451479"></a>Common Errors</h2></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id451484"></a>CP850.so Can't Be Found</h3></div></div></div><p>“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</p><p> 311 CP850 is the default <a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a>. 312 The <a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a> is used to convert data to the codepage used by your DOS clients. 313 313 If you do not have any DOS clients, you can safely ignore this message. </p><p> 314 314 CP850 should be supported by your local iconv implementation. Make sure you have all the required packages installed.
Note:
See TracChangeset
for help on using the changeset viewer.