- Timestamp:
- May 27, 2009, 9:08:03 AM (16 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
branches/samba-3.2.x/docs/htmldocs/Samba3-HOWTO/unicode.html
r226 r231 1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.7 3.1"><link rel="start" href="index.html" title="The Official Samba 3.2.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id2662849">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2662901">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2663035">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2663171">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2663202">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id2663342">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id2663994">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id2664118">Migration from Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id2664264">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id2664270">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2662849"></a>Features and Benefits</h2></div></div></div><p>2 <a class="indexterm" name="id266 2857"></a>1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.74.0"><link rel="home" href="index.html" title="The Official Samba 3.2.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="orgname">The Samba Team</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="orgname">Samba Team</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id2669841">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2669892">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2670030">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2670166">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id2670197">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id2670337">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id2670985">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id2671108">Migration from Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id2671255">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id2671261">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2669841"></a>Features and Benefits</h2></div></div></div><p> 2 <a class="indexterm" name="id2669849"></a> 3 3 Every industry eventually matures. One of the great areas of maturation is in 4 4 the focus that has been given over the past decade to make it possible for anyone … … 12 12 is deserving of special mention. 13 13 </p><p> 14 <a class="indexterm" name="id266 2886"></a>14 <a class="indexterm" name="id2669877"></a> 15 15 Samba-2.x supported a single locale through a mechanism called 16 16 <span class="emphasis"><em>codepages</em></span>. Samba-3 is destined to become a truly transglobal 17 17 file- and printer-sharing platform. 18 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id266 2901"></a>What Are Charsets and Unicode?</h2></div></div></div><p>19 <a class="indexterm" name="id266 2909"></a>18 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2669892"></a>What Are Charsets and Unicode?</h2></div></div></div><p> 19 <a class="indexterm" name="id2669900"></a> 20 20 Computers communicate in numbers. In texts, each number is 21 21 translated to a corresponding letter. The meaning that will be assigned … … 23 23 </em></span> that is used. 24 24 </p><p> 25 <a class="indexterm" name="id266 2926"></a>26 <a class="indexterm" name="id266 2933"></a>25 <a class="indexterm" name="id2669918"></a> 26 <a class="indexterm" name="id2669924"></a> 27 27 A charset can be seen as a table that is used to translate numbers to 28 28 letters. Not all computers use the same charset (there are charsets … … 32 32 256 characters. Using this mode of encoding, each character takes exactly one byte. 33 33 </p><p> 34 <a class="indexterm" name="id266 2951"></a>35 <a class="indexterm" name="id266 2958"></a>34 <a class="indexterm" name="id2669942"></a> 35 <a class="indexterm" name="id2669949"></a> 36 36 There are also charsets that support extended characters, but those need at least 37 37 twice as much storage space as does ASCII encoding. Such charsets can contain … … 40 40 more then one byte to store one character. 41 41 </p><p> 42 <a class="indexterm" name="id266 2980"></a>42 <a class="indexterm" name="id2669971"></a> 43 43 One standardized multibyte charset encoding scheme is known as 44 44 <a class="ulink" href="http://www.unicode.org/" target="_top">unicode</a>. A big advantage of using a … … 46 46 computers use the same charset when they are communicating. 47 47 </p><p> 48 <a class="indexterm" name="id266 3000"></a>49 <a class="indexterm" name="id266 3007"></a>50 <a class="indexterm" name="id26 63014"></a>48 <a class="indexterm" name="id2669991"></a> 49 <a class="indexterm" name="id2669998"></a> 50 <a class="indexterm" name="id2670005"></a> 51 51 Old Windows clients use single-byte charsets, named 52 52 <em class="parameter"><code>codepages</code></em>, by Microsoft. However, there is no support for … … 54 54 have to make sure you are using the same charset when talking to an older client. 55 55 Newer clients (Windows NT, 200x, XP) talk Unicode over the wire. 56 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id26 63035"></a>Samba and Charsets</h2></div></div></div><p>57 <a class="indexterm" name="id26 63042"></a>58 <a class="indexterm" name="id26 63049"></a>56 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2670030"></a>Samba and Charsets</h2></div></div></div><p> 57 <a class="indexterm" name="id2670038"></a> 58 <a class="indexterm" name="id2670045"></a> 59 59 As of Samba-3, Samba can (and will) talk Unicode over the wire. Internally, 60 60 Samba knows of three kinds of character sets: 61 </p><div class="variablelist"><dl><dt><span class="term"><a class="link" href="smb.conf.5.html#UNIXCHARSET" >unix charset</a></span></dt><dd><p>62 <a class="indexterm" name="id26 63082"></a>63 <a class="indexterm" name="id26 63088"></a>61 </p><div class="variablelist"><dl><dt><span class="term"><a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a></span></dt><dd><p> 62 <a class="indexterm" name="id2670077"></a> 63 <a class="indexterm" name="id2670083"></a> 64 64 This is the charset used internally by your operating system. 65 65 The default is <code class="constant">UTF-8</code>, which is fine for most … … 67 67 in previous Samba releases was to save filenames in the encoding of the 68 68 clients for example, CP850 for Western European countries. 69 </p></dd><dt><span class="term"><a class="link" href="smb.conf.5.html#DISPLAYCHARSET" >display charset</a></span></dt><dd><p>This is the charset Samba uses to print messages69 </p></dd><dt><span class="term"><a class="link" href="smb.conf.5.html#DISPLAYCHARSET" target="_top">display charset</a></span></dt><dd><p>This is the charset Samba uses to print messages 70 70 on your screen. It should generally be the same as the <em class="parameter"><code>unix charset</code></em>. 71 </p></dd><dt><span class="term"><a class="link" href="smb.conf.5.html#DOSCHARSET" >dos charset</a></span></dt><dd><p>This is the charset Samba uses when communicating with71 </p></dd><dt><span class="term"><a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a></span></dt><dd><p>This is the charset Samba uses when communicating with 72 72 DOS and Windows 9x/Me clients. It will talk Unicode to all newer clients. 73 73 The default depends on the charsets you have installed on your system. 74 74 Run <code class="literal">testparm -v | grep "dos charset"</code> to see 75 75 what the default is on your system. 76 </p></dd></dl></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id26 63171"></a>Conversion from Old Names</h2></div></div></div><p>77 <a class="indexterm" name="id26 63179"></a>76 </p></dd></dl></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2670166"></a>Conversion from Old Names</h2></div></div></div><p> 77 <a class="indexterm" name="id2670174"></a> 78 78 Because previous Samba versions did not do any charset conversion, 79 79 characters in filenames are usually not correct in the UNIX charset but only … … 81 81 </p><p>Bjoern Jacke has written a utility named <a class="ulink" href="http://j3e.de/linux/convmv/" target="_top">convmv</a> 82 82 that can convert whole directory structures to different charsets with one single command. 83 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id26 63202"></a>Japanese Charsets</h2></div></div></div><p>83 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2670197"></a>Japanese Charsets</h2></div></div></div><p> 84 84 Setting up Japanese charsets is quite difficult. This is mainly because: 85 85 </p><div class="itemizedlist"><ul type="disc"><li><p> 86 <a class="indexterm" name="id26 63218"></a>86 <a class="indexterm" name="id2670213"></a> 87 87 The Windows character set is extended from the original legacy Japanese 88 88 standard (JIS X 0208) and is not standardized. This means that the strictly 89 89 standardized implementation cannot support the full Windows character set. 90 90 </p></li><li><p> 91 <a class="indexterm" name="id26 63233"></a>92 <a class="indexterm" name="id26 63240"></a>93 <a class="indexterm" name="id26 63247"></a>94 <a class="indexterm" name="id26 63253"></a>95 <a class="indexterm" name="id26 63260"></a>91 <a class="indexterm" name="id2670228"></a> 92 <a class="indexterm" name="id2670235"></a> 93 <a class="indexterm" name="id2670242"></a> 94 <a class="indexterm" name="id2670249"></a> 95 <a class="indexterm" name="id2670255"></a> 96 96 Mainly for historical reasons, there are several encoding methods in 97 97 Japanese, which are not fully compatible with each other. There are … … 113 113 the charset parameters depends on the implementation of iconv() you are using. 114 114 </p><p> 115 <a class="indexterm" name="id26 63309"></a>116 <a class="indexterm" name="id26 63316"></a>117 <a class="indexterm" name="id26 63323"></a>118 <a class="indexterm" name="id26 63330"></a>115 <a class="indexterm" name="id2670305"></a> 116 <a class="indexterm" name="id2670312"></a> 117 <a class="indexterm" name="id2670318"></a> 118 <a class="indexterm" name="id2670325"></a> 119 119 Though 2-byte fixed UCS-2 encoding is used in Windows internally, 120 120 Shift_JIS series encoding is usually used in Japanese environments 121 121 as ASCII encoding is in English environments. 122 </p></li></ul></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id26 63342"></a>Basic Parameter Setting</h3></div></div></div><p>123 <a class="indexterm" name="id26 63349"></a>124 The <a class="link" href="smb.conf.5.html#DOSCHARSET" >dos charset</a> and125 <a class="link" href="smb.conf.5.html#DISPLAYCHARSET" >display charset</a>122 </p></li></ul></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2670337"></a>Basic Parameter Setting</h3></div></div></div><p> 123 <a class="indexterm" name="id2670344"></a> 124 The <a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a> and 125 <a class="link" href="smb.conf.5.html#DISPLAYCHARSET" target="_top">display charset</a> 126 126 should be set to the locale compatible with the character set 127 127 and encoding method used on Windows. This is usually CP932 128 128 but sometimes has a different name. 129 129 </p><p> 130 <a class="indexterm" name="id26 63385"></a>131 <a class="indexterm" name="id26 63392"></a>132 <a class="indexterm" name="id26 63398"></a>133 The <a class="link" href="smb.conf.5.html#UNIXCHARSET" >unix charset</a> can be either Shift_JIS series,130 <a class="indexterm" name="id2670381"></a> 131 <a class="indexterm" name="id2670387"></a> 132 <a class="indexterm" name="id2670394"></a> 133 The <a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a> can be either Shift_JIS series, 134 134 EUC-JP series, or UTF-8. UTF-8 is always available, but the availability of other locales 135 135 and the name itself depends on the system. 136 136 </p><p> 137 137 Additionally, you can consider using the Shift_JIS series as the 138 value of the <a class="link" href="smb.conf.5.html#UNIXCHARSET" >unix charset</a>138 value of the <a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a> 139 139 parameter by using the vfs_cap module, which does the same thing as 140 140 setting “<span class="quote">coding system = CAP</span>” in the Samba 2.2 series. 141 141 </p><p> 142 Where to set <a class="link" href="smb.conf.5.html#UNIXCHARSET" >unix charset</a>142 Where to set <a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a> 143 143 to is a difficult question. Here is a list of details, advantages, and 144 144 disadvantages of using a certain value. … … 167 167 with Shift_JIS. 168 168 </p></dd><dt><span class="term">EUC-JP series</span></dt><dd><p> 169 <a class="indexterm" name="id26 63538"></a>170 <a class="indexterm" name="id26 63545"></a>169 <a class="indexterm" name="id2670528"></a> 170 <a class="indexterm" name="id2670535"></a> 171 171 EUC-JP series means a locale that is equivalent to the industry 172 172 standard called EUC-JP, widely used in Japanese UNIX (although EUC … … 177 177 “<span class="quote">.txt</span>” (an 8-byte BINARY string). 178 178 </p><p> 179 <a class="indexterm" name="id26 63569"></a>180 <a class="indexterm" name="id26 63576"></a>181 <a class="indexterm" name="id26 63583"></a>182 <a class="indexterm" name="id26 63590"></a>183 <a class="indexterm" name="id26 63597"></a>184 <a class="indexterm" name="id26 63603"></a>185 <a class="indexterm" name="id26 63610"></a>186 <a class="indexterm" name="id26 63617"></a>187 <a class="indexterm" name="id26 63624"></a>188 <a class="indexterm" name="id26 63630"></a>179 <a class="indexterm" name="id2670560"></a> 180 <a class="indexterm" name="id2670567"></a> 181 <a class="indexterm" name="id2670573"></a> 182 <a class="indexterm" name="id2670580"></a> 183 <a class="indexterm" name="id2670587"></a> 184 <a class="indexterm" name="id2670594"></a> 185 <a class="indexterm" name="id2670601"></a> 186 <a class="indexterm" name="id2670607"></a> 187 <a class="indexterm" name="id2670614"></a> 188 <a class="indexterm" name="id2670621"></a> 189 189 Since EUC-JP is usually used on open source UNIX, Linux, and FreeBSD, and on commercial-based UNIX, Solaris, 190 190 IRIX, and Tru64 UNIX as Japanese locale (however, it is also possible on Solaris to use Shift_JIS and UTF-8, … … 199 199 during parsing filenames. 200 200 </p><p> 201 <a class="indexterm" name="id26 63663"></a>201 <a class="indexterm" name="id2670654"></a> 202 202 Moreover, if you built Samba using differently installed libiconv, 203 203 the eucJP-ms locale included in libiconv and EUC-JP series locale … … 224 224 written from Windows on UNIX. 225 225 </p><p> 226 <a class="indexterm" name="id26 63744"></a>227 <a class="indexterm" name="id26 63751"></a>228 <a class="indexterm" name="id26 63757"></a>226 <a class="indexterm" name="id2670735"></a> 227 <a class="indexterm" name="id2670742"></a> 228 <a class="indexterm" name="id2670749"></a> 229 229 In addition, although it is not directly concerned with Samba, since 230 230 there is a delicate difference between the iconv() function, which is … … 234 234 of the limitations involved in the process. 235 235 </p><p> 236 <a class="indexterm" name="id26 63774"></a>236 <a class="indexterm" name="id2670766"></a> 237 237 Although Mac OS X uses UTF-8 as its encoding method for filenames, 238 238 it uses an extended UTF-8 specification that Samba cannot handle, so 239 239 UTF-8 locale is not available for Mac OS X. 240 240 </p></dd><dt><span class="term">Shift_JIS series + vfs_cap (CAP encoding)</span></dt><dd><p> 241 <a class="indexterm" name="id26 63796"></a>242 <a class="indexterm" name="id26 63803"></a>243 <a class="indexterm" name="id26 63809"></a>241 <a class="indexterm" name="id2670786"></a> 242 <a class="indexterm" name="id2670793"></a> 243 <a class="indexterm" name="id2670800"></a> 244 244 CAP encoding means a specification used in CAP and NetAtalk, file 245 245 server software for Macintosh. In the case of CAP encoding, for … … 270 270 To use CAP encoding on Samba-3, you should use the unix charset parameter and VFS 271 271 as in <a class="link" href="unicode.html#vfscap-intl" title="Example 30.1. VFS CAP">the VFS CAP smb.conf file</a>. 272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. VFS CAP</b></p><div class="example-contents"><table class="simplelist" border="0" summary="Simple list"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id26 63908"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id2663920"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id2663941"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p>273 <a class="indexterm" name="id26 63956"></a>274 <a class="indexterm" name="id26 63962"></a>275 <a class="indexterm" name="id26 63969"></a>276 <a class="indexterm" name="id26 63976"></a>272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. VFS CAP</b></p><div class="example-contents"><table class="simplelist" border="0" summary="Simple list"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id2670899"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id2670911"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id2670932"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p> 273 <a class="indexterm" name="id2670946"></a> 274 <a class="indexterm" name="id2670953"></a> 275 <a class="indexterm" name="id2670960"></a> 276 <a class="indexterm" name="id2670967"></a> 277 277 You should set CP932 if using GNU libiconv for unix charset. With this setting, 278 278 filenames in the “<span class="quote">cap-share</span>” share are written with CAP encoding. 279 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id26 63994"></a>Individual Implementations</h3></div></div></div><p>279 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2670985"></a>Individual Implementations</h3></div></div></div><p> 280 280 Here is some additional information regarding individual implementations: 281 281 </p><div class="variablelist"><dl><dt><span class="term">GNU libiconv</span></dt><dd><p> … … 300 300 </p><p> 301 301 Using the above glibc, these setting are available: 302 </p><table class="simplelist" border="0" summary="Simple list"><tr><td><a class="indexterm" name="id26 64071"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id2664082"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id2664095"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p>302 </p><table class="simplelist" border="0" summary="Simple list"><tr><td><a class="indexterm" name="id2671062"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id2671073"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id2671085"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p> 303 303 </p><p> 304 304 Other Japanese locales (for example, Shift_JIS and EUC-JP) should not 305 305 be used because of the lack of the compatibility with Windows. 306 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id26 64118"></a>Migration from Samba-2.2 Series</h3></div></div></div><p>306 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2671108"></a>Migration from Samba-2.2 Series</h3></div></div></div><p> 307 307 Prior to Samba-2.2 series, the “<span class="quote">coding system</span>” parameter was used. The default codepage in Samba 308 2.x was code page 850. In the Samba-3 series this has been replaced with the <a class="link" href="smb.conf.5.html#UNIXCHARSET" >unix charset</a> parameter. <a class="link" href="unicode.html#japancharsets" title="Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3">Japanese Character Sets in Samba-2.2 and Samba-3</a>308 2.x was code page 850. In the Samba-3 series this has been replaced with the <a class="link" href="smb.conf.5.html#UNIXCHARSET" target="_top">unix charset</a> parameter. <a class="link" href="unicode.html#japancharsets" title="Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3">Japanese Character Sets in Samba-2.2 and Samba-3</a> 309 309 shows the mapping table when migrating from the Samba-2.2 series to Samba-3. 310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id26 64213" href="#ftn.id2664213" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id2664244" href="#ftn.id2664244" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id2664213" href="#id2664213" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id2664244" href="#id2664244" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2664264"></a>Common Errors</h2></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2664270"></a>CP850.so Can't Be Found</h3></div></div></div><p>“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</p><p>311 CP850 is the default <a class="link" href="smb.conf.5.html#DOSCHARSET" >dos charset</a>.312 The <a class="link" href="smb.conf.5.html#DOSCHARSET" >dos charset</a> is used to convert data to the codepage used by your DOS clients.310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id2671204" href="#ftn.id2671204" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id2671235" href="#ftn.id2671235" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id2671204" href="#id2671204" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id2671235" href="#id2671235" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id2671255"></a>Common Errors</h2></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id2671261"></a>CP850.so Can't Be Found</h3></div></div></div><p>“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</p><p> 311 CP850 is the default <a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a>. 312 The <a class="link" href="smb.conf.5.html#DOSCHARSET" target="_top">dos charset</a> is used to convert data to the codepage used by your DOS clients. 313 313 If you do not have any DOS clients, you can safely ignore this message. </p><p> 314 314 CP850 should be supported by your local iconv implementation. Make sure you have all the required packages installed.
Note:
See TracChangeset
for help on using the changeset viewer.