- Timestamp:
- Sep 30, 2008, 7:14:08 AM (17 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
branches/samba-3.0/docs/htmldocs/Samba3-HOWTO/unicode.html
r134 r158 1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.73.2"><link rel="start" href="index.html" title="The Official Samba 3.2.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id4 50204">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450250">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450371">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450497">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id450526">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id450646">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id451226">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id451340">Migration from Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id451479">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id451484">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id450204"></a>Features and Benefits</h2></div></div></div><p>2 <a class="indexterm" name="id4 50212"></a>1 <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Chapter 30. Unicode/Charsets</title><link rel="stylesheet" href="../samba.css" type="text/css"><meta name="generator" content="DocBook XSL Stylesheets V1.73.2"><link rel="start" href="index.html" title="The Official Samba 3.2.x HOWTO and Reference Guide"><link rel="up" href="optional.html" title="Part III. Advanced Configuration"><link rel="prev" href="integrate-ms-networks.html" title="Chapter 29. Integrating MS Windows Networks with Samba"><link rel="next" href="Backup.html" title="Chapter 31. Backup Techniques"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter 30. Unicode/Charsets</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="integrate-ms-networks.html">Prev</a> </td><th width="60%" align="center">Part III. Advanced Configuration</th><td width="20%" align="right"> <a accesskey="n" href="Backup.html">Next</a></td></tr></table><hr></div><div class="chapter" lang="en"><div class="titlepage"><div><div><h2 class="title"><a name="unicode"></a>Chapter 30. Unicode/Charsets</h2></div><div><div class="author"><h3 class="author"><span class="firstname">Jelmer</span> <span class="othername">R.</span> <span class="surname">Vernooij</span></h3><div class="affiliation"><span class="orgname">The Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jelmer@samba.org">jelmer@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">John</span> <span class="othername">H.</span> <span class="surname">Terpstra</span></h3><div class="affiliation"><span class="orgname">Samba Team<br></span><div class="address"><p><code class="email"><<a class="email" href="mailto:jht@samba.org">jht@samba.org</a>></code></p></div></div></div></div><div><div class="author"><h3 class="author"><span class="firstname">TAKAHASHI</span> <span class="surname">Motonobu</span></h3><span class="contrib">Japanese character support</span> <div class="affiliation"><div class="address"><p><code class="email"><<a class="email" href="mailto:monyo@home.monyo.com">monyo@home.monyo.com</a>></code></p></div></div></div></div><div><p class="pubdate">25 March 2003</p></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="unicode.html#id424458">Features and Benefits</a></span></dt><dt><span class="sect1"><a href="unicode.html#id424503">What Are Charsets and Unicode?</a></span></dt><dt><span class="sect1"><a href="unicode.html#id424622">Samba and Charsets</a></span></dt><dt><span class="sect1"><a href="unicode.html#id424748">Conversion from Old Names</a></span></dt><dt><span class="sect1"><a href="unicode.html#id424778">Japanese Charsets</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id424898">Basic Parameter Setting</a></span></dt><dt><span class="sect2"><a href="unicode.html#id425481">Individual Implementations</a></span></dt><dt><span class="sect2"><a href="unicode.html#id425594">Migration from Samba-2.2 Series</a></span></dt></dl></dd><dt><span class="sect1"><a href="unicode.html#id425733">Common Errors</a></span></dt><dd><dl><dt><span class="sect2"><a href="unicode.html#id425739">CP850.so Can't Be Found</a></span></dt></dl></dd></dl></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id424458"></a>Features and Benefits</h2></div></div></div><p> 2 <a class="indexterm" name="id424466"></a> 3 3 Every industry eventually matures. One of the great areas of maturation is in 4 4 the focus that has been given over the past decade to make it possible for anyone … … 12 12 is deserving of special mention. 13 13 </p><p> 14 <a class="indexterm" name="id4 50236"></a>14 <a class="indexterm" name="id424489"></a> 15 15 Samba-2.x supported a single locale through a mechanism called 16 16 <span class="emphasis"><em>codepages</em></span>. Samba-3 is destined to become a truly transglobal 17 17 file- and printer-sharing platform. 18 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 50250"></a>What Are Charsets and Unicode?</h2></div></div></div><p>19 <a class="indexterm" name="id4 50257"></a>18 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id424503"></a>What Are Charsets and Unicode?</h2></div></div></div><p> 19 <a class="indexterm" name="id424511"></a> 20 20 Computers communicate in numbers. In texts, each number is 21 21 translated to a corresponding letter. The meaning that will be assigned … … 23 23 </em></span> that is used. 24 24 </p><p> 25 <a class="indexterm" name="id4 50273"></a>26 <a class="indexterm" name="id4 50280"></a>25 <a class="indexterm" name="id424527"></a> 26 <a class="indexterm" name="id424534"></a> 27 27 A charset can be seen as a table that is used to translate numbers to 28 28 letters. Not all computers use the same charset (there are charsets … … 32 32 256 characters. Using this mode of encoding, each character takes exactly one byte. 33 33 </p><p> 34 <a class="indexterm" name="id4 50294"></a>35 <a class="indexterm" name="id4 50301"></a>34 <a class="indexterm" name="id424548"></a> 35 <a class="indexterm" name="id424554"></a> 36 36 There are also charsets that support extended characters, but those need at least 37 37 twice as much storage space as does ASCII encoding. Such charsets can contain … … 40 40 more then one byte to store one character. 41 41 </p><p> 42 <a class="indexterm" name="id4 50319"></a>42 <a class="indexterm" name="id424573"></a> 43 43 One standardized multibyte charset encoding scheme is known as 44 44 <a class="ulink" href="http://www.unicode.org/" target="_top">unicode</a>. A big advantage of using a … … 46 46 computers use the same charset when they are communicating. 47 47 </p><p> 48 <a class="indexterm" name="id4 50337"></a>49 <a class="indexterm" name="id4 50344"></a>50 <a class="indexterm" name="id4 50351"></a>48 <a class="indexterm" name="id424591"></a> 49 <a class="indexterm" name="id424598"></a> 50 <a class="indexterm" name="id424605"></a> 51 51 Old Windows clients use single-byte charsets, named 52 52 <em class="parameter"><code>codepages</code></em>, by Microsoft. However, there is no support for … … 54 54 have to make sure you are using the same charset when talking to an older client. 55 55 Newer clients (Windows NT, 200x, XP) talk Unicode over the wire. 56 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 50371"></a>Samba and Charsets</h2></div></div></div><p>57 <a class="indexterm" name="id4 50379"></a>58 <a class="indexterm" name="id4 50385"></a>56 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id424622"></a>Samba and Charsets</h2></div></div></div><p> 57 <a class="indexterm" name="id424630"></a> 58 <a class="indexterm" name="id424637"></a> 59 59 As of Samba-3, Samba can (and will) talk Unicode over the wire. Internally, 60 60 Samba knows of three kinds of character sets: 61 61 </p><div class="variablelist"><dl><dt><span class="term"><a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a></span></dt><dd><p> 62 <a class="indexterm" name="id4 50415"></a>63 <a class="indexterm" name="id4 50421"></a>62 <a class="indexterm" name="id424666"></a> 63 <a class="indexterm" name="id424673"></a> 64 64 This is the charset used internally by your operating system. 65 65 The default is <code class="constant">UTF-8</code>, which is fine for most … … 74 74 Run <code class="literal">testparm -v | grep "dos charset"</code> to see 75 75 what the default is on your system. 76 </p></dd></dl></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 50497"></a>Conversion from Old Names</h2></div></div></div><p>77 <a class="indexterm" name="id4 50505"></a>76 </p></dd></dl></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id424748"></a>Conversion from Old Names</h2></div></div></div><p> 77 <a class="indexterm" name="id424756"></a> 78 78 Because previous Samba versions did not do any charset conversion, 79 79 characters in filenames are usually not correct in the UNIX charset but only … … 81 81 </p><p>Bjoern Jacke has written a utility named <a class="ulink" href="http://j3e.de/linux/convmv/" target="_top">convmv</a> 82 82 that can convert whole directory structures to different charsets with one single command. 83 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id4 50526"></a>Japanese Charsets</h2></div></div></div><p>83 </p></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id424778"></a>Japanese Charsets</h2></div></div></div><p> 84 84 Setting up Japanese charsets is quite difficult. This is mainly because: 85 85 </p><div class="itemizedlist"><ul type="disc"><li><p> 86 <a class="indexterm" name="id4 50541"></a>86 <a class="indexterm" name="id424793"></a> 87 87 The Windows character set is extended from the original legacy Japanese 88 88 standard (JIS X 0208) and is not standardized. This means that the strictly 89 89 standardized implementation cannot support the full Windows character set. 90 90 </p></li><li><p> 91 <a class="indexterm" name="id4 50554"></a>92 <a class="indexterm" name="id4 50561"></a>93 <a class="indexterm" name="id4 50568"></a>94 <a class="indexterm" name="id4 50574"></a>95 <a class="indexterm" name="id4 50581"></a>91 <a class="indexterm" name="id424806"></a> 92 <a class="indexterm" name="id424812"></a> 93 <a class="indexterm" name="id424819"></a> 94 <a class="indexterm" name="id424826"></a> 95 <a class="indexterm" name="id424832"></a> 96 96 Mainly for historical reasons, there are several encoding methods in 97 97 Japanese, which are not fully compatible with each other. There are … … 113 113 the charset parameters depends on the implementation of iconv() you are using. 114 114 </p><p> 115 <a class="indexterm" name="id4 50615"></a>116 <a class="indexterm" name="id4 50622"></a>117 <a class="indexterm" name="id4 50629"></a>118 <a class="indexterm" name="id4 50635"></a>115 <a class="indexterm" name="id424867"></a> 116 <a class="indexterm" name="id424873"></a> 117 <a class="indexterm" name="id424880"></a> 118 <a class="indexterm" name="id424887"></a> 119 119 Though 2-byte fixed UCS-2 encoding is used in Windows internally, 120 120 Shift_JIS series encoding is usually used in Japanese environments 121 121 as ASCII encoding is in English environments. 122 </p></li></ul></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id4 50646"></a>Basic Parameter Setting</h3></div></div></div><p>123 <a class="indexterm" name="id4 50653"></a>122 </p></li></ul></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id424898"></a>Basic Parameter Setting</h3></div></div></div><p> 123 <a class="indexterm" name="id424904"></a> 124 124 The <a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a> and 125 125 <a class="link" href="smb.conf.5.html#DISPLAYCHARSET">display charset</a> … … 128 128 but sometimes has a different name. 129 129 </p><p> 130 <a class="indexterm" name="id4 50686"></a>131 <a class="indexterm" name="id4 50692"></a>132 <a class="indexterm" name="id4 50699"></a>130 <a class="indexterm" name="id424937"></a> 131 <a class="indexterm" name="id424944"></a> 132 <a class="indexterm" name="id424950"></a> 133 133 The <a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a> can be either Shift_JIS series, 134 134 EUC-JP series, or UTF-8. UTF-8 is always available, but the availability of other locales … … 167 167 with Shift_JIS. 168 168 </p></dd><dt><span class="term">EUC-JP series</span></dt><dd><p> 169 <a class="indexterm" name="id4 50816"></a>170 <a class="indexterm" name="id4 50823"></a>169 <a class="indexterm" name="id425070"></a> 170 <a class="indexterm" name="id425077"></a> 171 171 EUC-JP series means a locale that is equivalent to the industry 172 172 standard called EUC-JP, widely used in Japanese UNIX (although EUC … … 177 177 “<span class="quote">.txt</span>” (an 8-byte BINARY string). 178 178 </p><p> 179 <a class="indexterm" name="id4 50844"></a>180 <a class="indexterm" name="id4 50851"></a>181 <a class="indexterm" name="id4 50858"></a>182 <a class="indexterm" name="id4 50864"></a>183 <a class="indexterm" name="id4 50871"></a>184 <a class="indexterm" name="id4 50878"></a>185 <a class="indexterm" name="id4 50885"></a>186 <a class="indexterm" name="id4 50892"></a>187 <a class="indexterm" name="id4 50898"></a>188 <a class="indexterm" name="id4 50905"></a>179 <a class="indexterm" name="id425098"></a> 180 <a class="indexterm" name="id425105"></a> 181 <a class="indexterm" name="id425112"></a> 182 <a class="indexterm" name="id425119"></a> 183 <a class="indexterm" name="id425125"></a> 184 <a class="indexterm" name="id425132"></a> 185 <a class="indexterm" name="id425139"></a> 186 <a class="indexterm" name="id425146"></a> 187 <a class="indexterm" name="id425153"></a> 188 <a class="indexterm" name="id425159"></a> 189 189 Since EUC-JP is usually used on open source UNIX, Linux, and FreeBSD, and on commercial-based UNIX, Solaris, 190 190 IRIX, and Tru64 UNIX as Japanese locale (however, it is also possible on Solaris to use Shift_JIS and UTF-8, … … 199 199 during parsing filenames. 200 200 </p><p> 201 <a class="indexterm" name="id4 50932"></a>201 <a class="indexterm" name="id425186"></a> 202 202 Moreover, if you built Samba using differently installed libiconv, 203 203 the eucJP-ms locale included in libiconv and EUC-JP series locale … … 224 224 written from Windows on UNIX. 225 225 </p><p> 226 <a class="indexterm" name="id4 50996"></a>227 <a class="indexterm" name="id4 51003"></a>228 <a class="indexterm" name="id4 51010"></a>226 <a class="indexterm" name="id425250"></a> 227 <a class="indexterm" name="id425257"></a> 228 <a class="indexterm" name="id425264"></a> 229 229 In addition, although it is not directly concerned with Samba, since 230 230 there is a delicate difference between the iconv() function, which is … … 234 234 of the limitations involved in the process. 235 235 </p><p> 236 <a class="indexterm" name="id4 51024"></a>236 <a class="indexterm" name="id425278"></a> 237 237 Although Mac OS X uses UTF-8 as its encoding method for filenames, 238 238 it uses an extended UTF-8 specification that Samba cannot handle, so 239 239 UTF-8 locale is not available for Mac OS X. 240 240 </p></dd><dt><span class="term">Shift_JIS series + vfs_cap (CAP encoding)</span></dt><dd><p> 241 <a class="indexterm" name="id4 51043"></a>242 <a class="indexterm" name="id4 51050"></a>243 <a class="indexterm" name="id4 51057"></a>241 <a class="indexterm" name="id425297"></a> 242 <a class="indexterm" name="id425304"></a> 243 <a class="indexterm" name="id425311"></a> 244 244 CAP encoding means a specification used in CAP and NetAtalk, file 245 245 server software for Macintosh. In the case of CAP encoding, for … … 270 270 To use CAP encoding on Samba-3, you should use the unix charset parameter and VFS 271 271 as in <a class="link" href="unicode.html#vfscap-intl" title="Example 30.1. VFS CAP">the VFS CAP smb.conf file</a>. 272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. VFS CAP</b></p><div class="example-contents"><table class="simplelist" border="0" summary="Simple list"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id4 51142"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id451154"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id451174"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p>273 <a class="indexterm" name="id4 51189"></a>274 <a class="indexterm" name="id4 51196"></a>275 <a class="indexterm" name="id4 51202"></a>276 <a class="indexterm" name="id4 51209"></a>272 </p><div class="example"><a name="vfscap-intl"></a><p class="title"><b>Example 30.1. VFS CAP</b></p><div class="example-contents"><table class="simplelist" border="0" summary="Simple list"><tr><td> </td></tr><tr><td><em class="parameter"><code>[global]</code></em></td></tr><tr><td># the locale name "CP932" may be different</td></tr><tr><td><a class="indexterm" name="id425396"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id425408"></a><em class="parameter"><code>unix charset = CP932</code></em></td></tr><tr><td> </td></tr><tr><td><em class="parameter"><code>[cap-share]</code></em></td></tr><tr><td><a class="indexterm" name="id425428"></a><em class="parameter"><code>vfs option = cap</code></em></td></tr></table></div></div><br class="example-break"><p> 273 <a class="indexterm" name="id425443"></a> 274 <a class="indexterm" name="id425450"></a> 275 <a class="indexterm" name="id425457"></a> 276 <a class="indexterm" name="id425463"></a> 277 277 You should set CP932 if using GNU libiconv for unix charset. With this setting, 278 278 filenames in the “<span class="quote">cap-share</span>” share are written with CAP encoding. 279 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id4 51226"></a>Individual Implementations</h3></div></div></div><p>279 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id425481"></a>Individual Implementations</h3></div></div></div><p> 280 280 Here is some additional information regarding individual implementations: 281 281 </p><div class="variablelist"><dl><dt><span class="term">GNU libiconv</span></dt><dd><p> … … 300 300 </p><p> 301 301 Using the above glibc, these setting are available: 302 </p><table class="simplelist" border="0" summary="Simple list"><tr><td><a class="indexterm" name="id4 51295"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id451307"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id451318"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p>302 </p><table class="simplelist" border="0" summary="Simple list"><tr><td><a class="indexterm" name="id425549"></a><em class="parameter"><code>dos charset = CP932</code></em></td></tr><tr><td><a class="indexterm" name="id425561"></a><em class="parameter"><code>unix charset = CP932 / eucJP-ms / UTF-8</code></em></td></tr><tr><td><a class="indexterm" name="id425572"></a><em class="parameter"><code>display charset = CP932</code></em></td></tr></table><p> 303 303 </p><p> 304 304 Other Japanese locales (for example, Shift_JIS and EUC-JP) should not 305 305 be used because of the lack of the compatibility with Windows. 306 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id4 51340"></a>Migration from Samba-2.2 Series</h3></div></div></div><p>306 </p></dd></dl></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id425594"></a>Migration from Samba-2.2 Series</h3></div></div></div><p> 307 307 Prior to Samba-2.2 series, the “<span class="quote">coding system</span>” parameter was used. The default codepage in Samba 308 308 2.x was code page 850. In the Samba-3 series this has been replaced with the <a class="link" href="smb.conf.5.html#UNIXCHARSET">unix charset</a> parameter. <a class="link" href="unicode.html#japancharsets" title="Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3">Japanese Character Sets in Samba-2.2 and Samba-3</a> 309 309 shows the mapping table when migrating from the Samba-2.2 series to Samba-3. 310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id4 51429" href="#ftn.id451429" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id451460" href="#ftn.id451460" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id451429" href="#id451429" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id451460" href="#id451460" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id451479"></a>Common Errors</h2></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id451484"></a>CP850.so Can't Be Found</h3></div></div></div><p>“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</p><p>310 </p><div class="table"><a name="japancharsets"></a><p class="title"><b>Table 30.1. Japanese Character Sets in Samba-2.2 and Samba-3</b></p><div class="table-contents"><table summary="Japanese Character Sets in Samba-2.2 and Samba-3" border="1"><colgroup><col align="center"><col align="center"></colgroup><thead><tr><th align="center">Samba-2.2 Coding System</th><th align="center">Samba-3 unix charset</th></tr></thead><tbody><tr><td align="center">SJIS</td><td align="center">Shift_JIS series</td></tr><tr><td align="center">EUC</td><td align="center">EUC-JP series</td></tr><tr><td align="center">EUC3<sup>[<a name="id425683" href="#ftn.id425683" class="footnote">a</a>]</sup></td><td align="center">EUC-JP series</td></tr><tr><td align="center">CAP</td><td align="center">Shift_JIS series + VFS</td></tr><tr><td align="center">HEX</td><td align="center">currently none</td></tr><tr><td align="center">UTF8</td><td align="center">UTF-8</td></tr><tr><td align="center">UTF8-Mac<sup>[<a name="id425714" href="#ftn.id425714" class="footnote">b</a>]</sup></td><td align="center">currently none</td></tr><tr><td align="center">others</td><td align="center">none</td></tr></tbody><tbody class="footnotes"><tr><td colspan="2"><div class="footnote"><p><sup>[<a name="ftn.id425683" href="#id425683" class="para">a</a>] </sup>Only exists in Japanese Samba version</p></div><div class="footnote"><p><sup>[<a name="ftn.id425714" href="#id425714" class="para">b</a>] </sup>Only exists in Japanese Samba version</p></div></td></tr></tbody></table></div></div><br class="table-break"></div></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="id425733"></a>Common Errors</h2></div></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="id425739"></a>CP850.so Can't Be Found</h3></div></div></div><p>“<span class="quote">Samba is complaining about a missing <code class="filename">CP850.so</code> file.</span>”</p><p> 311 311 CP850 is the default <a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a>. 312 312 The <a class="link" href="smb.conf.5.html#DOSCHARSET">dos charset</a> is used to convert data to the codepage used by your DOS clients.
Note:
See TracChangeset
for help on using the changeset viewer.