1 | # From jose@monkey.org Thu Jun 5 11:48:35 2003
|
---|
2 | # Return-Path: <jose@monkey.org>
|
---|
3 | # Received: from localhost (skeeve [127.0.0.1])
|
---|
4 | # by skeeve.com (8.12.5/8.12.5) with ESMTP id h558eVvA012655
|
---|
5 | # for <arnold@localhost>; Thu, 5 Jun 2003 11:48:35 +0300
|
---|
6 | # Received: from actcom.co.il [192.114.47.1]
|
---|
7 | # by localhost with POP3 (fetchmail-5.9.0)
|
---|
8 | # for arnold@localhost (single-drop); Thu, 05 Jun 2003 11:48:35 +0300 (IDT)
|
---|
9 | # Received: by actcom.co.il (mbox arobbins)
|
---|
10 | # (with Cubic Circle's cucipop (v1.31 1998/05/13) Thu Jun 5 11:47:59 2003)
|
---|
11 | # X-From_: jose@monkey.org Thu Jun 5 07:14:45 2003
|
---|
12 | # Received: from smtp1.actcom.net.il by actcom.co.il with ESMTP
|
---|
13 | # (8.11.6/actcom-0.2) id h554EdY08108 for <arobbins@actcom.co.il>;
|
---|
14 | # Thu, 5 Jun 2003 07:14:41 +0300 (EET DST)
|
---|
15 | # (rfc931-sender: smtp.actcom.co.il [192.114.47.13])
|
---|
16 | # Received: from f7.net (consort.superb.net [209.61.216.22])
|
---|
17 | # by smtp1.actcom.net.il (8.12.8/8.12.8) with ESMTP id h554G3To008304
|
---|
18 | # for <arobbins@actcom.co.il>; Thu, 5 Jun 2003 07:16:05 +0300
|
---|
19 | # Received: from fencepost.gnu.org (fencepost.gnu.org [199.232.76.164])
|
---|
20 | # by f7.net (8.11.7/8.11.6) with ESMTP id h554Ean08172
|
---|
21 | # for <arnold@skeeve.com>; Thu, 5 Jun 2003 00:14:36 -0400
|
---|
22 | # Received: from monty-python.gnu.org ([199.232.76.173])
|
---|
23 | # by fencepost.gnu.org with esmtp (Exim 4.20)
|
---|
24 | # id 19Nm96-0001xE-1i
|
---|
25 | # for arnold@gnu.ai.mit.edu; Thu, 05 Jun 2003 00:14:36 -0400
|
---|
26 | # Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.20)
|
---|
27 | # id 19Nm8x-0005ge-Dz
|
---|
28 | # for arnold@gnu.ai.mit.edu; Thu, 05 Jun 2003 00:14:28 -0400
|
---|
29 | # Received: from naughty.monkey.org ([66.93.9.164])
|
---|
30 | # by monty-python.gnu.org with esmtp (Exim 4.20)
|
---|
31 | # id 19Nm8w-0005VM-Ko
|
---|
32 | # for arnold@gnu.ai.mit.edu; Thu, 05 Jun 2003 00:14:26 -0400
|
---|
33 | # Received: by naughty.monkey.org (Postfix, from userid 1203)
|
---|
34 | # id C15511BA97B; Thu, 5 Jun 2003 00:14:19 -0400 (EDT)
|
---|
35 | # Received: from localhost (localhost [127.0.0.1])
|
---|
36 | # by naughty.monkey.org (Postfix) with ESMTP
|
---|
37 | # id BF9821BA969; Thu, 5 Jun 2003 00:14:19 -0400 (EDT)
|
---|
38 | # Date: Thu, 5 Jun 2003 00:14:19 -0400 (EDT)
|
---|
39 | # From: Jose Nazario <jose@monkey.org>
|
---|
40 | # To: bug-gnu-utils@prep.ai.mit.edu, arnold@gnu.ai.mit.edu,
|
---|
41 | # netbsd-bugs@netbsd.org
|
---|
42 | # Subject: bug in gawk/gsub() (not present in nawk)
|
---|
43 | # Message-ID: <Pine.BSO.4.51.0306050007160.31577@naughty.monkey.org>
|
---|
44 | # MIME-Version: 1.0
|
---|
45 | # Content-Type: TEXT/PLAIN; charset=US-ASCII
|
---|
46 | # X-Spam-Status: No, hits=-1.2 required=5.0
|
---|
47 | # tests=SPAM_PHRASE_00_01,USER_AGENT_PINE
|
---|
48 | # version=2.41
|
---|
49 | # X-Spam-Level:
|
---|
50 | # X-SpamBouncer: 1.4 (10/07/01)
|
---|
51 | # X-SBClass: OK
|
---|
52 | # Status: R
|
---|
53 | #
|
---|
54 | # while playing with some tools in data massaging, i had to migrate from an
|
---|
55 | # openbsd/nawk system to a netbsd/gawk system. i found the folllowing
|
---|
56 | # behavior, which seems to be a bug.
|
---|
57 | #
|
---|
58 | # the following gsub() pattern has a strange effect under gawk which is not
|
---|
59 | # visible in nawk (at least as compiled on openbsd). the intention is to
|
---|
60 | # take a string like "This Is a Title: My Title?" and turn it into a
|
---|
61 | # normalized string: "ThisIsaTitleMyTitle". to do this, i wrote the
|
---|
62 | # following gross gsub line in an awk script:
|
---|
63 | #
|
---|
64 | # gsub(/[\ \"-\/\\:;\[\]\@\?\.\,\$]/, "", $2)
|
---|
65 | # print $2
|
---|
66 | #
|
---|
67 | # in gawk, as found in netbsd-macppc/1.5.2, this will drop the first letter
|
---|
68 | # of every word. the resulting string will be "hissitleyitle", while in nawk
|
---|
69 | # as built on openbsd-3.3 this will get it correct.
|
---|
70 | #
|
---|
71 | # any insights? the inconsistency with this relatively naive pattern seems a
|
---|
72 | # bit odd. (i would up installing nawk built from openbsd sources.)
|
---|
73 | #
|
---|
74 | # thanks. sorry i didn't send a better bug report, netbsd folks, i'm not
|
---|
75 | # much of a netbsd user, and i dont have send-pr set up. yes, this is a
|
---|
76 | # slightly older version of netbsd and gawk:
|
---|
77 | #
|
---|
78 | # $ uname -a
|
---|
79 | # NetBSD entropy 1.5.2 NetBSD 1.5.2 (GENERIC) #0: Sun Feb 10 02:00:04 EST
|
---|
80 | # 2002 jose@entropy:/usr/src/sys/arch/macppc/compile/GENERIC macppc
|
---|
81 | # $ awk --version
|
---|
82 | # GNU Awk 3.0.3
|
---|
83 | # Copyright (C) 1989, 1991-1997 Free Software Foundation.
|
---|
84 | #
|
---|
85 | #
|
---|
86 | #
|
---|
87 | # thanks.
|
---|
88 | #
|
---|
89 | # ___________________________
|
---|
90 | # jose nazario, ph.d. jose@monkey.org
|
---|
91 | # http://monkey.org/~jose/
|
---|
92 | #
|
---|
93 | #
|
---|
94 | {
|
---|
95 | gsub(/[\ \"-\/\\:;\[\]\@\?\.\,\$]/, "")
|
---|
96 | print
|
---|
97 | }
|
---|