[#33161] Call/CC and Ruby iterators. — olczyk@... (Thaddeus L Olczyk)

Reading about call/cc in Scheme I get the impression that it is very

11 messages 2002/02/05

[#33242] favicon.ico — Dave Thomas <Dave@...>

19 messages 2002/02/06
[#33256] Re: favicon.ico — Leon Torres <leon@...> 2002/02/06

[#33435] Reg: tiny contest: who's faster? (add_a_gram) — grady@... (Steven Grady)

> My current solution works correctly with various inputs.

17 messages 2002/02/08

[#33500] Ruby Embedded Documentation — William Djaja Tjokroaminata <billtj@...>

Hi,

24 messages 2002/02/10
[#33502] Re: Ruby Embedded Documentation — "Lyle Johnson" <ljohnson@...> 2002/02/10

> Now, I am using Ruby on Linux, and I have downloaded Ruby version

[#33615] Name resolution in Ruby — stern@... (Alan Stern)

I've been struggling to understand how name resolution is supposed to

16 messages 2002/02/11

[#33617] choice of HTML templating system — Paul Brannan <paul@...>

I am not a web developer, nor do I pretend to be one.

23 messages 2002/02/11

[#33619] make first letter lowercase — sebi@... (sebi)

hello,

20 messages 2002/02/11
[#33620] Re: [newbie] make first letter lowercase — Tobias Reif <tobiasreif@...> 2002/02/11

sebi wrote:

[#33624] Re: [newbie] make first letter lowercase — "Jeff 'japhy' Pinyan" <jeffp@...> 2002/02/11

On Feb 11, Tobias Reif said:

[#33632] Re: [newbie] make first letter lowercase — Mathieu Bouchard <matju@...> 2002/02/12

[#33731] simple XML parsing (greedy / non-greedy — Ron Jeffries <ronjeffries@...>

Suppose I had this text

14 messages 2002/02/13

[#33743] qualms about respond_to? idiom — David Alan Black <dblack@...>

Hi --

28 messages 2002/02/13
[#33751] Re: qualms about respond_to? idiom — Dave Thomas <Dave@...> 2002/02/13

David Alan Black <dblack@candle.superlink.net> writes:

[#33754] Re: qualms about respond_to? idiom — David Alan Black <dblack@...> 2002/02/13

Hi --

[#33848] "Powered by Ruby" banner — Yuri Leikind <YuriLeikind@...>

Hello Ruby folks,

78 messages 2002/02/14
[#33909] Re: "Powered by Ruby" banner — Leon Torres <leon@...> 2002/02/14

On Thu, 14 Feb 2002, Yuri Leikind wrote:

[#33916] RE: "Powered by Ruby" banner — "Jack Dempsey" <dempsejn@...> 2002/02/15

A modest submission:

[#33929] Re: "Powered by Ruby" banner — yet another bill smith <bigbill.smith@...> 2002/02/15

Kent Dahl wrote:

[#33932] OT Netscape 4.x? was Re: "Powered by Ruby" banner — Chris Gehlker <gehlker@...> 2002/02/15

On 2/15/02 5:54 AM, "yet another bill smith" <bigbill.smith@verizon.net>

[#33933] RE: OT Netscape 4.x? was Re: "Powered by Ruby" banner — "Jack Dempsey" <dempsejn@...> 2002/02/15

i just don't understand why it didn't show up! dhtml/javascript, ok, but a

[#33937] Re: OT Netscape 4.x? was Re: "Powered by Ruby" banner — Chris Gehlker <gehlker@...> 2002/02/15

On 2/15/02 7:16 AM, "Jack Dempsey" <dempsejn@georgetown.edu> wrote:

[#33989] Re: OT OmniWeb [was: Netscape 4.x?] — Sean Russell <ser@...> 2002/02/16

Chris Gehlker wrote:

[#33991] Re: OT OmniWeb [was: Netscape 4.x?] — Rob Partington <rjp@...> 2002/02/16

In message <3c6e5e01_1@spamkiller.newsgroups.com>,

[#33993] Re: OT OmniWeb [was: Netscape 4.x?] — Thomas Hurst <tom.hurst@...> 2002/02/16

* Rob Partington (rjp@browser.org) wrote:

[#33925] Re: "Powered by Ruby" banner — Martin Maciaszek <mmaciaszek@...> 2002/02/15

In article <3C6CFCCA.5AD5CA67@scnsoft.com>, Yuri Leikind wrote:

[#33956] Re: "Powered by Ruby" banner — Leon Torres <leon@...> 2002/02/15

On Fri, 15 Feb 2002, Martin Maciaszek wrote:

[#33851] Ruby and .NET — Patrik Sundberg <ps@...>

I have been reading a bit about .NET for the last couple of days and must say

53 messages 2002/02/14

[#34024] Compiled companion language for Ruby? — Erik Terpstra <erik@...>

Hmmm, seems that my previous post was in a different thread, I'll try

12 messages 2002/02/16

[#34036] The GUI Returns — "Horacio Lopez" <vruz@...>

Hello all,

33 messages 2002/02/17

[#34162] Epic4/Ruby — Thomas Hurst <tom.hurst@...>

Rejoice, for you no longer have to put up with that evil excuse for a

34 messages 2002/02/18

[#34185] Operator overloading and multiple arguments — ptkwt@...1.aracnet.com (Phil Tomson)

I'm trying to overload the '<=' operator in a class in order to use it for

10 messages 2002/02/18

[#34217] Ruby for web development — beripome@... (Billy)

Hi all,

21 messages 2002/02/19

[#34350] FAQ for comp.lang.ruby — "Hal E. Fulton" <hal9000@...>

RUBY NEWSGROUP FAQ -- Welcome to comp.lang.ruby! (Revised 2001-2-18)

15 messages 2002/02/20

[#34375] Setting the Ruby continued — <jostein.berntsen@...>

Hi,

24 messages 2002/02/20
[#34384] Re: Setting the Ruby continued — Paulo Schreiner <paulo@...> 2002/02/20

Also VERY important:

[#34467] recursive require — Ron Jeffries <ronjeffries@...>

I'm having a really odd thing happen with two files that mutually

18 messages 2002/02/21

[#34503] special characters — Tobias Reif <tobiasreif@...>

Hi all,

13 messages 2002/02/22

[#34517] Windows Installer Ruby 166-0 available — Andrew Hunt <andy@...>

16 messages 2002/02/22

[#34597] rdoc/xml questions — Dave Thomas <Dave@...>

24 messages 2002/02/23

[#34631] Object/Memory Management — "Sean O'Dell" <sean@...>

I'm new to Ruby and the community here (I've been learning Ruby for a grand

44 messages 2002/02/23

[#34682] duplicate method name — Ron Jeffries <ronjeffries@...>

I just found a case in a test file where i had two tests of the same

16 messages 2002/02/24
[#34687] Re: duplicate method name — s@... (Stefan Schmiedl) 2002/02/24

Hi Ron.

[#34791] Style Question — Ron Jeffries <ronjeffries@...>

So I'm building this set theory library. The "only" object is supposed

13 messages 2002/02/25

[#34912] RCR?: parallel to until: as_soon_as — Tobias Reif <tobiasreif@...>

Hi,

18 messages 2002/02/26

[#34972] OT A Question on work styles — Chris Gehlker <gehlker@...>

As a Mac baby I just had to step through ruby in GDB *from the command line*

20 messages 2002/02/28

[#35015] Time Comparison — "Sean O'Dell" <sean@...>

I am using the time object to compare times between two files and I'm

21 messages 2002/02/28

Re: Regexp::Parser ported to Ruby...?

From: Rob Partington <rjp@...>
Date: 2002-02-07 23:22:27 UTC
List: ruby-talk #33370
In message <Pine.GSO.4.21.0202061629470.612-100000@crusoe.crusoe.net>,
"Jeff 'japhy' Pinyan" <jeffp@crusoe.net> writes:
>                         Some people would like the ability to assign
> DIRECTLY to some variable, like:
> 
>   "japhy" =~ /(?<$front>..)(?<$back>..)$/;
> 
> Well, this syntax isn't possible in Perl (yet).  

That's pretty funky, so I did some fiddling and now Ruby lets you do
   
    "japhy" =~ /(?[front]..)(?[back]..)$/
	puts "front=#{$front} back=#{$back}"

Caveat Emptor: I knocked this up in short space of time and
it's an inelegant hack, but it mostly works.  It only defines global
variables because the first way I tried (rb_eval("variable=%q{match}")
broke on things like "2{=" and it was easier to switch to
rb_define_variable than to handle borderline cases like that.

It doesn't check that the variable you're passing is legal, valid,
sensible or even vaguely presentable.  It trusts you not to hurt it.

I'm not sure the memory management is sensible (I use strdup at one point,
but I guess I could easily change that to use rb_str_new() and store the
Ruby string instead of the C string.  The only documentation I had to work
from was README.EXT in the source tarball.

Variables are only wiped when the whole regexp fails, not individually,
which is suboptimal but better than nothing for the moment.

The patch is against the 1.6.6 sources, and should compile cleanly.
Let me know if I've missed anything glaringly obvious or if there's
anything else I need on the TODO list.

TODO: 
* variable deletions when part of a regexp fails
  "japhy" =~ /(?[a]ap)(?[b]z?)/ # should b end up as nil or ""?
* memory management clean ups
* rewrite the whole thing more elegantly
* backpatch the functionality into GNU regex code for general use?
  (so re_match calls hooks for (?[string]regexp) patterns)

diff -burP ruby-1.6.6o/re.c ruby-1.6.6/re.c
--- ruby-1.6.6o/re.c	Tue Dec 25 14:24:35 2001
+++ ruby-1.6.6/re.c	Thu Feb  7 22:48:23 2002
@@ -623,6 +623,15 @@
     }
 
     if (result < 0) {
+      struct re_pattern_buffer *pb = RREGEXP(re)->ptr;
+      int i;
+
+      /* reset all the variables */
+      for (i=1; i<=pb->regvars; i++) {
+        if (pb->regvar[i]) {
+          rb_define_variable(pb->regvar[i], NULL); /* seems to work */
+        }
+      }
	rb_backref_set(Qnil);
	return result;
     }
@@ -647,6 +656,20 @@
     RMATCH(match)->str = rb_str_new4(str);
     rb_backref_set(match);
 
+    if(RMATCH(match)->regs->num_regs > 0 && RMATCH(match)->regs->regvars > 0) {
+      int i;
+      struct re_registers *r = RMATCH(match)->regs;
+      for (i=1; i<r->num_regs; i++) {
+        char e[1024];
+        VALUE *s = ALLOC_N(VALUE, 1);
+        *s = rb_reg_nth_match(i, match);
+        if (r->var[i] && *s) {
+          rb_define_variable(r->var[i], s);
+        }
+        r->var[i] = NULL;
+      }  
+    }
+    
     OBJ_INFECT(match, re);
     OBJ_INFECT(match, str);
     return result;
diff -burP ruby-1.6.6o/regex.c ruby-1.6.6/regex.c
--- ruby-1.6.6o/regex.c	Wed Dec  5 14:36:41 2001
+++ ruby-1.6.6/regex.c	Thu Feb  7 22:50:00 2002
@@ -1261,6 +1261,10 @@
 
   int options = bufp->options;
 
+  if (bufp->regvar) xfree(bufp->regvar);
+  bufp->regvar = NULL;
+  bufp->regvars = 0;
+
   bufp->fastmap_accurate = 0;
   bufp->must = 0;
   bufp->must_skip = 0;
@@ -1777,6 +1781,25 @@
 	case '>':
 	  break;
 
+    /* this is horrible */
+    case '[': {
+      char var[1024]; /* hackeroo */
+      int j=0;
+      for (;;) {
+        PATFETCH(c);
+        if (c==']') break;
+        var[j++]=c;
+        var[j]='\0';
+      }
+      if (regnum > bufp->regvars) {
+          TREALLOC(bufp->regvar, regnum+1, unsigned char *);
+      }
+      bufp->regvar[regnum] = strdup(var);
+      bufp->regvars = regnum;
+      c='('; /* pretend we're still parsing a match */
+      break; 
+    }
+
 	default:
 	  FREE_AND_RETURN(stackb, "undefined (?...) sequence");
 	}
@@ -2440,6 +2463,7 @@
   bufp->reg_info = TMALLOC(regnum, register_info_type);
   bufp->best_regstart = TMALLOC(regnum, unsigned char*);
   bufp->best_regend = TMALLOC(regnum, unsigned char*);
+  TREALLOC(bufp->regvar, regnum, unsigned char *);
   FREE_AND_RETURN(stackb, 0);
 
  invalid_pattern:
@@ -2476,6 +2500,7 @@
   xfree(bufp->best_regstart);
   xfree(bufp->best_regend);
   xfree(bufp->reg_info);
+  xfree(bufp->regvar);
   xfree(bufp);
 }
 
@@ -3495,15 +3520,18 @@
   if (regs->allocated == 0) {
     regs->beg = TMALLOC(num_regs, int);
     regs->end = TMALLOC(num_regs, int);
+    regs->var = TMALLOC(num_regs, char *);
     regs->allocated = num_regs;
   }
   else if (regs->allocated < num_regs) {
     TREALLOC(regs->beg, num_regs, int);
     TREALLOC(regs->end, num_regs, int);
+    TREALLOC(regs->var, num_regs, char *);
     regs->allocated = num_regs;
   }
   for (i=0; i<num_regs; i++) {
     regs->beg[i] = regs->end[i] = -1;
+    regs->var[i] = NULL;
   }
 }
 
@@ -3568,6 +3596,7 @@
 
   unsigned char **regstart = bufp->regstart;
   unsigned char **regend = bufp->regend;
+  unsigned char **regvar = bufp->regvar;
 
   /* If a group that's operated upon by a repetition operator fails to
      match anything, then the register for its start will need to be
@@ -3694,14 +3723,22 @@
       if (regs) {
 	regs->beg[0] = pos;
 	regs->end[0] = d - string;
+   regs->var[0] = NULL;
+   regs->regvars = bufp->regvars;
+
+   /* bleh */
+   TREALLOC(bufp->regvar, num_regs+1, unsigned char *);
+
 	for (mcnt = 1; mcnt < num_regs; mcnt++) {
 	  if (REG_UNSET(regend[mcnt])) {
 	    regs->beg[mcnt] = -1;
 	    regs->end[mcnt] = -1;
+       regs->var[mcnt] = NULL;
 	    continue;
 	  }
 	  regs->beg[mcnt] = regstart[mcnt] - string;
 	  regs->end[mcnt] = regend[mcnt] - string;
+     regs->var[mcnt] = bufp->regvar[mcnt];
 	}
       }
       FREE_AND_RETURN(stackb, (d - pos - string));
@@ -4405,18 +4442,22 @@
   if (regs1->allocated == 0) {
     regs1->beg = TMALLOC(regs2->num_regs, int);
     regs1->end = TMALLOC(regs2->num_regs, int);
+    regs1->var = TMALLOC(regs2->num_regs, char *);
     regs1->allocated = regs2->num_regs;
   }
   else if (regs1->allocated < regs2->num_regs) {
     TREALLOC(regs1->beg, regs2->num_regs, int);
     TREALLOC(regs1->end, regs2->num_regs, int);
+    TREALLOC(regs1->var, regs2->num_regs, char *);
     regs1->allocated = regs2->num_regs;
   }
   for (i=0; i<regs2->num_regs; i++) {
     regs1->beg[i] = regs2->beg[i];
     regs1->end[i] = regs2->end[i];
+    regs1->var[i] = regs2->var[i];
   }
   regs1->num_regs = regs2->num_regs;
+  regs1->regvars = regs2->regvars;
 }
 
 void
@@ -4426,6 +4467,7 @@
   if (regs->allocated == 0) return;
   if (regs->beg) xfree(regs->beg);
   if (regs->end) xfree(regs->end);
+  if (regs->var) xfree(regs->var);
 }
 
 /* Functions for multi-byte support.
diff -burP ruby-1.6.6o/regex.h ruby-1.6.6/regex.h
--- ruby-1.6.6o/regex.h	Fri Jul 13 06:48:25 2001
+++ ruby-1.6.6/regex.h	Thu Feb  7 18:06:00 2002
@@ -154,6 +154,10 @@
     register_info_type *reg_info;
     unsigned char **best_regstart;
     unsigned char **best_regend;
+
+    /* list of variable names to store register matches in */
+    unsigned char **regvar; 
+    int regvars;
   };
 
 typedef struct re_pattern_buffer regex_t;
@@ -172,8 +176,10 @@
   {
     int allocated;
     int num_regs;
+    int regvars;
     int *beg;
     int *end;
+    char **var;
   };
 
 /* Type for byte offsets within the string.  POSIX mandates this.  */
diff -burP ruby-1.6.6o/sample/test.rb ruby-1.6.6/sample/test.rb
--- ruby-1.6.6o/sample/test.rb	Wed Mar 21 08:04:12 2001
+++ ruby-1.6.6/sample/test.rb	Thu Feb  7 20:49:53 2002
@@ -674,6 +674,8 @@
 test_ok(/(\s+\d+){2}/ =~ " 1 2" && $& == " 1 2")
 test_ok(/(?:\s+\d+){2}/ =~ " 1 2" && $& == " 1 2")
 
+test_ok("moocowfish" =~ /(?[cow]c..)/ && $cow == "cow")
+
 $x = <<END;
 ABCD
 ABCD
require 'rubyunit'

class TestPermute < RUNIT::TestCase
  def test_simple
    testme=%q{moomopcowfishcowx{1=>2}y[1,2]z"string"y3}

    testme =~ /(c..)/      ;     assert($1 == "cow")
    testme =~ /(?[c]m..)/  ;     assert($c == "moo")
	                             assert($1 == "moo")
    testme =~ /(?[c]c.*?w)/;     assert($c == "cow")
	testme =~ /(?[c]x.*?)y/;     assert($c == "x{1=>2}")
	testme =~ /(?[c]y.*?)z/;     assert($c == "y[1,2]")
	testme =~ /(?[c]z.*)$/ ;     assert($c == %q{z"string"y3})
	testme =~ /(?[c]z.*)n/ ;     assert($c == %q{z"stri})
	testme =~ /(?[c]x.*)y/ ;     assert($c == %q{x{1=>2}y[1,2]z"string"})
	testme =~ /(?[c]x.*?)>/;     assert($c == "x{1=")
	testme =~ /y\[1(?[c].*?)z/;  assert($c == ",2]")

	testme =~ /(?[c]cow)(?[d].*cow)(.)/
		assert($c == "cow")
		assert($d == "fishcow")
		assert($3 == "x")

# failing regexps blank the variables
    testme =~ /(?[c]^m.p)/
        assert($c == nil)
        assert($d == "fishcow")

	"japhy" =~ /(?[front]..)(?[back]..)$/
		assert($front == "ap")
		assert($back  == "hy")
  end
end
-- 
rob partington % rjp@browser.org % http://lynx.browser.org/

In This Thread

Prev Next