[#399938] how to read arrary with an array — "Richard D." <lists@...>

Hello. I believe this is basic question, but I'm just starting to learn

19 messages 2012/10/02

[#400050] img src while sending email ruby cgi — Ferdous ara <lists@...>

Hi

16 messages 2012/10/05

[#400351] Drop 1st and last particular character — ajay paswan <lists@...>

What is the most efficient way to drop '#' from the first place and last

15 messages 2012/10/16

[#400374] database part of a desktop application — "Sebastjan H." <lists@...>

Hi,

14 messages 2012/10/16
[#400375] Re: database part of a desktop application — Chad Perrin <code@...> 2012/10/16

On Wed, Oct 17, 2012 at 05:28:39AM +0900, Sebastjan H. wrote:

[#400377] Re: database part of a desktop application — sto.mar@... 2012/10/17

Am 16.10.2012 23:24, schrieb Chad Perrin:

[#400389] Re: database part of a desktop application — Chad Perrin <code@...> 2012/10/17

On Wed, Oct 17, 2012 at 01:39:21PM +0900, sto.mar@web.de wrote:

[#400386] Unable to send attachment, and dealing with multiple attachment — ajay paswan <lists@...>

Hi,

11 messages 2012/10/17

[#400454] Hash with Integer key issue — Wayne Simmerson <lists@...>

Hi Im new to Ruby and am getting some unexpected results from a

18 messages 2012/10/19

[#400535] Name/symbol/object type clash? What is happening here? — Todd Benson <caduceass@...>

It's nonsense code, but I'm curious as to what is going on behind the scenes...

41 messages 2012/10/23

[#400556] Calling a method foo() or an object foo.method_call_here - both — Marc Heiler <lists@...>

Hello.

13 messages 2012/10/24

[#400650] OpenSSL ECDSA public key from private — Nokan Emiro <uzleepito@...>

Hi,

11 messages 2012/10/27

[#400680] Passing folder as argument ARGV? — Joz Private <lists@...>

Is there an easy way to pass multiple files on the command line?

15 messages 2012/10/28
[#400681] Re: Passing folder as argument ARGV? — brad smith <bradleydsmith@...> 2012/10/28

How are you traversing the directory you pass in on the command line ?

[#400697] File.readable? and /proc — Jeff Moore <lists@...>

root@nail:/projects/proc_fs# uname -a

13 messages 2012/10/28

[#400714] Marshal.load weird issue — "Pierre J." <lists@...>

Hi guys

12 messages 2012/10/28

[#400781] bug?: local variable created in if modifier not available in modified expression — "Mean L." <lists@...>

irb(main):001:0> local1 if local1 = "created"

21 messages 2012/10/30
[#400807] Re: bug?: local variable created in if modifier not available in modified expression — Bartosz Dziewoński <matma.rex@...> 2012/10/31

Oh, and in case it wasn't apparent: you can just add

[#400808] Re: bug?: local variable created in if modifier not available in modified expression — Eliezer Croitoru <eliezer@...> 2012/10/31

On 10/31/2012 4:52 PM, Bartosz Dziewoナгki wrote:

[#400809] Re: bug?: local variable created in if modifier not available in modified expression — Robert Klemme <shortcutter@...> 2012/10/31

On Wed, Oct 31, 2012 at 4:28 PM, Eliezer Croitoru <eliezer@ngtech.co.il>wrote:

[#400784] REXML & HTMLentities incorrectly map to UTF-8 — "Mark S." <lists@...>

I have some XML data (UTF 8) that I'm trying to convert into another XML

13 messages 2012/10/30

Re: REXML & HTMLentities incorrectly map to UTF-8

From: "Mark S." <lists@...>
Date: 2012-10-31 18:34:57 UTC
List: ruby-talk #400814
Hello Bob & Carlo,

> I am not really sure about what happens within rexml
> there, but when you get your CDATA string, if you are
> sure that the stuff inside is UTF-8, you can force the
> encoding. By

I'm pretty sure that REXML converts with UTF-8. That's what the tutorial 
implies. In any event, its already done the translation at the moment I 
use element.text. The problem is that its converted HTML entities like 
&#146; into code point at 146 (which is C292) instead of into the 
corresponding functional code point 2019 (single right quote).

> Are you saying that REXML is parsing the content of the
> CDATA section and replacing those entities? Or are you
> extracting the CDATA sections after REXML is finished
> and then parsing them yourself?

Yes, REXML is replacing entities like &#146; and converting it into 
whatever happens to be at codepoint 146. Which happens just to be a 
control point -- not a character. This is not an intelligent mapping.

This conversion apparently happens when I use any form of Xpath to 
collect Elements. This is not what a typical user would expect.

There is a raw-mode that will tell REXML to not translate anything, but 
then it also pulls out the enclosing tags. So I get

   <mystuff>Apostrophe: &#146; </mystuff>.

So maybe I could clean out the tags in this code or maybe I could write 
some complicated recursive code that doesn't use Xpath. But I would 
still need an intelligent way to convert HTML entities to UTF-8.

Which leads me to HTMLentities.

If I try to  use HTMLentities to translate the codes, it also does the 
useless translation of converting &#146; to a codepoint.

I didn't know about Nokigiri. I took 2 days to learn REXML ... thought 
it was a standard. Guess I'll look into NG and see if its better.

Thanks !
Mark

-- 
Posted via http://www.ruby-forum.com/.

In This Thread