<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: TRE 0.8.0 Released</title>
	<atom:link href="http://laurikari.net/tre/tre-0-8-0-released/feed/" rel="self" type="application/rss+xml" />
	<link>http://laurikari.net/tre/tre-0-8-0-released/</link>
	<description>The free and portable approximate regex matching library.</description>
	<lastBuildDate>Sat, 02 Jul 2011 05:30:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: sang-suan gam</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-448</link>
		<dc:creator>sang-suan gam</dc:creator>
		<pubDate>Tue, 03 May 2011 07:30:55 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-448</guid>
		<description>Hi,

just downloaded the library for use on the command
(windows).

the logic queries (AND, OR) are not working ?

# agrep &#039;FATAL&#039; report.20110408.txt
    776 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571
#
# agrep &#039;FATAL;ERROR&#039; report.20110408.txt
# agrep &#039;FATAL,ERROR&#039; report.20110408.txt

also, when i ran the command in a cygwin terminal,
there is a complain that delimiters should not be
empty strings ?

# agrep -d &#039;$$&#039; &#039;FATAL,ERROR&#039; report.20110408.txt
C:\GnuWin32\bin\agrep.exe: Record delimiter pattern must not match an empty string
#

are these features no longer supported ?

Thanks,
sam</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>just downloaded the library for use on the command<br />
(windows).</p>
<p>the logic queries (AND, OR) are not working ?</p>
<p># agrep &#8216;FATAL&#8217; report.20110408.txt<br />
    776 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
     20 FATAL ERROR IN TWO-TASK SERVER: error = 12571<br />
#<br />
# agrep &#8216;FATAL;ERROR&#8217; report.20110408.txt<br />
# agrep &#8216;FATAL,ERROR&#8217; report.20110408.txt</p>
<p>also, when i ran the command in a cygwin terminal,<br />
there is a complain that delimiters should not be<br />
empty strings ?</p>
<p># agrep -d &#8216;$$&#8217; &#8216;FATAL,ERROR&#8217; report.20110408.txt<br />
C:\GnuWin32\bin\agrep.exe: Record delimiter pattern must not match an empty string<br />
#</p>
<p>are these features no longer supported ?</p>
<p>Thanks,<br />
sam</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JM</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-67</link>
		<dc:creator>JM</dc:creator>
		<pubDate>Wed, 21 Jul 2010 18:20:13 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-67</guid>
		<description>Looks like agrep currently fails on non-latin encodings.  If the input file has a non-latin character then agrep just stops processing with no errors.</description>
		<content:encoded><![CDATA[<p>Looks like agrep currently fails on non-latin encodings.  If the input file has a non-latin character then agrep just stops processing with no errors.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Julien</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-66</link>
		<dc:creator>Julien</dc:creator>
		<pubDate>Wed, 07 Jul 2010 17:09:50 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-66</guid>
		<description>First, thank you for this library, it is really useful.
I could not reach the mailing list through your website, it seems the link is dead. http://laurikari.net/mailman/listinfo/tre-general)

I have stumbled upon an unexpected behavior:

pattern: &#039;1111$&#039;
str: &#039;1111&#039; cost 0
str: &#039;1111 &#039; cost 1 (as expected)

pattern: &#039;2004$&#039;
str: &#039;2004&#039; cost 0
str: &#039;2004 &#039; cost 2 (strange?)

I used a fresh download and default normal compilation
http://laurikari.net/tre/tre-0.8.0.tar.bz2  
./configure 
and then python setup.py install
My platform is Ubuntu 10.04

Here is the python code:
&lt;code&gt;
import tre
fz = tre.Fuzzyness(maxerr=3)

for pattern in [&#039;1111$&#039;, &#039;2004$&#039;]:
    pt = tre.compile(pattern, tre.EXTENDED)
    print &quot;pattern:&quot;, repr(pattern)

    for test_str in [&#039;1111&#039;,&#039;1111 &#039;, &#039;2004&#039;,&#039;2004 &#039;]:
        m = pt.search(test_str, fz) 
        if m:
	    print &quot;str:&quot;, repr(test_str), &quot;cost&quot;, m.cost
&lt;/code&gt;

Is this a known issue and something that would be fixed?
Thanks again.
Julien</description>
		<content:encoded><![CDATA[<p>First, thank you for this library, it is really useful.<br />
I could not reach the mailing list through your website, it seems the link is dead. <a href="http://laurikari.net/mailman/listinfo/tre-general)" rel="nofollow">http://laurikari.net/mailman/listinfo/tre-general)</a></p>
<p>I have stumbled upon an unexpected behavior:</p>
<p>pattern: &#8216;1111$&#8217;<br />
str: &#8216;1111&#8242; cost 0<br />
str: &#8216;1111 &#8216; cost 1 (as expected)</p>
<p>pattern: &#8216;2004$&#8217;<br />
str: &#8216;2004&#8242; cost 0<br />
str: &#8216;2004 &#8216; cost 2 (strange?)</p>
<p>I used a fresh download and default normal compilation<br />
<a href="http://laurikari.net/tre/tre-0.8.0.tar.bz2" rel="nofollow">http://laurikari.net/tre/tre-0.8.0.tar.bz2</a><br />
./configure<br />
and then python setup.py install<br />
My platform is Ubuntu 10.04</p>
<p>Here is the python code:<br />
<code><br />
import tre<br />
fz = tre.Fuzzyness(maxerr=3)</p>
<p>for pattern in ['1111$', '2004$']:<br />
    pt = tre.compile(pattern, tre.EXTENDED)<br />
    print "pattern:", repr(pattern)</p>
<p>    for test_str in ['1111','1111 ', '2004','2004 ']:<br />
        m = pt.search(test_str, fz)<br />
        if m:<br />
	    print "str:", repr(test_str), "cost", m.cost<br />
</code></p>
<p>Is this a known issue and something that would be fixed?<br />
Thanks again.<br />
Julien</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Blaisorblade</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-44</link>
		<dc:creator>Blaisorblade</dc:creator>
		<pubDate>Sat, 01 May 2010 17:43:38 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-44</guid>
		<description>Hi, I&#039;ve found a reference to your library here:
http://patrakov.blogspot.com/2009/09/matching-multiple-strings.html
in that article, it is compared (for a testcase of &quot;(word1&#124;word2&#124;....&#124;wordN)&quot;) to a trie-based library and GNU glibc implementation.

While here I see a vastly opposite result:
http://hackerboss.com/is-your-regex-matcher-up-to-snuff/

Now, is this slowdown fixable in your library, at least for non-approximate matching? Do you think there&#039;s a special case for such patterns (using a tree for them) or is the slowdown just a product of using NFA&#039;s versus recursive backtracking*?

*I just read the article you quote on your blog: 
&lt;a href=&quot;http://swtch.com/~rsc/regexp/regexp1.html&quot; rel=&quot;nofollow&quot;&gt;Regular Expression Matching Can Be Simple And Fast&lt;/a&gt; by Russ Cox.</description>
		<content:encoded><![CDATA[<p>Hi, I&#8217;ve found a reference to your library here:<br />
<a href="http://patrakov.blogspot.com/2009/09/matching-multiple-strings.html" rel="nofollow">http://patrakov.blogspot.com/2009/09/matching-multiple-strings.html</a><br />
in that article, it is compared (for a testcase of &#8220;(word1|word2|&#8230;.|wordN)&#8221;) to a trie-based library and GNU glibc implementation.</p>
<p>While here I see a vastly opposite result:<br />
<a href="http://hackerboss.com/is-your-regex-matcher-up-to-snuff/" rel="nofollow">http://hackerboss.com/is-your-regex-matcher-up-to-snuff/</a></p>
<p>Now, is this slowdown fixable in your library, at least for non-approximate matching? Do you think there&#8217;s a special case for such patterns (using a tree for them) or is the slowdown just a product of using NFA&#8217;s versus recursive backtracking*?</p>
<p>*I just read the article you quote on your blog:<br />
<a href="http://swtch.com/~rsc/regexp/regexp1.html" rel="nofollow">Regular Expression Matching Can Be Simple And Fast</a> by Russ Cox.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Enno</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-39</link>
		<dc:creator>Enno</dc:creator>
		<pubDate>Mon, 12 Apr 2010 08:33:36 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-39</guid>
		<description>Hi Ville,
I tried to run the retest application in MS Visual Studio in debug mode ... but it would not run because there was an error in the tre.vsproj configuration file that I loaded from darcs:
The additional link library (for project &quot;tre&quot;: properties/configuration properties/Linker/Input/Additional Dependencies) for debug configuration should be &quot;mscvprt&lt;b&gt;d&lt;/b&gt;.lib&quot;.
After I added the &quot;d&quot; character to to library everything was ok!
Regrads, Enno.</description>
		<content:encoded><![CDATA[<p>Hi Ville,<br />
I tried to run the retest application in MS Visual Studio in debug mode &#8230; but it would not run because there was an error in the tre.vsproj configuration file that I loaded from darcs:<br />
The additional link library (for project &#8220;tre&#8221;: properties/configuration properties/Linker/Input/Additional Dependencies) for debug configuration should be &#8220;mscvprt<b>d</b>.lib&#8221;.<br />
After I added the &#8220;d&#8221; character to to library everything was ok!<br />
Regrads, Enno.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ville Laurikari</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-29</link>
		<dc:creator>Ville Laurikari</dc:creator>
		<pubDate>Fri, 12 Mar 2010 11:46:07 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-29</guid>
		<description>Steve, you’ve run into a confusing hack of mine. Please accept my apologies.

In the normal case, tre_ctype() is just the same as the wctype() function from the C library. It returns a “character class” object. The counterpart of tre_ctype() is tre_isctype(), which takes a character and a character class object, and returns non-zero if the character is part of the character class. Normally tre_isctype() is the same as iswctype().

If the system does not have wctype() and iswctype(), TRE uses it’s own implementation. In this case, the character class object returned by tre_ctype() is actually a function which gets called by tre_isctype().

You can find the macros that control this in tre-internal.sh. Search for SYSTEM_WCTYPE.</description>
		<content:encoded><![CDATA[<p>Steve, you’ve run into a confusing hack of mine. Please accept my apologies.</p>
<p>In the normal case, tre_ctype() is just the same as the wctype() function from the C library. It returns a “character class” object. The counterpart of tre_ctype() is tre_isctype(), which takes a character and a character class object, and returns non-zero if the character is part of the character class. Normally tre_isctype() is the same as iswctype().</p>
<p>If the system does not have wctype() and iswctype(), TRE uses it’s own implementation. In this case, the character class object returned by tre_ctype() is actually a function which gets called by tre_isctype().</p>
<p>You can find the macros that control this in tre-internal.sh. Search for SYSTEM_WCTYPE.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Teale</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-20</link>
		<dc:creator>Steve Teale</dc:creator>
		<pubDate>Fri, 29 Jan 2010 09:57:54 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-20</guid>
		<description>I am trying to translate the non-fuzzy part of TRE into D. I noticed this:

tre_ctype_t tre_ctype(const char *name)
{
  int i;
  for (i = 0; tre_ctype_map[i].name != NULL; i++)
    {
      if (strcmp(name, tre_ctype_map[i].name) == 0)
	return tre_ctype_map[i].func;
    }
  return (tre_ctype_t)0;
}

It is prototyped as returning a character type, but if name is found, it returns a pointer to a function. The parsing code behaves as if it returned a character (I think). Could you possibly explain?

Thanks Steve</description>
		<content:encoded><![CDATA[<p>I am trying to translate the non-fuzzy part of TRE into D. I noticed this:</p>
<p>tre_ctype_t tre_ctype(const char *name)<br />
{<br />
  int i;<br />
  for (i = 0; tre_ctype_map[i].name != NULL; i++)<br />
    {<br />
      if (strcmp(name, tre_ctype_map[i].name) == 0)<br />
	return tre_ctype_map[i].func;<br />
    }<br />
  return (tre_ctype_t)0;<br />
}</p>
<p>It is prototyped as returning a character type, but if name is found, it returns a pointer to a function. The parsing code behaves as if it returned a character (I think). Could you possibly explain?</p>
<p>Thanks Steve</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ville Laurikari</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-14</link>
		<dc:creator>Ville Laurikari</dc:creator>
		<pubDate>Wed, 02 Dec 2009 09:45:22 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-14</guid>
		<description>Sorry about that, and sorry for the delay (your comment got caught in the spam filter).  Now the darcs repo is properly updated.

I have no problems downloading files from the web interface.</description>
		<content:encoded><![CDATA[<p>Sorry about that, and sorry for the delay (your comment got caught in the spam filter).  Now the darcs repo is properly updated.</p>
<p>I have no problems downloading files from the web interface.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bang Jun-young</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-11</link>
		<dc:creator>Bang Jun-young</dc:creator>
		<pubDate>Tue, 10 Nov 2009 00:34:33 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-11</guid>
		<description>Checked out the sources with command &#039;darcs get --set-scripts-executable http://laurikari.net/tre/darcs/stable/&#039; as described on the download page, but I still can&#039;t find Visual Studio 2008 files there.

Downloading the files from the darcs web interface by clicking on the &#039;plain&#039; link doesn&#039;t work either. What I actually got were HTML-decorated text (&lt; garbled with &amp;lt\;) rather than plain text.

Downloading from the browser&#039;s view-source page doesn&#039;t work either. This time the server refuses to send data to the browser. :-(</description>
		<content:encoded><![CDATA[<p>Checked out the sources with command &#8216;darcs get &#8211;set-scripts-executable <a href="http://laurikari.net/tre/darcs/stable/" rel="nofollow">http://laurikari.net/tre/darcs/stable/</a>&#8216; as described on the download page, but I still can&#8217;t find Visual Studio 2008 files there.</p>
<p>Downloading the files from the darcs web interface by clicking on the &#8216;plain&#8217; link doesn&#8217;t work either. What I actually got were HTML-decorated text (&lt; garbled with &amp;lt\;) rather than plain text.</p>
<p>Downloading from the browser&#039;s view-source page doesn&#039;t work either. This time the server refuses to send data to the browser. <img src='http://laurikari.net/tre/wp/wp-includes/images/smilies/icon_sad.gif' alt=':-(' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ville Laurikari</title>
		<link>http://laurikari.net/tre/tre-0-8-0-released/comment-page-1/#comment-7</link>
		<dc:creator>Ville Laurikari</dc:creator>
		<pubDate>Wed, 07 Oct 2009 18:19:40 +0000</pubDate>
		<guid isPermaLink="false">http://laurikari.net/tre/?p=192#comment-7</guid>
		<description>Indeed.  I went and updated the agrep page regarding TRE license.</description>
		<content:encoded><![CDATA[<p>Indeed.  I went and updated the agrep page regarding TRE license.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
