<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: 利用Minidx Extract-Text Com组件从doc,Xls,Pdf&#8230;&#8230;等读取文本内容VC Demo</title>
	<atom:link href="http://blog.minidx.com/2008/01/10/373.html/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.minidx.com/2008/01/10/373.html</link>
	<description>中文Adobe Flex例子,Flex实例教程,RIA资源,全文检索技术,算法和数据结构</description>
	<lastBuildDate>Tue, 17 Jan 2012 07:33:13 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: wind</title>
		<link>http://blog.minidx.com/2008/01/10/373.html/comment-page-2#comment-20615</link>
		<dc:creator>wind</dc:creator>
		<pubDate>Sun, 27 Mar 2011 11:19:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.minidx.com/2008/01/10/373.html#comment-20615</guid>
		<description>老大，什么时候放一个支持win7/windows 2008 server版的呢？期待中。。。</description>
		<content:encoded><![CDATA[<p>老大，什么时候放一个支持win7/windows 2008 server版的呢？期待中。。。</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: conan</title>
		<link>http://blog.minidx.com/2008/01/10/373.html/comment-page-2#comment-17439</link>
		<dc:creator>conan</dc:creator>
		<pubDate>Mon, 26 Jul 2010 01:25:27 +0000</pubDate>
		<guid isPermaLink="false">http://blog.minidx.com/2008/01/10/373.html#comment-17439</guid>
		<description>你好，我在使用过程中发现ExtractText.dll好像有内存泄露问题，这是我源码：
	BSTR cval;			// extracted text
	LPTSTR re;
	ULONGLONG lFileSize;	// file size
	ITextExtractor *te = NULL;
	HRESULT			hr;                  	// Declare and HRESULT and a pointer to the Simple_ATL interface

	hr = CoInitialize(0);		// Now we will intilize COM
	BSTR fileName = _com_util::ConvertStringToBSTR(strFileName);
	if(SUCCEEDED(hr))
	{
		hr = CoCreateInstance( CLSID_TextExtractor, NULL, CLSCTX_INPROC_SERVER,IID_ITextExtractor, (void**) &amp;te);//尝试调用COM组件
		if (hr == REGDB_E_CLASSNOTREG)//如果组件未注册则使用MyCoCreateInstance直接调用
		{
			hr = MyCoCreateInstance(&quot;ExtractText.dll&quot;, CLSID_TextExtractor, NULL,IID_ITextExtractor, (void**) &amp;te);
		}
		hr = te-&gt;ExtractText(fileName, (long)lFileSize,&amp;cval);
		hr = te-&gt;Release();
	}
	CoUninitialize();
	::SysFreeString(fileName);
	if(cval)
	{
		re = _com_util::ConvertBSTRToString(cval); 
		::SysFreeString(cval);
		return re;
	}
	return NULL;
}
其中当debug到		hr = te-&gt;ExtractText(fileName, (long)lFileSize,&amp;cval);这一步时系统出现first exception Access   Violation错误，当进行全盘搜索所有文件时，内存会占用的几百M,请您帮忙看一下，谢谢。其中re已经在函数外部用delete[]进行释放。</description>
		<content:encoded><![CDATA[<p>你好，我在使用过程中发现ExtractText.dll好像有内存泄露问题，这是我源码：<br />
	BSTR cval;			// extracted text<br />
	LPTSTR re;<br />
	ULONGLONG lFileSize;	// file size<br />
	ITextExtractor *te = NULL;<br />
	HRESULT			hr;                  	// Declare and HRESULT and a pointer to the Simple_ATL interface</p>
<p>	hr = CoInitialize(0);		// Now we will intilize COM<br />
	BSTR fileName = _com_util::ConvertStringToBSTR(strFileName);<br />
	if(SUCCEEDED(hr))<br />
	{<br />
		hr = CoCreateInstance( CLSID_TextExtractor, NULL, CLSCTX_INPROC_SERVER,IID_ITextExtractor, (void**) &amp;te);//尝试调用COM组件<br />
		if (hr == REGDB_E_CLASSNOTREG)//如果组件未注册则使用MyCoCreateInstance直接调用<br />
		{<br />
			hr = MyCoCreateInstance(&#8221;ExtractText.dll&#8221;, CLSID_TextExtractor, NULL,IID_ITextExtractor, (void**) &amp;te);<br />
		}<br />
		hr = te-&gt;ExtractText(fileName, (long)lFileSize,&amp;cval);<br />
		hr = te-&gt;Release();<br />
	}<br />
	CoUninitialize();<br />
	::SysFreeString(fileName);<br />
	if(cval)<br />
	{<br />
		re = _com_util::ConvertBSTRToString(cval);<br />
		::SysFreeString(cval);<br />
		return re;<br />
	}<br />
	return NULL;<br />
}<br />
其中当debug到		hr = te-&gt;ExtractText(fileName, (long)lFileSize,&amp;cval);这一步时系统出现first exception Access   Violation错误，当进行全盘搜索所有文件时，内存会占用的几百M,请您帮忙看一下，谢谢。其中re已经在函数外部用delete[]进行释放。</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Minidxer</title>
		<link>http://blog.minidx.com/2008/01/10/373.html/comment-page-2#comment-16226</link>
		<dc:creator>Minidxer</dc:creator>
		<pubDate>Fri, 07 May 2010 07:23:29 +0000</pubDate>
		<guid isPermaLink="false">http://blog.minidx.com/2008/01/10/373.html#comment-16226</guid>
		<description>@LYK
该控件已经在成熟产品中应用了. 存在内存没释放的可能性非常小.</description>
		<content:encoded><![CDATA[<p>@LYK<br />
该控件已经在成熟产品中应用了. 存在内存没释放的可能性非常小.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: LYK</title>
		<link>http://blog.minidx.com/2008/01/10/373.html/comment-page-2#comment-16225</link>
		<dc:creator>LYK</dc:creator>
		<pubDate>Fri, 07 May 2010 07:21:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.minidx.com/2008/01/10/373.html#comment-16225</guid>
		<description>能否把ExtractText.dll控件的源码提供我们学习学习。谢谢
这个里面好像有内存申请了没有释放，</description>
		<content:encoded><![CDATA[<p>能否把ExtractText.dll控件的源码提供我们学习学习。谢谢<br />
这个里面好像有内存申请了没有释放，</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: LYK</title>
		<link>http://blog.minidx.com/2008/01/10/373.html/comment-page-2#comment-16217</link>
		<dc:creator>LYK</dc:creator>
		<pubDate>Thu, 06 May 2010 02:16:06 +0000</pubDate>
		<guid isPermaLink="false">http://blog.minidx.com/2008/01/10/373.html#comment-16217</guid>
		<description>你的这个控件，在读取doc，pdf时有时候内存增长了，却不释放。一直这样读下去内存不停的增长，有没有什么好的办法解决，还有一个问题？有没有支持office2007的docx读取控件</description>
		<content:encoded><![CDATA[<p>你的这个控件，在读取doc，pdf时有时候内存增长了，却不释放。一直这样读下去内存不停的增长，有没有什么好的办法解决，还有一个问题？有没有支持office2007的docx读取控件</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk (enhanced)
Database Caching 1/10 queries in 0.075 seconds using disk
Object Caching 160/320 objects using disk

Served from: blog.minidx.com @ 2012-02-10 00:00:51 -->
