SIGIA-L Mail Archives: Re: SIGIA-L: XML Search Engines?

From: Avi Rappoport
Date: Wed Jul 11 2001 - 17:14:53 EDT

At 12:44 PM -0500 7/11/01, Matthew Theobald wrote:
>'lo all,
>What search engines exist that search XML rather than HTML?

Google does (test by searching for "inurl: searchtools-rss.xml"), but
none of the other web-wide search engine robots will touch XML files.

>Is XML searching currently contained only to a site or an array of sites
>that comply with a common DTD?

XML files can be treated as unstructured text files, so a search
engine doesn't have to know about DTDs or Schemas. However, this
means you lose a good portion of the value of XML markup, including
what to put as the title! Some text search engines, such as Inktomi
(formerly Ultraseek), can recognize tags as well.
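
To make the difference concrete, here is a rough sketch in Python (not
how any particular engine works). It indexes the same XML file two
ways: as flat text with the markup thrown away, and as named fields
pulled from tags. The sample document, the function names, and the
choice of "title" and "description" as fields are all assumptions made
up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical RSS-like sample; the content is invented for this sketch.
SAMPLE = """<rss version="0.91">
  <channel>
    <title>Search Tools News</title>
    <description>Reports on site search engines</description>
  </channel>
</rss>"""


def flat_text(xml_source):
    """Treat the file as unstructured text: keep the words, drop the tags."""
    root = ET.fromstring(xml_source)
    # itertext() yields every text node; split/join collapses the whitespace.
    return " ".join(" ".join(root.itertext()).split())


def fielded_text(xml_source, tags=("title", "description")):
    """Recognize selected tags so each one becomes a searchable field."""
    root = ET.fromstring(xml_source)
    fields = {}
    for tag in tags:
        element = root.find(f".//{tag}")  # first matching element, any depth
        if element is not None and element.text:
            fields[tag] = element.text.strip()
    return fields


if __name__ == "__main__":
    print(flat_text(SAMPLE))
    print(fielded_text(SAMPLE))
```

With the flat version the indexer has the words but no way to tell
which ones are the title. The fielded version keeps that information,
at the cost of having to know which tags matter.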

>Any recommendations?

Please take a look at my page on XML search
<> and my listings for XML
search engines at
<>. Feel free to
contact me directly if you have any other questions.


Search Server Industry Analysis from Search Tools Consulting
    (510) 845-2551  -- <mailto:>
Complete Guide to Search Engines for Web Sites, Intranets,
       and Portals: <>

This archive was generated by hypermail 2.1.2 : Sun Nov 23 2003 - 22:54:47 EST