Thousands of servers... billions of web pages... the possibility of individually sifting through the WWW is nil. The search engine gods cull the information you need from the Internet, from tracking down an elusive expert for communication to presenting the most unconventional views on the planet. Name it and click it. Beyond all the hype created about the web heavens they rule, let's attempt to keep the argument balanced. From Google to Voice of the Shuttle (for humanities research), these ubiquitous gods that enrich the net can be unfair... and do have pitfalls. And considering the rate at which the Internet continues to grow, the problems of these gods are only exacerbated further.

Primarily, what you need to digest is the fact that search engines fall short of Mandrake's magic mechanism! They don't simply create URLs out of thin air but instead send their spiders crawling across those sites that have offered prayers (and expensive offerings!) to them for consideration. Even when sites like Google claim to have a massive 3 billion web pages in their databases, a large portion of the web nation is invisible to these spiders. To think, they are simply ignorant of the Invisible Web! This invisible web holds content that normal search engines can't index, because the information on many web sites sits in databases that are only searchable within that site. Sites like The Internet Movie Database, IncyWincy (the invisible web search engine) and The Complete Planet, which cover this area, are perhaps the only way you can access content from that portion of the Internet, invisible to the search gods. Here, you don't perform a direct content search but search for the resources that may give access to the content. (Meaning: be sure to set aside considerable time for digging.)

None of the search engines indexes everything on the Web (I mean none). Tried looking for research literature on popular search engines? Everything from AltaVista to Yahoo will list thousands of sources on education, human resource development and so on, but mostly from magazines, newspapers and various organizations' own Web pages, rather than from research journals and dissertations - the main sources of research literature. That's because most of the journals and dissertations are not yet publicly available on the Web. Thought they'd get you all that's hosted on the web? Think again.

The Web is huge and growing exponentially. Simple searches, using a single word or phrase, will often yield thousands of "hits", most of which will be irrelevant. A layman going to the Internet for a piece of info has to deal with a more severe issue - too much information! And if you don't learn how to control the information overload from the websites returned by a search, roll out the red carpet for some frustration. A very common problem results from sites that have a lot of pages with similar content. For example, if a discussion thread (in a forum) goes on for a hundred posts, there will be a hundred pages, all with similar titles, each containing a wee bit of information. Now instead of just one link, all hundred of those darn pages will crop up in your search results, crowding out other relevant sites. Regardless of all the sophistication technology has brought in, many well thought-out search phrases produce list after list of irrelevant web pages. The typical search still requires sifting through dirt to find the gold. If you are not specific enough, you may get too many irrelevant hits.

As said, these search engines do not actually search the web directly but search a database on their centralized server instead. And unless this database is updated continually to index modified, moved, deleted or renamed documents, you will land yourself amidst broken links and stale copies of web pages. So if they inadequately handle dynamic web pages whose content changes frequently, chances are that the information they reference will quickly go out of date. After they wage their never-ending war with over-zealous promoters (spamdexers, rather), where do they have time to keep their databases current and their search algorithms tuned? No surprise if a perfectly worthwhile site goes unlisted!

Similarly, many of the Web search engines are undergoing rapid development and are not well documented. You will have only an approximate idea of how they work, and unknown shortcomings may cause them to miss desired information. Not to mention that, amongst the first-class information, the web also houses false, misleading, deceptive and dressed-up information actually produced by charlatans. The Web itself is unstable, and tomorrow they may not find you the site they found you today. Well, if you could predict them, they would not be gods... would they?! The syntax (word order and punctuation) for various types of complex searches varies somewhat from search engine to search engine, and small errors in the syntax can seriously compromise the search. For instance, try the same phrase search on different search engines and you'll know what I mean. Novices, read this line: using search engines does involve a learning curve. Many beginning Internet users, because of these disadvantages, become discouraged and frustrated.

As a journalist put it, "Not showing favoritism to its business clients is certainly a rare virtue in these times." Search engines have increasingly turned to two significant revenue streams. Paid placement: in addition to the main editorial-driven search results, the search engines display a second - and sometimes third - listing that's usually commercial in nature. The more you pay, the higher you'll appear in the search results. Paid inclusion: an advertiser or content partner pays the search engine to crawl its site and include the results in the main editorial listing. So? The site is more likely to be in the hit list, but then again - no guarantees. Of course, among those refusing to favor certain devotees are industry leaders like Google, which publishes paid listings but clearly marks them as 'Sponsored Links.'

The possibility of these 'for-profit' search gods (which haven't yet made much profit) taking fees to skew their searches can't be ruled out. But for you as a searcher, the hit list the engine provides should obviously be ranked in order of relevancy and interest. Search command languages can often be complex and confusing, and the ranking algorithm is unique to each god: it may be based on the number of occurrences of the search phrase in a page, on whether the phrase appears in the page title, a heading, the URL itself or the meta tags, or on a weighted average of a number of these relevance scores. For example, Google uses its patented PageRank(TM) and ranks the importance of search results by examining the links that lead to a specific site. The more links that lead to a site, the higher the site is ranked. Pop on popularity!

Alta Vista, HotBot, Lycos, Infoseek and MSN Search use keyword indexes - fast access to millions of documents. A large number of sites are indexed, but the lack of an index structure and poor accuracy, given the size of the WWW, will not make searching any easier. Keyword searching can be difficult to get right. In reality, the prevalence of a certain keyword is not always in proportion to the relevance of a page. Take this example. A search on sari - the national costume of India - in a popular search engine returned, among its top sites, the following links:

- the Scottish Crop Research Institute
- a health resort in Indonesia
- The South Asia Regional Initiative for Energy Cooperation and Development

Pretty useful sites for someone very much interested in knowing how to drape a sari, or in its tradition?! (Well, no prayer goes unanswered... whether you like the answer or not!) Rather than simply counting the number of instances of a word on a page, search engines are attempting to make the rankings better by assigning more weight to keywords that appear in things like titles, subheadings, and so on. Now, unless you have a clear idea of what you're looking for, it may be difficult or impossible to use a keyword search, especially if the vocabulary of the subject is unfamiliar. Similarly, the concept-based search of Excite (instead of matching individual words, it groups the words you enter into a search and attempts to determine their meaning) is a difficult task and yields inconsistent results.

Besides, who reviews or evaluates these sites for quality or authority? They are simply compiled by a computer program. These active search engines rely on computerized retrieval mechanisms called "spiders", "crawlers" or "robots" to visit Web sites on a regular basis and retrieve relevant keywords to index and store in a searchable database. And this huge database often yields unmanageably comprehensive results... results whose relevance is determined by their computers. The irrelevant sites (a high percentage of noise, as it's called), questionable ranking mechanisms and poor quality control may be the result of too little human involvement to weed out junk. Thought human intervention would solve all the problems? Read on.

From Yahoo, one of the earliest, to About.com, Snap.com, Magellan, NetGuide, Go Network, LookSmart, NBCi and Starting Point, all subject directories index and review documents under categories, making them more manageable. Unlike active search engines, these passive or human-selected search engines don't roam the web directly and are human controlled, relying on individual submissions. They are perhaps the easiest to use in town, but their indexing structure covers only a small portion of the actual number of WWW sites, so they are certainly not your best bet if you intend to search specific, narrow or complex topics. Subject designations may be arbitrary, confusing or wrong. A search looks for matches only in the descriptions submitted. The directories never contain the full text of the web pages they link to - you can only search what you see: titles, descriptions, subject categories, etc. The human-labor-intensive process limits database currency, size, rate of growth and timeliness. You may have to branch through the categories repeatedly before arriving at the right page. They may be several months behind the times because of the need for human organization. Try looking for some obscure topic: chances are that the people who maintain the directory have excluded those pages. Obviously, machines can blindly count keywords but they can't make common-sense judgements as humans can. But then why do human-edited directories respond with all this junk?!

And now, about those meta search engines. A comprehensive search on the entire WWW using The Big Hub, Dogpile, Highway61, Internet Sleuth or Savvysearch, covering as many documents as possible, may sound as good an idea as one-stop shopping. Meta search engines do not create their own databases. They rely on existing active and passive search engine indexes to retrieve search results. And the very fact that they access multiple keyword indexes slows their response time. A meta search sure does save your time by searching several search engines at once, but at the expense of redundant, unwanted and overwhelming results and, more importantly, misses. The default search mode differs from search site to search site, so the same search is not always appropriate in different search engine software. The quality and size of the databases vary widely.

Weighted search engines like Ask Jeeves and RagingSearch allow the user to type queries in plain English without advanced searching knowledge, again at the expense of inaccurate and undetailed searching. Then there are review or ranking sources like Argus Clearinghouse, eblast.com and Librarian's Index to the Internet (lii.org). They evaluate the quality of websites they find or accept submissions from, but cover a minimal number of sites.

As a webmaster, registering your site with the biggest billboards in Times Square can get you closer to bingo! for the searcher. Those who didn't even know you existed before are in your living room in a New York minute!

Registering your URL is a no-brainer, considering the traffic it can send flocking to your site. It is certainly a quick and inexpensive method, yet it is only one component of the overall marketing strategy, one that in itself offers no guarantees and no instant results, and demands continued effort from the webmaster. Commerce rules the web. As a notable Internet caveman put it, "Web publishers also find dealing with search engines to be a frustrating pursuit. Everybody wants their pages to be easy for the world to find, but getting your site listed can be tough. Search sites may take a long time to list your site, may never list it at all, and may drop it after a few months for no reason. If you resubmit often, as it is very tempting to do, you may even be branded a spamdexer and barred from a search site. And as for trying to get a good ranking, forget it! You have to keep up with all the arcane and ever-changing rules of a dozen different search engines, and adjust the keywords on your pages just so... all the while fighting against the very plausible theory that in fact none of this stuff matters, and the search sites assign rankings at random or by whim."

"To make the best use of Web search engines - to find what you need and avoid an avalanche of irrelevant hits - pick search engines that are well suited to your needs. And lest you'd want to cry 'Ye immortal gods! Where in the world are we?', spend a few hours becoming moderately proficient with each. Each works somewhat differently, most importantly in respect to how you broaden or narrow a search."

Finding the appropriate search engine for your particular information need can be frustrating. To use these search engines effectively, it is important to understand what they are, how they work and how they differ. For example, while using a meta search engine, remember that each engine has its own methods of displaying and ranking results. Remember, search strategies affect the results. If the user is unaware of basic search strategies, results may be spotty.

Quoting Charlie Morris (the former editor of The Web Developer's Journal): "Search engines and directories survive, and indeed flourish, because they're all we've got. If you want to use the wealth of information that is the Web, you've got to be able to find what you want, and search engines and directories are the only way to do that. Getting good search results is a matter of chance. Depending on what you're searching for, you may get a meaty list of good resources, or you may get page after page of irrelevant drivel. By laboriously refining your search, and using several different search engines and directories (and especially by using appropriate specialty directories), you can usually find what you need in the end."

Search engines are very useful, no doubt. From getting a quick view of a topic to finding expert contact info... verily, certain issues lie in their lap. Now, the very reason we bother about these search engines so much is that they're all we've got! Though there sure is a lot of room for improvement, the need of the hour is to not get caught in the middle of the road. By simply understanding what, how and where to seek, you'd spare yourself the fate of chanting that old Jewish proverb, "If God lived on earth, people would break his windows."

Happy searching!

Liji is a postgraduate in Software Science, with a flair for writing on anything under the sun. She puts her dexterity to work, writing technical articles in her areas of interest, which include Internet programming, web design and development, ecommerce and other related issues.