Internet Communications Blog

Searching the Web May 19, 2008

Filed under: Module 4 — tooyu @ 9:33 pm
Tags:

Task:   Make sure you have a meta searching program installed, such as Copernic or Sherlock. Choose your most commonly used internet search engine and do a search with words of your choosing.

Log entry:  For this exercise, I chose terms used I’d researched for my concepts assignment - for concept # 9 ’Permanent Ephemerality’.  The terms I used were: Internet, persistence, ephemerality, permanence.

I used Google, which found around 1,050 hits (fairly small compared to a normal Google search). The first page of hits is shown below, with the first being my blog entry for the Concepts assignment! Other results are more mixed – some were references that I cited in my assignment, while others were completely off my intended topic.

  Google search results

 

Task:  Using copernicus or similar, set it up to search at least three search engines (including one that will search the ‘deep web’ and repeat exactly the same search.

Log entry:  For this task I used the Copernic Agent Basic meta-search engine. Using the same terms as the previous exercise, it returned 36 hits.  The first few were paid advertsing, and the first unpaid term was unrelated to what I was searching for.  However, it did return some interesting hits that the Google search didn’t. On the whole, however, I found it annoying having to navigate between the ‘junk’ returns and those that were relevant.

Copernic search results

Because the exercise using Copernic didn’t appear that successful, I also tried the Turbo10 search engine, which reportedly lists searches from popular search engines and the deep web.  It returned only 9 hits, and again the first appeared to be unrelated to the seach terms. This suggests that my choice of search terms could have been further refined.

 Turbo10 results

An interesting feature of this search engine is that it includes screen shots of the pages next to the hit description. However, this isn’t particularly useful as the screen shots are very small. In addition, the summary information for each hit is very brief and doesn’t give you sufficient information about the web page it refers to.  While it did return one additional useful reference, the low number of hits and irrelevant returns mean that I wouldn’t be inclined to use it again.

 

Task:  Read Boolean searching on the Internet.  Think about how you would best search for the following:

  • the biggest number of hits relating to these key words
  • information most relevant to what you ACTUALLY wanted to look for!
  • information coming only from university sources

Log entry:  My strategies to refine the search terms are listed below.

  • To find the biggest number of hits relating to these key words – this is the strategy I adopted initially – i.e. to list all the terms, and to look for hits for any that return these words.
  • To find information most relevant to what I want to look for – following the mixed results from my previous searches, which included poetry and theatre guides, I refined my search to exclude ‘art’ and ’performance’. This reduced my Google search results from 1,050 to 31. This approach still returned many hits that were off-topic, but the lower number of hits overall meant that these were more manageable to sort through. 
  • To find information only from univesity sources – I added ‘.edu’ to the search terms. While this may also able to pick up iformation from non-university sources (e.g. schools), the number is likely to be small. This strategy returned 993 hits. Interestingly, all the hits found under the second approach (excluding ‘art’ and ‘performance) were also from .edu domains.

 

Task:  Choose the best three sources found in the previous task. Using any useful software, record this information on your computer  in the way that you think will be most useful to you in terms of preparing for your assignment (that is, in terms of how easy you will find it to access, retrieve, understand, contextualise, and recall why you saved it – and with what).

  • URL
  • author
  • instution
  • blurb / summary / screen shot (this can be a direct copy of an appropriate abstract or introductory paragraph on the site: but make sure you record and reference it as such)

Log entry:  I used Google bookmarks to record the information for this exercise.

This bookmarking system allows me to save the same bookmark under multiple topics. It’s also flexible enough to allow me to record metadata of the nature referred to above, and to search for any of that information. However, it doesn’t have more advanced features that some bookmark managers have, such as including screenshots.

I find that the best feature of this bookmarking system is its portability – I can have access to my saved bookmarks on any computer that has the Google toolbar downloaded. Because I use more than one computer for my university work, this feature is invaluable and makes me more inclined to invest the effort in maintaining and managing my bookmarks. 

The Information I saved in Google bookmarks is listed below:

  1. Smith, A. (2003), In Support of Long-term access, in Council on Library and Information Resources.  Accessed 24/5/08 from  http://www.clir.org/pubs/reports/pub126/smith2.html  Extract – ‘The central question that the papers address is whether or not the infrastructure in place for preservation is appropriate to the new information environment.’  
    • Saved under Net11-Module 4-permanent ephemerality 
  2. Graham, P.S. (2000), Issues in Digital Archiving, in Preservation. Accessed 24/5/08 from http://web.syr.edu/~psgraham/pgsite/pglibwork/pgtexts/digarchiv2000.html. Syracuse University Library. Extract – ‘Most libraries are now trying to provide an ever-increasing volume of scholarly electronic information to their clienteles. Research libraries have taken on the provision, organization and preservation of information with the same long-term commitment made for print materials. It is an expensive, uncharted and difficult task.’
    • Saved under Net11-Module 4-permanent ephemerality 
  3. McBride, M. (2008), Select-all delete: Endangered Species? from Melabnie McBride Online. Accessed 24/5/08 from http://melaniemcbride.net/2008/04/28/select-all-delete-an-endangered-species/ First para – ‘Select-all is one of my favourite user controls. It allows me to quickly move, archive or delete large volumes of data quickly and easily. It’s also a means of defining my ownership over my content. In an increasingly undemocratic web of surveillance, abuse of power and corporate control, I believe users deserve improved control over their data.’
    • Saved under Net11-Module 4-permanent ephemerality