PUTTING MDS AND LSI TO WORK

Creative Approaches

Much of the work in applied MDS has come from the fields of advertising and cognitive psychology (where it is also known as perceptual mapping). Researchers in both fields use the technique to transform questionnaires about relative preferences and similarities into a visual representation using the scaling techniques we have outlined. These techniques do not appear to have been applied to linguistic data until relatively recently.

This illustrates a common theme in latent semantic research - combining familiar techniques from different disciplines in a novel way to tackle problems in data retrieval. This kind of creative juxtaposition is one of the things that makes LSI interesting to work on, and levels the playing field between major research institutions and liberal arts colleges. One does not need an enormous supercomputer or advanced mathematical knowledge to do interesting work with these techniques. In fact, because LSI research draws on pure and applied mathematics, linguistics, computer science, psychology, information retrieval, and the social sciences, what really matters is breadth of knowledge. There are likely to be connections further afield that remain to be discovered.

With this eclectic background in mind, here are some potential applications of semantic indexing coupled with MDS data visualization:

  1. Archive Management Tools:

    We already mentioned the potential use of LSI as an archivist's assistant, using LSI to highlight content patterns in a data collection, and more traditional taxonomies to formalize and heighten those patterns. One intuitive method for creating such tools is to display data visually using MDS, and allow for human feedback. An interactive program using multi-dimensional scaling would allow an archivist to graphically manipulate data, draw boundaries between clusters, examine content relationships and add classifiers using an intuitive, click-and-drag type interface. What's more, different expert users would be able to use MDS to generate their own personal view of a data set, and then reconcile or combine those views.

  2. Concept Maps:

    Concept maps take this notion of interactivity and classification further, letting users manipulate and edit LSI-generated views of a data collection to produce a spatial map of topics and concepts. By drawing connections between items and moving them around, users can create their own view of a data collection. These views can be 'untangled' using mathematical techniques to create clear, visually direct concept maps. These maps can be shared, combined, and compared with others, making a unique pedagogical or research tool.

  3. Bioinformatics:

    The same LSI techniques we use to find similarities in language have enormous potential in the field of bioinformatics. Both DNA and protein molecules consist of long strings of biochemical 'words'. Finding and understanding patterns in those words is one of the major research problems in modern biology. Using the tools we describe would make it possible to detect and visualize such patterns, and conduct important basic research in this nascent field.



< previous     next >



This work is licensed under a Creative Commons License. 2002 National Institute for Technology in Liberal Education. For more info, contact the author.

Gain a Competitive Advantage Today

Your top competitors have been investing into their marketing strategy for years.

Now you can know exactly where they rank, pick off their best keywords, and track new opportunities as they emerge.

Explore the ranking profile of your competitors in Google and Bing today using SEMrush.

Enter a competing URL below to quickly gain access to their organic & paid search performance history - for free.

See where they rank & beat them!

  • Comprehensive competitive data: research performance across organic search, AdWords, Bing ads, video, display ads, and more.
  • Compare Across Channels: use someone's AdWords strategy to drive your SEO growth, or use their SEO strategy to invest in paid search.
  • Global footprint: Tracks Google results for 120+ million keywords in many languages across 28 markets
  • Historical data: since 2009, before Panda and Penguin existed, so you can look for historical penalties and other potential ranking issues.
  • Risk-free: Free trial & low price.
Your competitors, are researching your site

Find New Opportunities Today








    Email Address
    Pick a Username
    Yes, please send me "7 Days to SEO Success" mini-course (a $57 value) for free.

    Learn More

    We value your privacy. We will not rent or sell your email address.