A Politics Mash-Up
Short Description
A mashup in the worldwide web is an application that collects data from multiple sources of information and presents them in a unique way. This paper describes a mashup on politics that integrates both geographical and chronological dimensions regarding the 2008 presidential election of the United States of America.
Website: www.seas.upenn.edu | Filesize: 278kb
No of Page(s): 13
Content
…
The system is composed of three major subsystems. First, the back subsystem is responsible for collecting data from multiple sources of information, analyzing the assembled data, and extracting useful information from them. Second, the middle subsystem is responsible for permanently storing constructed information in a structured way so that it serves as a link between the front and back. Finally the front subsystem is responsible for presenting these pieces of information in a geographically and chronologically meaningful way.
Back End: Crawling
The back end is mainly based on several web crawlers which periodically fetches news articles, poll results, etc. that are available on the web and store them. Stand-alone web crawler programs were developed in Java language, version 6, to leverage a rich set of available libraries that facilitates development process. The Eclipse platform was used as it is the most widely used Java development tool. It was developed and tested in a regular desktop IBM-PC. For the networking modules, Sun Microsystems’s own network libraries as well as Apache.org’s HTTP Client (http://jakarta.apache.org/httpcomponents/) library have been used. The crawler is a single-threaded process.
…
Get the file Download here
Related Books:Related Searches: java development tool, web crawlers, web crawler, jakarta apache, network libraries
Comments
Leave a Reply