Information Exploration and Retrieval:
A Brief Introduction to the Internet
North Carolina State University
Department of Statistics
Compiled from several network sources by
Paul Marsh
Tim Arnold
Table of Contents
The Internet - what is it? We certainly see and hear allusions to
this vague and vast term seemingly daily. Is it going to be the basis for
the "information highway"? Perhaps. Many basic information exploration
and retrieval tools are already available to virtually all computer
literates.
The Internet is broad and vast not only in geographic area but also
in the tools available to users. New users joining the Internet
community usually have the same questions: How do I get connected? Once
I am connected, what can I do?
This manual attempts to answer several such questions. Beginners
should scan the whole document first to get a conceptualization of the
exciting power of the Internet. Also, an overall view of the Internet is
needed since many of the information browsing tools work jointly to
perform a specific task. After that? Buckle up and try it!!
Loosely defined, a network is a set of computers using the same
protocol to communicate among themselves. In a general sense, a protocol
is the set of rules used in communicating.
Cybermap
(14K gif)
The Internet is a large collection of networks, all of which run the
same protocol (TCP/IP). The Internet started with the ARPANET, but now
includes such networks as NSFNET and NYSERnet. There are other major
networks, such as BITNET and DECnet, that are not based on the same
protocol and are thus not really part of the Internet. However, users
on these networks still may have some access to the Internet: Gateways
are established to allow for inter-network communications. Gateways are
computers that physically link the various networks and translate the
protocols among them. Gateways are usually not noticeable the people who
use them.
Once on the Internet, you have access to
- all the resources on
your own Internet host,
- on any other Internet host on which you have
an account, and
- on any other Internet host that offers publicly
accessible information.
The Internet gives you the ability to move
information between these hosts via file transfers (generically known as
ftp-ing). Once you are logged into one host, you can use the Internet
to open a connection to another, login, and use its services
interactively (this is known as telnet). In addition, you can send
electronic mail to users at any Internet site and to users on many
non-Internet sites.
There are various other services you can use. For example, some
hosts provide access to specialized databases or to archives of
information. The Internet Resource Guide provides information regarding
some of these sites. The guide is published by the NSF Network Service
Center (NNSC) and is continuously updated. The files which make up the Guide
may be obtained through electronic
mail by sending a message to mailserv@ds.internic.net.
To retrieve
individual files, include the FILE command in the message body. Some
examples are:
file /ftp/resource-guide/overview
file /ftp/resource-guide/chapter.1/section1-14.ps
The LS (or DIR) command may also be used to get a directory listing of
the Guide. Use the following command in the message body:
ls /ftp/resource-guide .
The Resource Guide is also available via anonymous FTP to ds.internic.net.
The /resource-guide/overview file is a good place to start.
Military, governmental, and educational institutions usually have
full access to the network. Recently, commercial vendors have begun
offering connections for companies and persons not affiliated with such
organizations.
If you just have a personal computer and a modem, you can obtain a
list of internet access providers to whom you may connect via modem. The
list is called PDIAL and may be retrieved by sending e-mail to:
info-deli-server@pdial.com
with this message:
Send PDIAL
There is also a web site with a PDIAL
list.
For more introductory information, see the
PDIAL FAQ's (Frequently Asked Questions.
O'Reilly and Associates has a mail server that provides a list of
Internet Access providers who offer dedicated line connections (a
direct link from your computer to the Internet).
Send e-mail to:
dlist@ora.com with an empty message body.
Electronic Mail
By using electronic mail (abbreviated as e-mail), you are able to
send messages or entire files at your convenience to anyone, anywhere on
the network. Just as with postal mail, the receiver need not be present
to receive the message and may in turn read it at his/her convenience.
One great difference between postal and electronic mail is that messages
delivered via the latter often arrive within minutes, regardless of the
distance that must be travelled.
To send a message, you must know the addresses of persons to whom
you wish to send e-mail. E-mail addresses can be quite complex. There
are a number of address directories on the Internet; however, all of
them are far from complete. Generally, it is still necessary to ask the
person for his or her e-mail address.
E-mail on the Internet uses addresses of the form user@location.
user is generally the account name of the mail recipient. location
identifies the name and place of the computer housing the recipient's
account. The '@' merely acts as a separator between these two
identifiers.
Much information regarding who sent the electronic mail can be
gleaned from breaking down location into its one or more components. The
parts of location are separated by '.' (periods). The rightmost part is
always the domain. Networks outside the United States are identified by
two-letter codes: CA for Canada, ES for Spain, TW for Taiwan, UK for the
United Kingdom, and so forth. Generally speaking, networks within the
United States are still identified by the nature of their owners: COM
for commercial companies, EDU for educational institutions, GOV for
government facilities, MIL for military sites, NET for network support
groups, and ORG for other miscellaneous organizations. It is becoming
more common, though, to see the .US domain in addresses for institutions
in the United States.
The next rightmost part of location is the subdomain. The subdomain
usually identifies the organization (company, military base, university)
where the sender is located. Sometimes a second subdomain is added to
distinguish different areas of the same organization.
For more information: How
to Find People's Email Addresses
You see a strange looking address?
There are several networks accessible via e-mail from the Internet,
but many of these networks do not use the same addressing conventions
the Internet does. Often you must route e-mail to these networks
through specific gateways as well, thus further complicating the
address. Messages that go through several gateways often include a
"bang" (!), a percent sign (%), or double colons (::), suffixed to the
person's user id.
Additional reading on electronic mail:
Frey, D. & Adams, R. (1990) A Directory of electronic mail
addressing and networks. Sabastopol, CA: O'Reilly Associates.
A mailing list is really nothing more than a nickname for multiple
users: Electronic mail directed to the list is sent to all those users.
Mailing lists are usually created to discuss specific topics. Anybody
interested in that topic, may (usually) join that list. Some mailing
lists have membership restrictions, others have message content
restrictions, and still others are moderated.
There is a "list-of-lists" file available on the host
sri.com
that lists most of the major mailing lists, describes
their primary topics, and explains how to subscribe to them. The file
is available for anonymous ftp in the netinfo directory under the name
interest-groups (that is, the path is: netinfo/interest-groups). It
can also be obtained via electronic mail.
Send a message to
mail-server@sri.com
with the body of the message reading,
Send interest-groups
(or for a directory of files:
Send directory )
and the file (a large one!) will be returned in moderate size pieces
via electronic mail.
Mailing lists are sometimes called LISTSERV or as electronic
discussion groups. LISTSERV stands for "list server". Originally,
LISTSERV was a mailing-list programmed to make group communication
easier. Similar programs now exist on the Internet, and although they
are not technically the same, most people speak of "listserv lists" as a
generic term.
People with a common interest (e.g. network protocols, issues
related to handicapped people in education, system administration
problems) may join or subscribe (although there is no exchange of money)
to a list. Merely send the following e-mail message to the host computer
(it "owns" the list):
subscribe list-name Your Full Name
where list-name is the name of the desired electronic discussion
group.
Messages sent to the list are automatically distributed to all
subscribers. Any member can reply to the message and this reply will in
turn be distributed to every one else in the group. The electronic
conversation continues as long as anyone is interested in discussing the
subject at hand. Then someone else raises a new issue and the process
begins anew.
Another tool available to Internet users is electronic file transfer
via a facility generically know as FTP (which stands for file transfer
protocol). Network software packages generally include such software in
the distribution. It should be noted, however, that only the "client"
side of the software is generally available. That is, the software to
contact a host computer and subsequently send or receive files. This is
usually sufficient for most users. Your network software documentation
should outline the commands available to you within the ftp facility. If
you wish to set up a host computer for other Internet users to contact
for file transfers, you will have to locate the "server" side of an FTP
software package.
Generic FTP allows authorized users to contact a host machine and
perform file transfers. An authorized user generally already has an
account on the host machine.
Anonymous FTP allows guest users to have restricted access to many
information archives on the network. To allow this wider class of
Internet users to perform file transfers, system administrators
authorize a generic anonymous userid. Users merely enter that name when
prompted for user-id and are allowed to access certain disk directories
on the host computer. At the password prompt, reply with your electronic
mail address.
To access an anonymous ftp site you must know the address of the
site. For example, nic.ddn.mil, is the address of the Network
Information Center of the Dept. of Defense Network. It is generally a
good idea to list the contents of possible directories before
transferring a file via anonymous ftp. Look for files named readme or
index first. Names and locations of directories and files may change
over time. By checking your listing of possible directories and files,
you increase your chances of successfully transferring a file.
The procedure for accessing an anonymous ftp site follows:
- type: ftp nic.ddn.mil
- Connected to nic.ddn.mil
- 220 NIC.DDN.MIL FTP Server Process at Tue 30-Jul-91
- Name (nic.ddn.mil:msmith) : anonymous
- Password (nic.ddn.mil:anonymous) :---------
- 331 ANONYMOUS user ok, send real ident as password.
- 230 User ANONYMOUS logged in at Tue 30-Jul-91
- ftp>
Step by Step Explanation
- at the Unix prompt, user ftp's to nic.ddn.mil
- user is connected to ftp site
- site notes the time of user login
- user login as anonymous
- user types address (e.g., bjp@sura.net) which does not appear
- system acknowledges user login and notes time
- ftp prompt
To transfer a file, you may have to change directories to the
directory that your file is located in. It is a good idea to list the
contents of the directory before attempting to transfer a file.
Unix Commands for Retrieving Files
The ls (or dir ) command lists the contents of the active
directory.
The
cd directory name command enables the user to change directories (move
up or down the hierarchy)
The cd .. command allows the user to return
to the previous directory.
To transfer a document:
get filename
To view the document while still connected to the ftp site:
get filename |more
For more information: FTP
FAQ List
There are literally thousands of interesting programs, datasets, and
documents freely available on the Internet. Two tools to aid in the
location of such interesting information are archie and wais.
Archie
The archie service is a collection of resource discovery tools that
together provide an electronic directory service for locating
information in an Internet environment. Originally created to track the
contents of anonymous ftp archive sites, the archie service is now being
expanded to include a variety of other online directories and resource
listings.
Users can access an archie server either through interactive
sessions (provided they have a direct Internet connection) or through
queries sent via electronic mail messages (provided they can at least
gateway electronic mail messages onto the Internet).
Interactive access to archie may be through a conventional telnet
session to a machine running an archie server or through an archie
client.
The archie server automatically updates the listing information from
each site about once a month, ensuring users that the information they
receive is reasonably timely.
Users with direct Internet connectivity can try out an interactive
archie server using the basic "telnet" command (available at most
sites). To try this, telnet to the host archie.mcgill.ca and login as
user archie (no password is needed). A banner message giving latest
developments and information on the archie project will be displayed and
then the command prompt will appear. First-time users should try the
help command to get started.
Users with only email connectivity to the Internet should send a
message to archie@archie.mcgill.ca, with the single word help in
either the subject line or body of the message. You should receive back
an email message explaining how to use the email archie server, along
with details of an email based ftp server operated by Digital Equipment
Corporation that will perform ftp transfers through email requests.
Documentation for the archie system is still limited, but what there
is is also available for anonymous ftp from the same host
(archie.mcgill.ca) under the directory bunyip-docs/archie.whatis.
Wais
The second information tool, WAIS (Wide Area Information Servers),
not only locates files based upon their names (as archie does), but also
by knowing what is in the files. Wide Area Information Servers,
pronounced "wayz," allow you to search through archives of files for
which indexes exist. The program searches the text itself, not just the
file name, for your desired term. The results are then presented in a
weighted list, with those judged "best" listed first, and others listed
in decreasing order of relevance. You may then select items you want to
see in full, and the system will retrieve them for you.
The major drawback is that WAIS can search only those documents that
have been specially formatted to make them compatible to the WAIS
software. Unfortunately, this includes only a small percentage of the
files available on the Internet.
How Can I Find Out More About Wais? There is a mailing list that has
weekly postings on progress and new releases; to subscribe send an
email note to
wais-discussion-request@think.comwith the word 'help' in
the body of the message.
For more information:
WAIS
FAQ's
A basic Internet service is the provision of interactive login to a
remote host. Telnet is both a protocol and a program that enables you to
do so. It is the standard TCP/IP remote login protocol. You must know
the Internet (IP) address of the remote host computer before you can
initiate a session with telnet. Many remote hosts require you to have an
account to log in (You must have a user id and a password). However,
some remote hosts do not require that users have accounts (as in the
archie example above). Users can log in with a general user id such as
info or guest (or some other word that is published in guides to the
Internet). Passwords are usually not required.
UseNet is the formal name, and Netnews a common informal name, for a
distributed computer information service that some hosts on the Internet
use. UseNet handles only newsgroups and not mail. It uses a variety of
underlying networks for transport. Netnews can be a valuable tool to
economically transport traffic that would otherwise be sent via mail.
UseNet has no central administration.
Hundreds of forums, called newsgroups, can be found on UseNet. They
differ from LISTSERV mailing lists in that each article is posted only
once, for all to read and reply to, rather than having separate messages
sent individually to each reader. It functions like an electronic
bulletin board.
It should be kept in mind that UseNet is a very different entity
from the Internet. The Internet is a wide-ranging network but it is only
one of the various networks carrying UseNet traffic.
To help hold Usenet together, various articles are periodically
posted in newsgroups in the news hierarchy. It is recommended that new
users subscribe to and read the group news.announce.newusers since it
will help to become oriented to UseNet and the Internet.
The following sites have sources to the current newsreader software:
Site Contact
---- -------
osu-cis postmaster@cis.ohio-state.edu
philabs usenet@philabs.philips.com
pyramid usenet@pyramid.com
rutgers usenet@rutgers.edu
tektronix news@tektronix.tek.com
watmath usenet@watmath.waterloo.edu
uunet info@uunet.uu.net
The archie service can be used to locate ftp archives containing
various news software packages.
What is the "FAQ" list?
This list provides answers to "Frequently Asked Questions" that
often appear on various USENET newsgroups. The list is posted
every four to six weeks to the news.announce.newusers group. It
is intended to provide a background for new users learning how to
use the news. The FAQ list provides new users with the answers to
such questions, helping to keep the newsgroups themselves
comparatively free of repetition. Often specific newsgroups will
have and frequently post versions of a FAQ list that are specific
to their topics. These FAQs are themselves treasures of information
on a vast array of topics.
For more information:
A FAQ about FAQ's
News.Newusers.questions
What
is Usenet
Usenet
FAQ
Gopher
With the Internet Gopher you can easily access publicly available
information stored on many of these connected computers. Gopher
combines features of electronic bulletin board services and databases,
allowing you to either browse a hierarchy of documents, or search for
documents that contain certain words or phrases.
You can use gopher to search thousands of sites and millions of
documents without knowing a single Internet address, file name, or
locally idiosyncratic command. You just sniff around "Gopher Space"
like you would browse your local library, using structured menus to
guide you along the way.
Gopher systems are multiplying rapidly, and becoming more
sophisticated. And since they all connect to one another, as well as to
various WAIS and Archie servers, you can explore virtually any part of
the net using a single, relatively simple, interface.
The Internet Gopher software was conceived at the Computer and
Information Services department of the University of Minnesota. Software
developed at the University of Minnesota is freely distributable for
non-commercial purposes. The Internet Gopher works on a number of
platforms and operating systems.
Gopher supports a diverse range of data, all of which can be
accessed by a simple keystroke or click of the mouse. The most basic
information type in gopher is a directory. A directory is simply a list
of documents. Directories allow easy browsing of information since
items can be organized into specific areas.
There are several different special types of directories also. One
special type of directory called a link allows gopher to reference other
gopher directories on a different computer. This allows gopher to
traverse a hierarchy of information residing on multiple machines across
the Internet. These links are transparent: you won't even notice that
you're connecting to another machine.
Gopher also provides for a special kind of directory which allows
document searching. You can specify any number of keywords to a search
item. Only those documents that match the given criteria will show up in
the resulting list.
In addition to directories, items can be "documents". Usually
documents are plain text files. Other types of documents allow for
public telnet connection, "phone book" databases that allow you to
search, and multimedia file formats including images, audio and video
file formats. Images such as weather maps are available. Audio data,
including the presidential debates is available. Movies in Quicktime and
MPEG formats are available. Some Gopher servers will allow you to view
documents in formats other than plain text. Postscript is one of the
most popular formats.
When using Gopher, looking for information located in other
continents is as easy as looking for information residing on a computer
in the next room.
You may want to follow the paths in Gopher from level to level until
you find the data you're looking for (browsing). From the first level,
you can choose a topic, which leads to another level, and another, until
you finally come to an item that looks interesting.
Alternatively, you may want Gopher to do the work for you by using a
search item. For instance, you could select a search item called Search
Recipes. A message prompts you to type in the words you're looking for;
you type salmon. The server searches the text of a collection of items
and lists the ones that have the word salmon in them. You can then
examine these items until you find one that contains a recipe that
strikes your fancy.
If you merely wish to find interesting Internet materials using
gopher, you need only a gopher client program. Most of the software for
the Internet Gopher is available on the machine boombox.micro.umn.edu.
You may retrieve the software using anonymous FTP. Just look within the
/pub/gopher directory for the subdirectory that refers to your specific
computer type. Alternatively, if you cannot locate a gopher client, you
may telnet to any of the public gopher sites listed below and login
using the referenced special user account. Either way, you can be
quickly on your way to gopherland!!
Public Gopher Sites
Hostname Login name
consultant.micro.umn.edu gopher
gopher.uiuc.edu gopher
panda.uiowa.edu panda
For more information: Gopher
FAQ
Veronica
Closely linked to gopher is veronica (Very Easy Rodent-Oriente d
Net-wide Index to Computerized Archives).
Veronica evolved as a solution to the problem of resource discovery
in the rapidly expanding gopher world. Veronica offers a keyword search
of most gopher server menu titles in the entire gopher web. As archie
is to ftp archives, veronica is to gopherspace. A veronica search
produces a menu of gopher items, each of which is a direct pointer to a
gopher data source. Because veronica is accessed through a gopher
client, it is easy to use, and gives access to all types of data
supported by the gopher protocol.
If your local gopher server does not already have a link to
veronica, use gopher to go to the server at gopher.micro.umn.edu. Choose
the menu item "Other Gopher and Information Servers". Choose veronica
from that menu.
GopherMail
In addition to Veronica, GopherMail is a gopher client that uses
electronic mail to interact with the user. It is pretty clumsy to use,
but it works. Messages containing menus and gopher link information are
mailed to users in response to their requests. Users reply to these
messages and indicate which menu items they want. It lets people use
Gopher without requiring them to have an account directly on the
Internet, because it communicates through email messages instead of
direct "live" network connections. Thanks to the GopherMail program,
most of the resources of Gopher are now available to everyone with
email-only access to the Internet.
You can get started by sending mail to gophermail@ncc.go.jp with any
or no subject and any or no message body. GopherMail will reply by
sending you it's main gopher menu. You then use your email program to
reply to that message, including it in the text of your reply. Mark
which menu options you want to follow-up by putting an X anywhere near
the beginning of the line, before the menu numbers for those options.
From there you can just keep repeating the process, sending replies
back to gopher with the desired items marked with an X.
As technology continues to develop in the area of information
navigation, presentation and retrieval on the Internet, the information
servers present on the Internet continue to refine their goals and
implementation. Recent months have given rise to three important,
interlocking developments on the Internet. These are the World-Wide
Web, HyperText Markup Language, and the Mosaic user interface.
The WWW project, started and driven by CERN (the European Laboratory
for Particle Physics), seeks to build a distributed hypermedia system.
The World-Wide Web is a client/server system, similar to the Gopher
information system, for providing integrated text, graphics, sound, and
video over the Internet. It has the capability, through the use of
HyperText Markup Language (HTML), to link files to one another in a
hypertextual structure.
HTML is a specific document definition specified in Standard
Generalized Markup Language (SGML). The chief power of HTML comes from
its ability to link regions of text and images to another document,
document section, sound or image. These links can also offer an
interactive search of a set of documents.
To access the web, you run a browser program. The browser reads
documents, and can fetch documents from other sources. Information
providers set up hypermedia servers from which browsers can get
documents.
These browsers can also access files by FTP, gopher and an ever
increasing range of other methods. The browsers will also permit
searches of documents and databases.
The documents that the browsers display are hypertext documents
(text with pointers to other text). The browsers let you deal with the
pointers in a transparent way -- select the pointer, and you are
presented with the text that is pointed to.
Hypermedia is a superset of hypertext -- it is any medium with
pointers to other media. This means that browsers might display not
simply text files, but images or sound or animations.
WWW software use URLs to locate information. URL stands for "Uniform
Resource Locator". It is a standard for specifying an object on the
Internet, such as a file, picture, video clip, or newsgroup.
URLs look like this:
file://wuarchive.wustl.edu/mirrors/msdos/graphics/gifkit.zip
file://wuarchive.wustl.edu/mirrors
http://info.cern.ch:80/default.html
news:alt.hypertext
telnet://dra.com
The first part of the URL before the colon, specifies the access
method. The part of the URL after the colon is interpreted specific to
the access method. In general, two slashes after the colon indicate a
machine name.
You have two options to access the world-wide web. Either use a
browser that can be telnetted to, or use a browser on your machine.
While WWW, gopher, and Wais all are client-server based, they differ
in terms of their model of data. In gopher, data is either a menu, a
document, an index or a telnet connection. In WAIS, everything is an
index and everything that is returned from the index is a document. In
WWW, everything is a (possibly) hypertext document which may be
searchable.
In practice, this means that WWW can represent the gopher (a menu is
a list of links, a gopher document is a hypertext document without
links, searches are the same, telnet sessions are the same) and WAIS (a
WAIS index is a searchable page, returning a document with no links)
data models as well as providing extra functionality.
Currently accessible through the web:
- anything served through gopher
- anything served through WAIS
- anything on an FTP site
- anything on Usenet
- anything accessible through telnet
- anything in hytelnet
- anything in hyper-g
- anything in techinfo
- anything in texinfo
- anything in the form of man pages
- sundry hypertext documents
One of the few limitations of the current networked information
systems is that there is no simple way to find out what has changed,
what is new, or even what is out there. As a result, a definitive list
of the web's contents is impossible at this moment.
For more information: The World Wide Web
FAQ
Mosaic (The Web via a mouse-based browser)
Building on the Web's initial structures, NCSA Mosaic uses a
client/server model for information distribution. A server sits on a
machine at an Internet site answering queries sent by NCSA Mosaic
clients, which may be located anywhere on the Internet. To users, the
client looks like any other application on their machine, only this one
has immediate access to information all over the world. The pieces of
information sent from servers to clients are known simply as documents,
which may contain plain text, formatted text, graphics, sound, and other
multimedia data.
The NCSA Mosaic system is an integrated set of browsers, viewers,
servers, gateways, and filters that allow you to approach the Internet
as one consistent information source. At the simplest level, the NCSA
Mosaic clients provide navigation and document viewing capabilities for
browsing the information universe of the Internet. As your interest,
needs, and skills develop, the configurable addition of external viewers
allows easy and straightforward expansion to handle virtually any
specific type of data. This flexibility allows you to keep pace with the
quickly evolving world of multimedia.
The NCSA Mosaic system is a completely open framework; you can enter
at your most comfortable level, progress through the use of various
viewers, learn to create your own hypertext documents, set up your own
server, engage in multimedia collaboration with colleagues in distant
locales, develop scripts for specialized information filtering and
presentation, and develop gateways to unique information resources and
integrate them into the NCSA Mosaic information space for other users.
In short, the possibilities are endless; you can think of NCSA Mosaic
and the Web as allowing the progressive customization of your own
information space, and that of your group.
NCSA Mosaic is implemented for three types of platforms:
- X Window System
- Macintosh
- Microsoft Windows
Each implementation takes advantage of the strengths of its
respective platform, but they have all been implemented to preserve as
much cross-system compatibility as possible. All three clients can be
downloaded for personal use from NCSA's anonymous FTP server.
To use NCSA Mosaic as an Internet browser, the system on which it is
installed must be fully connected to the Internet. This is because the
first thing NCSA Mosaic usually does when you run it is download one of
the NCSA Mosaic Home Pages (the startup document) from an NCSA server.
If NCSA Mosaic executes but you get an error message instead of a home
page, the problem is most likely in your Internet connection; you will
have to get that connection established before you can use NCSA Mosaic
to navigate the Internet.
Once you have successfully installed NCSA Mosaic, you should spend
some time becoming familiar with the interface. The NCSA Mosaic Demo
Document, available via a hyperlink in the NCSA Mosaic Home Page or via
a menu selection, provides an overview of NCSA Mosaic's capabilities
with hyperlinks to a wide variety of information sources. As you browse
the information universe, you can access material through other menu
selections and other home pages.
Lynx (The Web via a text-based browser)
Lynx is a general-purpose distributed information browser and is
part of the World Wide Web project. Lynx was designed to support a
Campus Wide Information System (CWIS), but can be used for many other
applications.
When started, Lynx is given a local path specifying a file
containing text to be displayed, or a Uniform Resource Locator (URL)
that specifies a resource to be displayed (usually the name of a file
containing text information), the type of server that will provide the
resource, and the Internet address of the system on which the specified
server is running.
When a hypertext document is being displayed, links appear different
from standard text, and users press the up- or down-arrows to "select" a
particular link. Selected links show up as highlighted text, and users
press Return or the right arrow when a link is highlighted to "follow"
the selected link. When the link is followed Lynx finds the associated
file and displays it on the screen in place of the first file.
Lynx is available for most flavors of Unix and VMS. Lynx can
currently be obtained by anonymous ftp from: ftp2.cc.ukans.edu in the
pub/lynx directory.
A listserv list has been created for information about Lynx and for
notification of updates.
You may sign-up by sending a message with
subscribe lynx-dev your name
as the only line in the message to
listserv@ukanaix.cc.ukans.edu.
Miscellaneous Notes
In many electronic mail messages, it is sometimes useful to indicate
that part of a message is meant in jest. It is also sometimes useful to
communicate emotion which simple words do not readily convey. To
provide these nuances, a collection of "smiley faces" has evolved. If
you turn your head sideways to the left, :-) appears as a smiling
face. Some of the more common faces are:
:-) smile :-( frown
:) also a smile ;-) wink
:-D laughing 8-) wide-eyed
:-} grin :-X close mouthed
:- smirk :-o oh, no!
What do "btw", "fyi", "imho", "wrt", and "rtfm" mean?
Often common expressions are abbreviated in informal network
postings. These abbreviations stand for "by the way", "for your
information", "in my humble [or honest opinion", "with respect to", and
"read the fine manual".
As with any profession, computers have a particular terminology
all their own. Below is a condensed glossary to assist in making
some sense of the Internet world.
address
There are two separate uses of this term in internet
networking: "electronic mail address" and "internet
address". An electronic mail address is the string
of characters that you must give an electronic mail
program to direct a message to a particular person.
See "internet address" for its definition.
ARPA
Advanced Research Projects Agency
The former name of what is now called DARPA.
ARPANET
Advanced Research Projects Agency Network
A pioneering long haul network funded by ARPA. It
served as the basis for early networking research as
well as a central backbone during the development of
the Internet. The ARPANET consisted of individual
packet switching computers interconnected by leased lines.
BITNET: Because It's Time Network
BITNET has about 2,500 host computers, primarily at universities, in many countries. It is managed by EDUCOM, which provides administrative support and information services. There are three main constituents of the network: BITNET in the United States and Mexico, NETNORTH in Canada, and EARN in Europe. There are also AsiaNet, in Japan, and connections in South America. See CREN.
CREN
The Corporation for Research and Educational Networking
BITNET and CSNET have recently merged to form CREN.
CSNET
Computer + Science Network
A large data communications network for institutions doing
research in computer science. It uses several different
protocols including some of its own. CSNET sites include
universities, research laboratories, and commercial
companies. See CREN.
DARPA
U.S. Department of Defense Advanced Research Projects Agency
The government agency that funded the ARPANET and later
started the Internet.
DNS
The Domain Name System is a mechanism used in
the Internet for translating names of host computers
into addresses. The DNS also allows host computers
not directly on the Internet to have registered
names in the same style, but returns the electronic
mail gateway which accesses the non-Internet network
instead of an IP address.
dot address: (dotted address notation)
Dot address refers to the common notation for Internet
addresses of the form A.B.C.D; where each letter represents,
in decimal, one byte of the four byte IP address.
EARN
European Academic Research Network
Ethernet
A network standard for the hardware and data link levels.
There are two types of Ethernet: Digital/Intel/Xerox (DIX)
and IEEE 802.3.
FDDI
Fiber Distributed Data Interface
FDDI is a high-speed (100Mb) token ring LAN.
FTP: File Transfer Protocol
The Internet standard high-level protocol for
transferring files from one computer to another.
host number
The part of an internet address that designates which
node on the (sub)network is being addressed.
internet
internetwork
Any connection of two or more local or wide-area networks.
Internet
The global collection of interconnected local, mid-level and
wide-area networks which use IP as the network layer
protocol.
internet address
An assigned number which identifies a host in an internet.
It has two or three parts: network number, optional subnet
number, and host number.
IP: Internet Protocol
The network layer protocol for the Internet. It is a packet
switching, datagram protocol.
network number
The part of an internet address which designates the
network to which the addressed node belongs.
NSFNET: National Science Foundation Network
The NSFNET is a highspeed "network of networks" which is
hierarchical in nature. At the highest level is a
backbone network currently comprising 16 nodes connected
to a 45Mbps facility which spans the continental United
States. Attached to that are mid-level networks and
attached to the mid-levels are campus and local
networks. NSFNET also has connections out of the U.S.
to Canada, Mexico, Europe, and the Pacific Rim. The
NSFNET is part of the Internet.
packet
The unit of data sent across a packet switching network.
The term is used loosely. While some Internet
literature uses it to refer specifically to data sent
across a physical network, other literature views
the Internet as a packet switching network
and describes IP datagrams as packets.
PPP: Point-to-Point Protocol
The Point-to-Point Protocol (PPP) provides a method for
transmitting datagrams over serial point-to-point links (e.g.,
phone lines).
protocol
A formal description of message formats and the rules
two computers must follow to exchange those messages.
router
A special-purpose dedicated computer that attaches to
two or more networks and routes packets from one
network to the other. In particular, an Internet
router forwards IP datagrams among the networks it
connects. Gateways route packets to other
gateways until they can be delivered to the final
destination directly across one physical network.
subnet
A portion of a network, which may be a physically independent
network, which shares a network address with other portions
of the network and is distinguished by a subnet number. A
subnet is to a network what a network is to an internet.
T1
A term for a digital carrier facility used to transmit a
DS-1 formatted digital signal at 1.544 megabits per second.
T3
A term for a digital carrier facility used to transmit a DS-3
formatted digital signal at 44.746 megabits per second.
TCP: Transmission Control Protocol
A transport layer protocol for the Internet. It is a
connection oriented, stream protocol.
TCP/IP: Transmission Control Protocol/Internet Protocol
This is a common shorthand which refers to the suite
of application and transport protocols which run over IP.
telnet
The Internet standard protocol for remote terminal
connection service. TELNET allows a user at one site
to interact with a remote timesharing system at
another site as if the user's terminal was connected
directly to the remote computer.
UNIX
An operating system developed by Bell Laboratories that
supports multiuser and multitasking operations.
UUCP: UNIX-to-UNIX Copy Program
A protocol used for communication between consenting
UNIX systems.
WHOIS
An Internet program which allows users to query a database of
people and other Internet entities, such as domains,
networks, and hosts, kept at the DDN NIC. The information for
people shows a person's company name, address, phone number
and email address.
A: Network Guides -- Books
This is a highly selective list. Books on "How to Use the
Information Highway" are multiplying rapidly, and the end is not in
sight. Each title listed below has been cited by at least one reviewer
as among the best now available.
Aboba, Bernard
1994 The Online User's Encyclopedia: Bulletin Boards and
Beyond. Addison-Wesley: Reading, MA.
Praised by Jim Milles as "a remarkably useful and
comprehensive work, covering everything from the basics of
computer communication, to using local bulletin boards, to the
intricacies of the Internet."
Dern, Daniel P.
1994 The Internet Guide for New Users. McGraw-Hill: NY.
Generally regarded as one of the best books available.
Includes a good survey of the history of the Internet, as well
as just enough Unix to get you by. "For all Internauts," says
Jean Armour Polly.
Hahn, Harley, and Rick Stout
1994 The Internet Complete Reference. Osborne/McGraw-
Hill: Berkeley.
Widely acclaimed as the best book written about the Internet
so far. Witty, complete (800+ pages), easy to read.
Kehoe, Brendan P.
1994 Zen and the Art of the Internet: A Beginner's Guide
to the Internet, 3rd edition. Prentice-Hall:
Englewood Cliffs, NJ.
One of the earliest and most popular guides. Short, but
packed with information.
NOTE: An electronic version of the first edition is widely
available from many network sites.
Kochmer, Jonathan, and NorthWestNet
1993 The Internet Passport: NorthWestNet's Guide to Our
World Online. NorthWestNet: 15400 SE 30th Place,
Suite 202, Bellvue, WA, 98007.
[E-mail: passport@nwnet.net]
One of the most complete guides, covering areas not included
in others. Strong in K-12 computing and supercomputing.
Krol, Ed
1992 The Whole Internet User's Guide & Catalog. O'Reilly
& Associates: 103 Morris Street, Suite A, Sebastopol
CA, 95472. [E-mail: nuts@ora.com]
Another of the early guides. Although many of the resources
are out of date, it is still essential reading.
LaQuey, Tracey, and Jeanne C. Ryder
1993 The Internet Companion: A Beginner's Guide to Global
Networking. Addison-Wesley: Reading, MA.
Milles considers this the best guide for the beginner, or even
the "pre-beginner" who has not yet signed on to the Internet.
Tennant, Roy, John Ober, and Anne G. Lipow
1992 Crossing the Internet Threshold. Library Solutions
Institute, 2137 Oregon St., Berkeley CA 94705.
Intended as a teaching manual, it contains excellent "real
world" examples and handy one-page fact sheets.
B: Network Guides -- FTP-able Documents
Electronic Internet guides are multiplying almost as rapidly as
printed manuals. The dilemma, of course, is that you cannot fetch a
guide on "how to use the 'Net" until you have learned at least a little
about how to use the 'Net.
The full filenames are given, in the format
/directory/sub-directory(ies)/filename
You can get the file by using the full path name (including the
slashes) or you can change directories one level at a time and browse
along the way. If you see a related README file you should get and read
it before you try to get the full file.
Note that almost all of these sites contain other documents. You
may find the same items in several places. But not all will be the same
versions, so remember check the date of anything you fetch.
The dates given are the latest of which I am aware. Updates could
appear at any time.
Ciolek, T. Matthew. Internet Voyager: Social Scientist's Guidebook
to AARNET/INTERNET Online Information Services. (March 1993)
FTP: coombs.anu.edu.au -- 150.203.76.2
FILES: /coombspapers/coombsarchives/coombs-computing/
internet-voyager-inf/
internet-voyager-1-2.txt
internet-voyager-2-2.txt
December, John. Information Sources: The Internet and Computer-
Mediated Communication. (January 1994)
FTP: ftp.rpi.edu -- 128.113.1.5
FILE: /pub/communications/internet-cmc
Gaffin, Adam. Big Dummy's Guide to the Internet. (December 1993)
FTP: ftp.eff.org -- 192.77.172.4
FILE: /pub/Net_info/Big_Dummy/bigdummy.txt
Hancock, Lee. Internet/Bitnet Health Sciences Resources.
(September 1993, updated 4-5 time a year)
FTP: ftp.sura.net -- 128.167.254.179
FILE: /pub/nic/medical.resources.xx-xx
where xx-xx = latest release date
Harris, Judi, and the students of TEB 8000. Internet Resources
Directory, Part 3: File Archives (FTP Sites) of Interest to
Educators. (August 1993)
FTP: tcet.unt.edu -- 129.120.20.191
FILE: /pub/telecomputing-info/IRD/IRD-infusion-ideas.txt
Maas, Robert Elton. MaasInfo.TopIndex: Toplevel Index to All
Major InterNet Indexes. (April 1993)
FTP: niord.shsu.edu -- 192.92.115.8
FILES: /maasinfo/MaasInfo.TopIndex_1of2
/maasinfo/MaasInfo.TopIndex_2of2
Martin, Jerry. There's Gold in them thar Networks!, or,
Searching for Treasure in all the Wrong Places, (Network
Working Group RFC 1402). (January 1993)
FTP: nis.nsf.net -- 35.1.1.48
FILE: /documents/rfc/rfc1402.txt
Strangelove, Michael. The Electric Mystic's Guide to the
Internet: A Complete Directory of Networked Electronic
Documents, Online Conferences, Serials, Software, and Archives
Relevant to Religious Studies. (February 1993)
FTP: panda1.uottawa.ca -- 137.122.6.16
FILE: /pub/religion/electric-mystics-guide-v1.txt
NOTE: Volume 2 has not yet appeared
NOTE: Also via listserv@uottawa as MYSTICS V1-TXT
SURAnet. Information Available on the Internet: A Guide to
Selected Sources. (Updated weekly; posted 9 am Mondays).
FTP: ftp.sura.net -- 128.167.254.179
FILE: /pub/nic/infoguide.<.txt
NOTE: Before you attempt to retrieve the guide, get and
read the file 00-README FIRST.
JSE Homepage | Subscription
Information | Current Issue | JSE Archive (1993-1998) | Data Archive | Index | Search
JSE | JSE Information Service | Editorial Board | Information for Authors | Contact JSE | ASA Publications
Copyright © 1999 American Statistical Association. All rights reserved.
|