The most commonly used Internet service nowadays is World Wide Web (WWW). It is the single most commonly used Internet service and creates a very large fraction of the whole Internet traffic. WWW is an example of client-server model of communications using TCP/IP. When user wants to read a web page, he/she makes a connection to the web server using HTTP protocol which runs on top of TCP/IP protocol. The web browser makes a connection to web server using TCP/IP, sends HTTP protocol request to web server, receives the data from the web server, closes the connection and displays the received document on the computer screen.

If the web page consists of multiple object (pictures, sounds, video, Java applets), then web browser makes one HTTP request to the web server for each object it needs to receive. Depending on the web server and browser configuration, the system can either make a new TCP connection for each object or use one TCP connection to receive more than one object. In typical case a web browser has 1-6 TCP connections active in parallel when receiving the document objects from the server. When those objects have been received, those TCP connections are closed immediately or after a specified timeout time.

The Hypertext Markup Language (HTML) is a simple markup language used to create web pages. A typical web page consts of one HTML document, that includes the most of the information on the web page, and includes links to the other parts needed to complete the plage (pictures, JavScript code on separate file, Java applets etc..).

Foreword on web publishing

The World Wide Web is a multimedia hypertext informationretrieval system that sits on top of Internet.Publishing on the Web is very different from older methods of publication. A Web publication is inherently a general, device-independent and program-independent document with structural markup. The presentation of a document may vary greatly to allow viewingthe same document on a wide variety of devices, ranging fromsmall mobile phone screen to full-size movie screens.T

he HTML language was designed to promote worldwide distribution ofdocuments in a device-independent form. HTML file consists of contentand it's structure all stored in one file in a standardized form.HTML is far from being perfect for the purpose, but it has served well and is suitable for a wide range of documents. It is easy to learn and easy to use. Practically any computer literate people can put the documents onto theWeb in a few hours, after an initial education of a day or two.

One reason for putting up this web page is to provide people a place to find the information on web publishing from one place and learn the most necessary skills in few days.

Some people (mostly from traditional publishing industries) think that Web authors should decide the physical appearanceof documents like font, color, layout, and other presentation features. For such reasons HTML is implemented usually with nonstandard extensions(some of them though later standardized) to control the dicumentfeatures like colors, fonts and font sizes. This kind of way of making web pages breaks the whole basic idea that HTML should be viewable with practically any kind of device. Many advertised "HTML programing skills" are quite much bluffand breaking the whole idea of HTML.

Journalists may say that presentation issues cannot be distinguished from structure and content, so presentation must be designed for each concrete publication and issue separately. Layout will not lose its importance but it will take more and moreplace on users' systems and the users have their own preferences on layout and style (colors and margins and fonts and so on). In such case layout by the author or by the publishing side will not get through as wanted and an attempt to enforce it might fail miserably. When the presentation fails,the document will look like a mess and the user will therefore discard it. If exact presentation always in the same look and layout is essential to the document, it is usually be better to publish suchdocuments but by using other methods than HTML (for example Adobe Acrobat PDF format is very suitable format for publishingsuch documents on the web).

Linking turns texts to hypertext, and hypertextuality is among the keyfactors that make the World-Wide Web a web of interconnected things.Using links on your Web pages, you can conveniently let your readersfind background information, technical details,definitions of terms, etc., on your pages or somewhere else.A link is a just a pointer or reference, it does not do anything by itself. A URL is rather like a telephone number or a street address which just tells how to reach a document on the web.

The general rule proposed for linking is that one may freely set up non-framed HREF links to the web sites of others, is a rather reassuring rule since it happens to comport well with common practice and with common sense. Webmaster should be prepared for the possibility that members of the public may set up bookmarks to subpages, and that other HTML authors may set up links to subpages. Since this sort of bookmarking and linking can and will happen, the webmaster should be courteous to those visitors and HTML authors. The webmaster, upon moving a page, should have the courtesy to supply a "forwarding" page that lets the visitor know the new page URL.

It is very good idea to have linked on the pages recognized as links so that users have no difficulty in noticing which word is a link. It belongs to the very basic skills needed in Web browsing to be able to recognize a link as displayed on one's browser and to follow a link. Any attemt to hide the link or change the apprearance what tha user had used to see on their browser will usually lead to more harms to usability than good to the presentation. When you do not show clear what is link and what is not, the browsing becomes guesswork, trial and error.

Speed and usablity are also necessary factors to consider. Most of the web is is too slow nowadays. The slow operation comes from the available bandwidth to access the site, amount of data to be transferred and how the materialis put to the pages. The bandwidth available is controlled by the network connection your web server has to Internet and the speed of the connection your client has and the load of the core network in between. This speed hardto control by the web master in any other way than selecting good connection from a good networking company or using a good web hotel service.

Amount of transferred data and how it is presented can be controlled by proper web design. You can reduce the amount of transferred data by making good HTML code (some HTML tools produce few times bigger HTML files than what is needed to present the document), selecting rightfile formats and conversion parameters for pictures and other documents and avoiding using unnecessary large pictures. Sometimes the loading speed seen by user is also affected by the web browser rending speed and way they do it(for example some versions of Netscape draw tables only when it has get all it's cells completely which sometimes can takelots of time). With good network connection, powerful enough web server and good web design the web page accesses can be made quick to operate. When designing a public web site, keep always in mind those uses that have slower connection than you. It is a good idea to try the web pages through a slow "average user" connection before publishing the web page to see that it is useable for them as well. A fancy filled with graphics and animation web page might look nice when demonstrated on corporate LAN environment by the web page maker, but might turn to be so slow to load that the intended users do not want to wait for your page to load (this causes your intended service/business to fail).

When making a new large web site it is a good idea to design the sites tructure well so that you don't have to redesign it many time over and over. Usually this means designing the menus and directory structures for the whole site. When you get this ready don't hurry in putting it online.Publishing just a site skeleton with menus and subpages without realcontent in the end just frustrates the users who come by to your site. It is better to first make a small working site with little material and then enhance it later than publishing a large page structure without any content. It is better to promise what is coming later on the main page or subpage than making links to pages which just say"under construction" or "coming soon" or "page not found". Users of such site just get frustrated because you seemed to promise than your site has some interresting material by putting the link visible, but failed to keep that promise. Users of the web pages are dissappointed when they encounter many this kind of thing on your page and go to some other site and propably never come back.

Be also careful to fullfill your promises you have made and not make to many promises. Promises that some nice things are coming usually are not believed by the users, unless you make them believble that thay may come. The fact of the life is that most of the things promised to "come soon" usually don'tcome anytime soon on-line if ever come on-line. A web site with lots of pages with just "coming soon" in them is just makes many dissapointed users who don't come back to your site later to check out of your promises of new material is true or not (because on most those promises are not kept).

If you are running a site that you expect users to constantly come back, then keep the site running and not not temporarily close it. For a company to which online presence is important, it does not make sense to close the web pages for weeks the reason that pages are being updated! It does not make sense. Generally when you see a web page needs to be updated a lot (more than can be reasably done "on the fly") then the right procedure to do that is this: First start designing the new site but still keeping the old site running that time. Develop and put the new page version on separate server (or different directory on same server) during the time of development. When you are completely sure that new pages are ready to put on-line, replace the old pages with the new ones (change the server, change the directory from where pages are served or upload the new page version to server to replace the old one). In this way your service keeps running all the sime and the users are happy. There is no reason to close a web site for a long time because of web pages being updated. Thing can be done so smoothly that the service interruptions when moving from old pages to new pages can be kept from seconds to few hours with just a little but thinking how to do it.

Here are some general rules on web publishing:

1. The pages you publish must serve a purpose. There's no point in having a page if it has nothing useful to offer.
2. Be economical with graphics. Many users have limited bandwidth and don't want to wait their pages to load for too long.
3. Always use the alt field with images so that people who can't or don't wnat to see the grpahcis can still use your page.
4. Use GIF or JPEG file formats appropriately for the application they are good for.
5. Keep in mind that readership is international. Always write the date in words because numerical formats are interpreted differently in different countries. Be aware of words which belong to a local dialect.
6. Keep your page browser-independent. The last thing you want to do is turn away potential readers merely because they have the "wrong" software.
7. Support graceful degradation. Graceful degradation as a means to provide for backward compatibility is built into the W3C's recommendations. Do not try to break it. This means that if you heavily use technologies like JavaScript menus, Java menus, DHTML etc. try to make the pages also usable (possibly with reduced functionality/usability) also without those techniques. Some browsers do not support those scripting techniques and there are many users who disable scripting languages support on their web browser for various reasons (security threads, work slowly on older computers).
8. Write correct HTML. It works better with any browser than incorrect one with errors.
9. It is a good idea to use a footer which gives the document URL, and an email contact address. This practice leaves the reader with no doubt as to the authorship of the document.
10. Publish the web site only when it has content worth to show. If you do not have much ready now, make a small working site and enhance it when you get more material.
11. Do not use too small fonts. Very small fonts are from hard to read to impossible to read depending on the browser and font set used.
12. Avoid using very long URLs. If you plan your document or site address to publish on some paper magazine or such, keep the URLs very short so that people can remeber them and bother them to write it to the browser. People also tend to send URLs sometimes through different electronic media (E-mail, Usenet news), and in those medias the very long URLs or URLs with many very special characters tend to get broken easily on the way from sender to receiver. To keep those users in mind it is good idea to try to keep the URLs shorter than around 70 characters so that the e-mail software does not try to cut them to two different lines.
13. Test your site with many different browsers so that you know that it works.
14. Keep the site up to date. Replace outdated information with new one. Putting a publication date to page is a good idea.
15. Consistency is one of the most powerful usability principles: when things always behave the same, users don't have to worry about what will happen. Instead, they know what will happen based on earlier experience.
16. Cool URLs do not change. If your site is such that people tend to link to it, it is good idea to keep old URLs working even if you move the contents. In many cases sensible server redirects can lead the user to the new place easily and avoid the user to see the "page not found" error. In most cases defining the redirects in the server end is not hard (it is good idea to learn how to do it).

Dead link free Web site
Compatible on all browsers (i.e. Netscape, Internet Explorer, Macintosh)
HTML error free Web site
Proper usage of Meta Tags throughout HTML and content of Web site
Theme of site is accurately recognized using top keywords
Spelling error free throughout site
Fast load time on various connection speeds (i.e. 56K, ISDN, T1)
Search Engine acceptance ready
Easy to navigate and fun to use
Nice look and feel
Check that it is usable with many resolutions (Need for horizontal scrolling should be avoided if possible. It's awkward to do and users hate it.)

General information

The Web was originally developed to enable people throughout the world to communicate with one another. Having a single system that can deal with all languages and cultures has many advantages: when the same protocols are used everywhere, the same software can likewise be used.W3C's Device Independence Activity is working to ensure a seamless Web for all access devices by reorganizing the Mobile Access and Television and the Web Activities into one. Web services are becoming accessible from a wide range of devices from desktop PCs to in-car computers, TV, digital cameras, and cellular phones. W3C is well-positioned to lead development to avoid incompatibility and to achieve single Web authoring.A British physicist working at the CERN Laboratory inSwitzerland, a particle physics lab, constructed hypertexprotocols and a browser-server platform to launch the World WideWeb (WWW) 10 years ago. Tim Berners Lee, thefather of the World Wide Web, developed a programming language(html) that anyone with little or no programming experience coulduse to produce documents to be shared any where in the world.The early browsers of the WWW were text based only. But in1993, college student Marc Andreessen added a simple littletag (<img>) to his Mosaic browser and graphics startedappearing everywhere. (Andreessen later founded a companyknown as Netscape.)The ability to combine text, graphics,sound, and video was born.The Web has truly become commonplace in our lives.So what of the future? Berners Lee is now involved in W3C(the World Wide Web Consortium, www.w3.org) and with softwareengineers, government agencies, academia, and others in developinginteroperable specifications, software tools, and more tolead the WWW into the next generation and beyond. W3Cis actively developing future web technologies likeXML, XHTML and so on.

2K Mediat - web realted documents in Finnish Rate this link
Addressing Schemes - This is (an attempt at) an exhaustive list of URI schemes. We try to list them all, whether they're standard or not. Rate this link
URL Anatomy Tutorial Rate this link
How a Web Page Works Rate this link
How Web Servers and the Internet Work Rate this link
Internet-resurssipalvelu - links to web related documents, pages in Finnish Rate this link
On the Need for "the Samba of Web Browsers" - slow IE-ization of the Web, and what it means for the Unix community Rate this link
sfnet.viestinta.www vastauksia usein esitettyihin kysymyksiin - WWW topics FAQ in Finnish Rate this link
The Web Design Group's Web Authoring FAQ Rate this link
Unconventional Wisdom: Traffic is Overrated - not all traffic makes on-line business Rate this link
W3C Technical Reports and Publications - web technology specifications Rate this link
Web Design Group - web page tips Rate this link
Web Developer.Com Rate this link
Web Hosting Tutorial - If you want your Web Site to be visible to the world, it has to be Hosted on a Web Server. In this tutorial we will teach you what Web Hosting is, and what Web Hosting has to offer. Rate this link
Web Programming Unleashed - This is a full book on-line on WWW development and related information. Rate this link
Webreview.com - site with lots of web related articles Rate this link
WebTools - web site dedicated to webmasters Rate this link
World Wide Web Consortium (W3C) - defines web related specifications Rate this link

Web protocol information

The protocol is the pre-defined way that someone who wants to use a service talks with that service. Every Web server on the Internet conforms to the HTTP protocol, which isthe protocol used for communication between web browser and web server.The most basic form of the protocol understood by an HTTP server involves just one command: GET. If you tell the server to "GET filename", the server will respond by sending you the contents of the named file and then disconnecting.In the original HTTP protocol, all you would have sent was the actual filename like "/" or "/web-server.htm". The protocol was later modified to handle the sending of the complete URL. This has allowed companies that host virtual domains, where many domains live on a single machine, to use one IP address for all of the domains they host. Originally HTTP protocol opened one TCP connection, requested one file/object through it and after the data is received the connection is closed. HTTP procol was later also extended to support transfer of many files through one HTTP session and to upload data to web server (used for example with forms). There are two methods to get data from a web server: GET and POST. HTTP GET is designed so that all information necessary for the interaction is part of the URI, thus promoting URI addressability. With HTTP POST, some information intended to affect change to the resource state may be part of the protocol headers, not in the URI. Whether and how one chooses between GET and POST depends on the format specification and the application context. When HTTP URIS are used for hyperlinks in HTML, SMIL, and SVG, for example, the application determines which method will be used (generally GET). However, for both HTML forms and XForms, the author can choose between GET and POST. Here is an overview of most commonly used methods:

GET Method: Information from a form using the GET method is appended onto the end of the action URI being requested. Your CGI program will receive the encoded form input in the environment variable QUERY_STRING. The GET method is used to ask for a specific document - when you click on a hyperlink, GET is being used. GET should probably be used when a URL access will not change the state of a database.
HEAD method: The HEAD method is used to ask only for information about a document, not for the document itself. HEAD is much faster than GET, as a much smaller amount of data is transferred. It's often used by clients who use caching, to see if the document has changed since it was last accessed. If it was not, then the local copy can be reused, otherwise the updated version must be retrieved with a GET.
POST Method: This method transmits all form input information immediately after the requested URI. Your CGI program will receive the encoded form input on stdin.

There are two version of HTTP protocol nowadays in use. The simpler version 1.0 and more advanced version 1.1. HTTP 1.0 has enough features for web traffic, but it's limitation is that a new connection is created every time a file is transfered and closed when file loaded. A significant difference between HTTP/1.1 and earlier versions of HTTPis that persistent connections are the default behavior of any HTTPconnection. The persistent connection means that more than one file is transferred through the connection and the connection is not closed immediatly after the files for one page are transferred, but it cna be kept open to wait for new request to be sent quickly to the server (the connection is closed after some timeout info more data is to be transffered for some time).

HTTP Request Methods - HTTP/1.0 allows an open-ended set of methods to be used to indicate the purpose of a request. The three most often used methods are GET, HEAD, and POST. Rate this link
URIs, Addressability, and the use of HTTP GET and POST - An important principle of Web architecture is that all important resources be identifiable by URI. The finding discusses the relationship between the URI addressability of a resource and the choice between HTTP GET and POST methods with HTTP URIs. Rate this link
Common Internet File Formats Rate this link
How Web Servers and the Internet Work: How Protocols Work - example using HTTP protocol Rate this link
Hypertext Transfer Protocol -- HTTP/1.1 Rate this link
MIME and BASE64 coding (RFC2045) Rate this link
MIME media types - web server MIME Content-Type to any file it sends according to the file's extension Rate this link
The Original HTTP as defined in 1991 Rate this link

Universal Resource Identifier (URI) information

Uniform Resource Identifiers (URI) provide a simple and extensible means for identifying a resource. This specification of URI syntax and semantics is derived from concepts introduced by the World Wide Web global information initiative, whose use of such objects dates from 1990 and is described in Universal Resource Identifiers in WWW. The term Uniform Resource Locator (URL) refers to the subset of URI that identify resources via a representation of their primary access mechanism (e.g., their network "location"). URL schemes are usually named after protocols.

RFC 2396 Uniform Resource Identifiers (URI) Generic Syntax Rate this link
Uniform Resource Identifiers (URI): Generic Syntax Rate this link
Uniform Resource Identifier (URI) - In the Uniform Resource Locators (URI) definition [RFC2396,RFC1738] there is a field, called "scheme", to identify the type of resource and access method. This is a list of those schemes. Rate this link

HTML details

The Hypertext Markup Language (HTML) is a simple markup language used to create hypertext documents that are platform independent. HTML documents are SGML documents with generic semantics that are appropriate for representing information from a wide range of domains. HTML markup can represent hypertext news, mail, documentation, and hypermedia; menus of options; database query results; simple structured documents with in-lined graphics; and hypertext views of existing bodies of information. HTML has been in use by the World Wide Web (WWW) global information initiative since 1990. World Wide Web (WWW) is the most common use for HTML.HTML is one of the most widely used computer languages in the world. The popularity of HTML is due to the fact that it is the coding technology used to publish content on the World Wide Web (also referred to as the Internet or Web). Programmers quickly discovered that HTML is a user friendly language and is very easy to learn. This ease of coding significantly aided in the proliferation of Web sites. The latest version of HTML is 4.01 which is defined by the standard published on 24 December 1999 by the World Wide Web Consortium (W3C).HTML is not a complete programming language.First, it lacks conditional tests (IF) and flow control (GOTO, DO, and FOR) statements. Some implementation may offer extensions to the HTML languageto accomplish these functions, but they are not part of the HTML standards.The most common method to add the power of real programming languageto HTML page is to embed some suitable programming laguage codeinside HTML code such way that the HTML server or browser cantun them. The most commonly used client end solutions are to useJavaScript or Java to do this kind of functionality. In web serverssolutions like ASP (Active Server Pages), PHP, Java Server Pagesand many other techniques are used.W3C is the organization behind the development of HTML. The newest and onfly official standard on HTML is ISO 15445 (this is normal HTML standard). The newest original HTML specification by W3C is HTML 4.01. Nowadays W3C recommends to use the XML based HTML specification XTHML 1.0 (there is also a newer specification XHTML 1.1).

HTML Specifications

HTML 2.0 Spec - outdated standard Rate this link
HTML 3.0 Spec - outdated standard Rate this link
HTML 3.2 Reference Specification - W3C Recommendation 14-Jan-1997 Rate this link
HTML 4.0 Spec - working draft Rate this link

Reference material

Forms in HTML 4.0 Rate this link
HTML 4.0 Tags - This article lists all the tags, their possible attributes (and which are required), and which recommendation they are part of, in a single, printable page. A handy resource to bookmark for quick information! Rate this link

Introductory material on HTML

Dan's Web Tips Rate this link
HTML Code Tutorial - The Idocs Guide to HTML Rate this link
HTML Elements List - explanation of the original core of the HTM Language Rate this link
HTML-luntti - quick introduction to HTML in Finnish Rate this link
HTML Reference Manual Rate this link
HyperText Markup Language (HTML) - overview of all HTML related materials at W3C and around the web Rate this link
Introduction to HTML Rate this link
Introducing HTML 3.2 Rate this link
Johdatus HTML-kieleen - text in Finnish Rate this link

Tips on writing HTML documents

Indenting with HTML - describes possible solutions to do indenting using HTML Rate this link
Make use of the non-breaking space - Both Netscape Navigator and Internet Explorer allow using the non-breaking space. Wherever you put this code into your HTML, it will result in a single space character that's a bit different from a normal space. In particular, such a space will never get away or get jammed into adjacent spaces (be they non-breaking or not), and it won't allow the browser to break (hence non-breaking) the text line at where it's sitting. Rate this link
Tables on non-table browsers - when HTML Tables are viewed on non-table-capable browsers, the result is usually a mess, but there are ways of marking up tables so that they would offer some reasonable level of fallback capability when viewed on non-table-capable browsers Rate this link
The Table Sampler - intended to be a tutorial by example of Tables Rate this link
Why attribute values should always be quoted in HTML - several reasons why an HTML author should always put attribute values into quotes in HTML Rate this link

Browser specific extensions

Browser specific extensions allow you to do more than what standard HTML allows you to do. The downside of those extensions is that they work only on some specific browsers. Be careful if you use those browser specific extensions and be prepared that your page will be viewed with browsers which do not have those extensions. Generally it is a bad idea to use browser specific extensions in web documents ment for wide audiences because of potential compatibility problems with other browsers.

XML

The Extensible Markup Language (XML) is a subset of SGML.Its goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML. XML has been designed for ease of implementation and for interoperability with both SGML and HTML.An XML page looks something like an HTML page, but there the similarity ends. XML uses HTML-style tags not just to format documents, but also to identify the kinds of information used in documents, so that information can be reformatted for use in other documents and can also be used for information processing.To put that another way, you can use XML to represent data portably.XML is clearly the way of the future as it establishes a common base from which content can be expanded into other formats.If you peek under the hood of high-profile open-source projects such as Mozilla, Apache, Perl, and Python, you'll find a little program called "expat" handling the XML parsing.XML is not a programming language. XML files don't run or execute.By itself, an XML file doesn't do anything. It's a data format thatsits in a disk file until you run a program which reads it anddoes something with it. While XML may give a very precise descriptionof the text or data, it says nothing about what you should do with it(clever XML people select element names that hint what they areused for). Since XML is text-based, the fiel can be written or fixedby using a simple text editor; you don't necessarily need a complexprogramming tool for that (there are such tools if you want to take that approach).Before two applications which use XML can work together, the users/developersmust on basics what they are doing. They have to agree on which tagswill be used allowed, how elements may be nested with each other and amountof XML vocabulary and structure in DTD (Document Type Definition).Application programs which use XML data use validating parsers to read the DTD to before they read the XML document so they can identify the elementtypes and how they relate to one another. It is possible to write anXML without DTD, but using a DTD makes it easier for everyone involved to understand the markup and write software to process it. An alternativeto DTD is a Schema, which provides means od specifying element contentsin terms of data types (including data ranges and checks).Schemas are written as XML files.XSL is a technology related to XML. XSL provides a mechanism for formatting and transforming XML, either at the browser or on the server. For example, XSL can be used to transform XML data into HTML/CSS documents on the Web server.

General information

A Gentle Introduction to XML - This an interactive XML tutorial that includes online access to XML validation and XSL translation and DOM implementation. Rate this link
A Triumph of Simplicity: James Clark on Markup Languages and XML - Markup languages, the standardization process, and the importance of simplicity Rate this link
Canonical XML - This document defines a subset of XML called canonical XML. It is a simple XML format representation of the result of parsing an XML document. Rate this link
Extensible Markup Language (XML) - official home page of XML by World Wide Web Consortium Rate this link
Extensible Markup Language (XML) 1.0 (Third Edition) - W3C Recommendation 04 February 2004 Rate this link
Johdatus XML-ohjelmointiin - XML information in Finnish Rate this link
Johdatus XML-tekniikkaan - introduction to XML written in Finnish Rate this link
XML FAQ Rate this link
XML Transforms Content Management - XML technologies are reshaping the world of enterprise content management. Content goes strategic with XML-based systems that unlock the value of enterprise assets. Rate this link

XML Resource pages

Developer Works - IBM's XML resource, with lots of valuable XML compatibility information and home to xCentral, the XML web search engine Rate this link
Free XML tools and software - This is a frequently-updated and hopefully complete index of free XML tools, with much metadata about the tools to make them easier to locate. Rate this link
The Development Exchange's XML Zone - home to links for valuable XML news, information, and web sites Rate this link
The XML Cover Pages - a comprehensive online reference work for the Extensible Markup Language (XML) and its parent, the Standard Generalized Markup Language (SGML) Rate this link
xml.com - XML information site Rate this link
DevX.com - Timely news and information for the XML industry, with features written by experts and insiders Rate this link
XML Resources - links to XML related documents Rate this link
XML Software Guide Rate this link
XML World - XML article collection with a Software Guide, FAQ, References and more Rate this link

Applications for XML

MathML - mathematical markup language Rate this link
SEML - Semi-Extensible Markup Language is a new language similar to XHTML and WML that allows the serving of both WML or HTML from a single source document Rate this link
SMIL - syncronized multimedia integration language Rate this link
W3C Scalable Vector Graphics (SVG) - SVG is a language for describing two-dimensional graphics in XML. SVG allows for three types of graphic objects: vector graphic shapes (e.g., paths consisting of straight lines and curves), images and text. Rate this link
WML - wireless markup language, used in WAP system Rate this link
XForms 1.0 - XForms is an XML application that represents the next generation of Forms for the Web. By splitting traditional XHTML forms into three parts - data model, instance data, and user interface - it separates presentation from content, allows reuse, gives strong typing - reducing the number of round-trips to the server, as well as offering device independence and a reduced need for scripting. Rate this link
XHTML - HTML 4.01 defined according to XML notations Rate this link
XHTML 1.1 - Module-based XHTML - XHTML 1.1 is a reformulation of XHTML 1.0 strict, but using Modularization. The purpose of XHTML 1.1 is to serve as the basis for future extended XHTML 'family members', and to provide a consistent, forward-looking document type cleanly separated from the deprecated, legacy functionality of HTML 4 and XHTML 1.0. Rate this link
XHTML Basic - XHTML Basic is designed to provide a common subset across various Web clients that do not support the full set of XHTML features, and is built from basic XHTML modules, such as Structure, Text, Hypertext, List, Basic Forms, Basic Tables and Image. While the document type is simple, it is rich enough for content authoring. This profile of XHTML has been adopted for use in the WAP2 standard for mobile telephony. Rate this link

Using XML with your programs and applications

Associating Style Sheets with XML documents Rate this link
Electronic Commerce Modeling Language (ECML) - ECML, a universal format for online checkout form data fields, was announced in June 1999. ECML provides a simple set of guidelines for web merchants that enables digital wallets from multiple vendors to automate the exchange of information between consumers and merchants. Rate this link
JavaSoft: Java & XML - information on using XML with Java Rate this link
The XML library for Gnome - libxml, a.k.a. gnome-xml Rate this link

XML Tools

Apache XML Project - The goals of the Apache XML Project are to provide open commercial-quality standards-based XML solutions and focus for XML-related activities within Apache projects Rate this link
Xalan - an XSLT processor for transforming XML documents into HTML, text, or other XML document types Rate this link
Xalan-C++ version - Xalan is an XSLT processor for transforming XML documents into HTML, text, or other XML document types. Xalan-C++ version 1.3 is a robust implementation of the W3C Recommendations for XSL Transformations (XSLT) and the XML Path Language (XPath). It includes a compatible release of the Xerces-C++ XML parser. Rate this link
Microsoft XML Notepad - a simple prototyping application for HTML authors and developers that enables the rapid building and editing of small sets of XML-based data Rate this link
Publicly Available Software for SGML/XML/DSSSL - The wealth of SGML software made freely available for public. The scope of interest in this list is mainly the Internet. Rate this link
Simple XML Subset Parser - included in GLib Rate this link
XT - XT is a fast, free implementation of XSLT in Java Rate this link

XHTML information

XHTML 1.0 is the first step toward a modular and extensible web based on XML (Extensible Markup Language). It is a page description language which has XML format, but maintain compatibility with today's HTML 4 browsers. XHTML is an acronym for "eXtensible HyperText Markup Language", a reformulation of HTML 4.0 as an XML 1.0 application. XHTML provides the framework for future extensions of HTML and aims to replace HTML in the future. XHTML is the reformulation of HTML 4 as an application of XML. It looks very much like HTML 4, with a few notable exceptions, so if you're familiar with HTML 4, XHTML will be easy to learn and use.XHTML 1.0 is nothing else than HTML 4 formulated to XML format. XHTML 1.0 Transitional allows doing everything HTML 4.01 allows, execept some syntactic restrictions. XHTML 1.0 Strict is more limites. XHTML 1.1 is modular version of XHTML 1.0 Strict.The history of XHTML is very simple; it is derived directly from HTML version 4.01 and is designed to be used with XML. Indeed, XHTML is part of a whole new suite of "X" technologies, with acronyms such as XML, XPATH, XSL, and XSLT, that are destined to have a profound effect on the Internet. XHTML is a new technology. On 26 January 2000, the W3C issued the recommendation for XHTML version 1.0. It is also a rapidly evolving technology. The recommendation for version 1.1, which is a module-based concept for XHTML, has already been published.

HTML Compatibility Guidelines - This appendix summarizes design guidelines for authors who wish their XHTML documents to render on existing HTML user agents. Rate this link
Introduction to XHTML Rate this link
Introduction to XHTML, with eXamples Rate this link
Modularization of XHTML - This Recommendation specifies an abstract modularization of XHTML and an implementation of the abstraction using XML Document Type Definitions (DTDs). This modularization provides a means for subsetting and extending XHTML, a feature needed for extending XHTML's reach onto emerging platforms. Rate this link
XHTML.ORG - This site provides information related to XHTML. Rate this link
XHTML Tables - This article is a basic introduction to tables in XHTML, which covers how to construct a table and the options available with different properties. Rate this link
XHTML: The Clear Code Solution - XML continues to be a hot topic among web developers. Why? Because it delivers a standardized markup that separates display and layout code from syntax, making the creation, maintenance, and parsing of documents much easier for all involved. Rate this link

SMIL information

SMIL is a XML based language for making multimedia presentations. It enables simple authoring of TV-like multimedia presentations such as training courses on the Web. The SMIL language is an easy-to-learn HTML-like language. Thus, SMIL presentations can be written using a simple text-editor. A SMIL presentation can be composed of streaming audio, streaming video, images, text or any other media type. W3C's Synchronized Multimedia Activity focusses on the design of a language for scheduling multimedia presentations where audio, video, text and graphics are combined in real-time. The language, the Synchronized Multimedia Integration Language (SMIL) is written as an XML application and is currently a W3C Recommendation. Simply put, it enables authors to specify what should be presented when.

General information

Synchronized Multimedia - W3C resources for SMIL and related technologies Rate this link
Synchronized Multimedia Integration Language (SMIL) 1.0 Specification Rate this link

SMIL Browsers

JustSmil - SMIL program Rate this link
RealPlayer G2 - program which can play SMIL files Rate this link
QuickTime 4 - includes SMIL support Rate this link

RSS

Rich Site Summary (RSS) is a lightweight XML format designed for sharing headlines and other Web content. RSS is short for RDF Site Summary or Rich Site Summary or Really Simple Syndication, an XML format for syndicating Web content. A Web site that wants to allow other sites to publish some of its content creates an RSS document and registers the document with an RSS publisher. A user that can read RSS-distributed content can use the content on a different site. RSS, first developed by Netscape in the 1990s, allows newsreaders and aggregators to scrape links and article summaries for syndication. They wanted an XML format (RSS .90) that would be easy for them to get news stories and information from other sites and have them automatically added to their site. They then came out with RSS .91 and dropped it when they decided to get out of the portal business. UserLand Software picked up RSS .91 and continued to develop it. At the same time a non-commercial group picked up RSS and developed RSS 1.0 based on their interpretation of the original principles of RSS. They based RSS 1.0 on RDF and re-named it RDF Site Summary. UserLand was not happy with RSS 1.0, and continued development of their version of RSS (Really Simple Syndication), eventually releasing RSS 2.0. RSS 1.0 is a bit more verbose than 0.9x, mostly because it needs to be compatible with other versions of RSS while containing the markup that RDF processors need. The RSS technology, which is built into most blog publishing tools, is used primarily to syndicate news content. RSS has evolved into a popular means of sharing content between sites (including the BBC, CNET, CNN, Disney, Forbes, Motley Fool, Wired, Red Herring, Salon, Slashdot, ZDNet, and more). The use of it is expanding to other uses as well. Already syndicated content includes such data as news feeds, events listings, news stories, headlines, project updates, excerpts from discussion forums or even corporate information. Some people think that RSS could be a replacement for e-mail newsletters. In many ways, RSS is similar to the subscription newsletters that many sites offer to keep viewers up-to-date. The big difference is that they don't have to supply an e-mail address. The Rich Site Summary (RSS) format, previously known as the RDF Site Summary, has quietly become the dominant format for distributing news headlines on the Web. RSS defines an XML grammar (a set of HTML-like tags) for sharing news. Each RSS text file contains both static information about your site, plus dynamic information about your new stories, all surrounded by matching start and end tags. Each story is defined by an <item> tag, which contains a headline TITLE, URL, and DESCRIPTION. Future versions of RSS will incorporate popular additional fields like news category, time stamps, and more. Each RSS channel can contain up to 15 items and is easily parsed using Perl or other open source software. RSS, really a mini database containing headlines and descriptions of what's new on your site, is a natural for layering on additional services. RSS encourages in context multiple points of entry to one primary article, rather than multiple copies of the same article (which introduces its own maintenance problems). Why should I make an RSS feed available? Your viewers will thank you, and there will be more of them, because RSS allows them to see your site without going out of their way to visit it.While this seems bad at first glance, it actually improves your site's visibility; by making it easier for your users to keep up with your site - allowing them to see it the way they want to - it's more likely that they'll know when something that interests them is available on your site. Without a feed, your viewers have to remember to come to your site and see if they find anything new - if they have time. If you provide a feed for them, they can point their aggregator or other software at it, and it will give them a link and a description of developments at your site almost immediatly. By providing an RSS feed, you are in front of them constantly, improving the chances that they'll click through to an article that catches their eye. Syndication of web content via RSS is unlikely to make you rich. However, it can be an easy way to draw attention to your material, bringing you some traffic and perhaps a little net fame, depending on how good your information is. By supplying an RSS feed, you can control what information is syndicated in the feed; only the links and metadata are normally distributed. You can also protect the RSS feed itself with SSL encryption and HTTP username/password authentication too, if you'd like.

Creating an RSS News Feed with PHP and MySQL - Website Automation with PHP and MySQL, Part 16 Rate this link
Dmoz RSS Links - A large collection of links to RSS resources. Rate this link
Headline Syndication Overview - complete guide to publishing your headlines Rate this link
Introduction to RSS Rate this link
Is RSS the Answer to the Spam Crisis? Rate this link
MagpieRSS - MagpieRSS provides an XML-based (expat) RSS parser in PHP. MagpieRSS is compatible with RSS .9 through RSS 1.0. Rate this link
Making An RSS Feed - This document describes the basics or RSS and how to make your own feed. Rate this link
Open Source Scripts - Here you'll find scripts used on webref, feedback forms, web-based emailers, signup scripts, popup menus, and more. This page has many RSS related scripts. Rate this link
Reading News and Blogs via Really Simple Syndication Rate this link
RDF Site Summary (RSS) 1.0 Official Specification Rate this link
RSS 2.0 Specification Rate this link
RSS Feed Reader / News Aggregators Directory Rate this link
RSS DevCenter Rate this link
RSS Info - News and information on the RSS format Rate this link
RSS Syndication and Aggregation - RSS encourages in context multiple points of entry to one primary article, rather than multiple copies of the same article. There are a number of RSS news aggregators out there that automatically suck up RSS files from content providers and present the news in a variety of ways. Many make it easy to drop an RSS feed into your site. Rate this link
RSS Tutorial for Content Publishers and Webmasters - This tutorial explains the features and benefits of a Web format called RSS, and gives a brief technical overview of it. Rate this link
RSS Headliner - The RSS code generator will help you to create the RSS file required. Rate this link
Using RSS News Feeds - The Rich Site Summary (RSS) format, previously known as the RDF Site Summary, has quietly become the dominant format for distributing news headlines on the Web. In this Mother of Perl tutorial, we will write a short Perl script (less than 100 lines) that retrieves an XML RSS file from the Web or local file system and converts it to HTML. Rate this link
WebReference Articles on RSS Rate this link
What is RSS and How Do You Use It? - If you frequent Weblogs, you've seen the little XML icons inviting you to "syndicate this site", but what does that really mean? Rate this link

Other XML software

Java Technology and XML Rate this link
Microsoft Internet Explorer 5 - includes partial support for XML documents Rate this link
Mozilla - web browser software with XML and CSS Level 2 support Rate this link
W3C Amaya - Amaya is a highly advanced and powerful Web client which acts as both a browser and an authoring tool. It has been designed with the primary purpose of being a testbed for experimenting with, testing and demonstrating new specifications and extensions of Web protocols and formats. Rate this link

Character sets

The current HTTP/HTML standards (HTTP/1.0, HTML 2.0 and 3.2) only define one representation of an 8-bit character code on the net, and that is the ISO8859-1 code. No other codings are required by the current standards.If your system uses other character codes, native storage code must be mapped (=translated) into the ISO-8859-1 code that is mandated for network transmission. Uusally this mapping is done when the document is converted to HTML format.The entity names for the accented letters have been clearly defined and used in HTML from the early days, and need no special treatment here. The same goes for the low-half characters (< > & and ") that have to be "entified" because they play a role in the syntax of HTML.W3C has successfully stressed the role of Unicode as the basis for identifying characters in documents. Work is continuing on providing markup and style components for international needs.

A tutorial on character code issues - concepts of character repertoire, character code, and character encoding especially in the Internet context Rate this link
Character Entity Set(s) Rate this link
Character code coverage - browser report Rate this link
ISO-8859 briefing and resources - This document started out as a brief introduction to the ISO-8859-1 character code, with pointers to a number of sources of additional information about iso-8859-1 specifically and about iso-8859 codes in general. Rate this link
ISO 8859-1 Table - gzipped postscript file Rate this link
Special Characters in HTML - iso8859-1 table Rate this link
The euro sign in HTML and in some other contexts - The euro currency unit has an official symbol, the euro sign. In principle, there is nothing particularly specific about it as a character in data processing, except that it was introduced relatively recently. Consequently, the problems of presenting it on Web pages written in HTML include the general problems of presenting special characters, but in addition to them, there can be special problems since old fonts often lack this character due to its recent introduction. This document tries to summarize the problems and solutions of presenting characters in HTML as applied to this important special case. Rate this link
Unicode and Multilingual Support in HTML, Fonts, Web Browsers and Other Applications - Have you ever tried to include a passage in a different alphabet in one of your documents, for example a quotation in Russian in an English document, only to find that you have no Cyrillic characters available? Or produced a Web page that includes technical symbols and found that it works with Windows but not with Mac OS or Unix? Problems like these arise with non-Latin alphabets and Symbol fonts because until recently most computers used fonts that contain a maximum of 256 characters. The solution is to leave behind the assortment of 8-bit fonts with their limit of 256 characters, where the same character number can represent a different character in different alphabets, and move to a system that assigns a unique number to each character in each of the major languages of the world. Rate this link

Character set problems related to Finland

Web design

General guides

Cool URIs don't change - there are no reasons at all in theory for people to change URIs (or stop maintaining documents), but millions of reasons in practice Rate this link
Research-Based Web Design & Usability Guidelines Rate this link
The Web Design Group's Web Authoring FAQ Rate this link
Top Ten Mistakes in Web Design - read also Rate this link
Web Design and Monitor Resolution - what are the most common reslutions in use so to which resolutions to optimize the web pages Rate this link
When Bad Design Elements Become the Standard Rate this link
Who Should You Hire to Design Your Web Site? - You need to hire someone to design your Web site. What should you look for before signing on the dotted line? Let's look at a few different types of consultants. Rate this link
Yale C/AIM Web Style Guide - also at Rate this link

Usability design

Changes in Web Usability Since 1994 - most findings about Web usability are the same now as they were in 1994 Rate this link
Eyetracking Study of Web Readers Rate this link
The Alertbox: Current Issues in Web Usability Rate this link
useit.com: Jakob Nielsen's Website - site for usability issues Rate this link
Web Response Times: The Need for Speed - users beg us to speed up page downloads, read also Rate this link
Usability Expers are from Mars, Graphic Designers are from Venus - here is an unarticulated war currently raging among those who make web sites. Nielsen thinks today's web is an advanced but ill-used database. Kioken thinks today's web is a fledgling but ill-used multimedia platform. Rate this link
User Testing Techniques - A Reader-Friendliness Checklist Rate this link

Site accessability

Accessible Site Design - how to make a site which is Rate this link
Augmentative authoring - a different look at "graceful degradation" in Web authoring Rate this link
Fed Opens Web to Disabled - New federal rules in USA mandate that virtually all government websites be fully accessible to disabled people Rate this link
Graceful Degradation - There's nothing wrong with using all the latest bells and whistles to support snazzy features of newer browsers, but try to do it in a way that still allows users not supporting (or intentionally disabling) these features to access your basic content. Fortunately, this is easy to do on the Web, if you follow the spirit of the languages and protocols instead of fighting it. Rate this link
Graceful Degradation of Scalable Internet Services Rate this link
List of Checkpoints for Web Content Accessibility Guidelines 1.0 Rate this link
Text-friendly authoring topics - how to reach the widest accessability of your documents Rate this link
Web Content Accessibility Guidelines - explain how to make Web content accessible to people with disabilities Rate this link
Web Design Group - to promote the creation of non-browser specific, non-resolution specific, creative and informative sites that are accessible to all users worldwide, includes references, FAQs and other tips Rate this link

Use of frames

Miksi kehykset sopivat Webiin huonosti - why frames are not good for Web, text in Finnish Rate this link
Why Frames Suck (Most of the Time) - For new or inexperienced Web designers, just say no. People who really know what they are doing can sometimes use frames to good effect, though even experienced designers are advised to use frames as sparingly as possible. Rate this link

Use of tables

Dan's Web Tips: Table - general information on using tables Rate this link

Handling errors

Multimedia

Guidelines for Multimedia on the Web Rate this link

Specific site design tips

Designing Music-Related Web Sites - a brief overview Rate this link
Julkishallinnon WWW-sivuston suunnittelun ohjeet - information how to design govermential web pages in Finland, text in Finnish Rate this link
Word division in IE and other notes on the nobr markup and on suggesting possible "word" breaks - Internet Explorer divides strings into two lines in a problematic way Rate this link

Bad things to avoid

Web Pages That Suck - Learn good design by looking at bad design Rate this link
What's wrong with the FONT element? - how documents can become invisible, illegible, or inaccessible to many viewers (info how FONT works can be found from Rate this link

Useful sites for more information

DevArticles - DevArticles is a website that provides both beginning and intermediate web developers with the knowledge, tools and skills they need to get the job done. It publishes articles and tutorials. Rate this link
The Web Developer's Virtual Library Rate this link
useit.com: Jakob Nielsen's Website - site for usability issues Rate this link

HTML checkers and validators

For web pages to work well, it is important that your markup is free of errors.When editing HTML it's easy to make mistakes. Although quite often somewhat broken web page works on many browsers, it can cause compatibility problems with some browsers.HTML page checkers and validators read markup generated by HTML editors and conversion tools, and report if there are errors in the page.HTML validators are designed to bring to your attention errors that you need to work on yourself. It is a good habbit to runyour web pages through a vlaidator before publishing them to avoidcompatibility problems to occuring immediatly or later on.A convenient way to automatically fix markup errors is to use the HTML Tidy utility. This also tidies the markup making it easier to read and easier to edit. I recommend you regularly run Tidy over any markup you are editing. Tidy is very effective at cleaning up markup created by authoring tools with sloppy habits.

Why to validate

4 Reasons to Validate your HTML Rate this link
HTML-validaattori - mik? se on - information on HTML validators in Finnish Rate this link

Page validators

Checking your HTML - links to some good HTML checking services Rate this link
Lehtori - Finnish HTML checker Rate this link
W3C HTML Validation Service - validation service based on an SGML parser, hecks HTML documents for compliance with W3C HTML Recommendations and other HTML standards Rate this link
WDG HTML Validator Rate this link
Weblint Gateway - Enter a URL for a Weblint 1.020 HTML error scan and an HTML listing with error messages in context and tags, URLs, and error messages highlighted in color. Rate this link
Weblint - free HTML validation program written in Perl Rate this link
WWW-ty?kalut - links to HTML and CSS validators, accessability testing programs, compatibility tests and other web pages test tools Rate this link

Link checkers

Checkbot - tool to verify links on a set of HTML pages Rate this link
LinkCheck - on-line web link checker Rate this link
LinkScan/QuickCheck - link checking and website management tool demonstration Rate this link
W3C? Link Checker Rate this link
Xenu's Link Sleuth - A software to find broken links on web sites Rate this link

Forms

Fill-out forms are used for user actions such as registration, ordering, or queries.HTML support quite nice set of form features.Forms can contain a wide range of HTML markup including several kinds of form fields such as single and multi-line text fields, radio button groups, checkboxes, and menus. Usually forms are processed by CGI scripts. An HTML user agent (web browser) begins processing a form by presenting the document with the fields in their initial state (as server sent it). The user is allowed to modify the fields, constrained by the field type etc. When the user indicates that the form should be submitted (using a submit button), the form data set is processed according to its method, action URI and enctype. NOTE: When there is only one single-line text input field in a form, the user agent should accept Enter in that field as a request to submit the form. Most forms you create will send their data using the POST method. POST is moresecure than GET, since the data isn?t sent as part of the URL, and you can send more data with POST.The advantage of GET in some applications is the fact that you can bookmark the generated result page, for example easily create a link to a search engine search with given keyword.General recommendation is thatif the processing of a form is idempotent (i.e. it has no lasting observable effect on the state of the world), then the form method should be `GET'. Many database searches have no visible side-effects and make ideal applications of query forms. If the service associated with the processing of a form has side effects (for example, modification of a database or subscription to a service), the method should be `POST'. Your web server / user browser, when user is sending form data to your CGI, encodes the data being sent.Alphanumeric characters are sent as themselves; spaces are converted to plus signs (+); other characters - like tabs, quotes, etc. - are converted to ?%HH? - a percent sign and two hexadecimal digits representing the ASCII code of the character. This is called URL encoding (mime type application/x-www-form-urlencoded). In order to do anything useful with the data, your CGI must decode these.

General information

FORM - fill-out form - general information on forms Rate this link
Forms in HTML 4.0 Rate this link
RADIO button issues Rate this link
Reset and Cancel Buttons - The Web would be a happier place if virtually all Reset buttons were removed. This button almost never helps users, but often hurts them. Most Web forms would have improved usability if the Reset button was removed. Cancel buttons are also often of little value on the Web. Rate this link
SELECT tag issues - There's an area of confusion regarding the SELECTED attribute of the OPTIONs in a SELECT group. Rate this link
Vuorovaikutusta: lomakkeet (forms) - information in Finnish Rate this link

From form to mail

Sending contents of a form to a specified e-mail address is a very convient way to collect for example feedback information.

FormMail - FormMail is a generic WWW form to e-mail gateway, which will parse the results of any form and send them to the specified user. This script has many formatting and operational options, most of which can be specified through the form, meaning you don't need any programming knowledge or multiple scripts for multiple forms. Rate this link
Mail-in web forms with "yamform" - Yamform, which stands for "Yet Another Mail Form", is a forms-handling program for use with World Wide Web forms. The difference between yamform and other common mail-based forms-handling programs is that yamform allows the designer of the form to control the format of the resulting e-mailed report -- not just the input format but the output format as well. Rate this link

Information on writing form handling scripts

CGI Form Handling in Perl - Perl is an excellent language for a variety of tasks, especially those which require text management and data-parsing. Thus, it is well suited for writing code to manage the common gateway interface (CGI) forms which have become the mainstay of world wide web interactive communication via HTML. Rate this link
Decoding FORMs with CGI sripts Rate this link

Frames

Frames are a way to divide the browser screen to allow easier navigation under some circumstances. Frequently, frames are used to add a side menu bar to a web site where the constant back and forth clicking would become tedious in a single page. The frameset tag is used to declare multiple frames. Frames allow you to divide the page into several rectangular areas and to display a separate document in each rectangle. Each of those rectangles is called a "frame".Frames are very popular because they are one of the few ways to keep part of the page stationary while other parts change. Frames are also one of the most controversial uses of HTML, because of the way the frames concept was designed, and because many web framed web sites are poorly implemented. Normal frames are used to divide the entire browser window (or a frame) to subwindows.Inline frames appear inside the presentation of a document and allow embedding relatively small documents onto pages.

Fonts in WWW pages

Netscape and Microsoft added their web browsers an option to control the presentation of web documents in a form of FONT tag. Thisallowed changing the character font look and size to fit theneeds of web authors. This gives is used right more options forthe visual layout, but if font settings are used carelesslydocuments can become invisible, illegible, or inaccessible to many viewers.Well designed pages should "degrade gracefully".FONT tag is ofther used to vary font characteristics.Other way to change web page font setting is do it using Cascading Style Sheets(CSS) have have also many options to control the web fonts.

Microsoft typography Page Rate this link
Microsoft WEFT3 - font extension system for Microsoft Internet Explorer Rate this link
Web Font Comparison Rate this link
Web Font Test - The following test shows how fonts used in Microsoft Word look when turned into HTML. Rate this link
What's wrong with the FONT element? Rate this link

Web advertising

On-line advertising is used in many web sites to collect money torun the site or just for mose extra money from hobby site.Banners, buttons, interstitials and key words are all examples of online advertisements. Digital advertisement can be text, static graphic, animated graphic, video, audio or other. Banner is a commonly used interactive online advertisement in the form of a graphic image that typically runs across the top or bottom of a webpage, or is positioned in a margin or other space reserved for ads.Different ads and different ad sources are often rotated in the same space on a webpage. This is usually done automatically by software on the website or in a separate advertisement server.Among agencies, web publishers and the companies that technically distribute and manage web advertisements, there is little agreement over whether to charge per impression, per click, per customer registration or per sale. Some even disagree on what such definitions mean.Cache operation has some effect on on-line advertising.Web cache (both cacles in network and in web browsers)store pages, images, or other items, on a local server or user's computer to speed the rate at which webpages load. Ads, like other images, are cached unless some sort of cache-busting technique is used. When ads are cached, they will be served but will not be counted by an ad server. Cache busting is process of blocking the caching of certain files to guarantee new delivery from the external server for each page view (there are many techniques for this). Cache busting is necessary for the successful execution on online advertising but it can slow down the loading of web pages considerably.

Cookies

Cookies are a general mechanism which server side connections (such as CGI scripts) can use to both store and retrieve information on the client side of theconnection. The addition of a simple, persistent, client-side state extends the capabilities of Web-based client/server applications.A server, when returning an HTTP object to a client, may also send a piece of state information which the client will store. Included in that state object is a descriptionof the range of URLs for which that state is valid. Any future HTTP requests made by the client which fall in that range will include a transmittal of the current value ofthe state object from the client back to the server. The state object is called a cookie, for no compelling reason.

Do "Cookies" Pose any Security Risks? - document describes what web cookies are, how they affect to system security and your privacy Rate this link
Persistent Client Side Cookies Preliminary Specification Rate this link

Web programming and scripting

Several technologies are used to create customized Web pages, including the Perl language, Microsoft's Active Server Pages, the PHP hypertext preprocessor, Java, JavaScript and other methods.Nowadays customizd web pages can be created in the web server or onthe fly on the client side. There are many different scripting technologiesfor this kind of and they are each suitable for different applications.Anyone who has designed web sites for any length of time knows the limitations of html and other client side languages like CSS and JavaScript. While these languages remain at the core of web development, their primary function is to control how text and graphics are presented. Because they lack the ability to manipulate information on demand, or communicate with web servers, the result is a static web page. The ability to create dynamic pages opens doors. You can can always write a special program (for example using C)which prints out the desided HTML page, but this can be hard and time consuming. Using many available scripting languages it is usually easier tomake your dynaic web pages than using traditional programming languages.

General scripting resources

DevScripts - DevScripts is a script resource site that programmers can use to find free scripts for ASP, PHP, C++, JavaScript, etc. Rate this link
HotScripts.com - This site has collection of scripts and source code for ASP, C, C++, CFML, Flash, Java, JavaScript, PHP, Perl, Python and XML Rate this link
The JavaScript Source - resource with tons of "cut & paste" Rate this link

Java

Java is Sun's cross-platform, object-based programmign languagewhich can be used for wide variety of applications. In web environmentJava is most often used as applets which are included to webpages andthen run by the client web browser Java virtual machine. Java can alsobe used to make server side web server extensions called servlets.Nowadays Java technologies like Java Servlets and Java Server Pages are hot topics.Servlets are platform-independent, 100% pure Java server-side modules that fit seamlessly into a web server framework and can be used to extend the capabilities of a web server with minimal overhead, maintenance, and support.JavaServer Pages technology is an extension of the Java Servlet technology.

Apache Tomcat - Tomcat is the servlet container that is used in the official Reference Implementation for the Java Servlet and JavaServer Pages technologies. Rate this link
Creating Your First Java application in Linux Using Only VI and Viewing/Debugging It with Netscape Rate this link
Java(tm): Programming for the Internet Rate this link
Java Servlet API - provides web developers a mechanism for extending the functionality of a web server using Java programs called servlets Rate this link
Java Servlet Technology Rate this link
JavaServer Pages (JSP) technology Rate this link
The Java Tutorial Rate this link

JavaScript

JavaScript is the Netscape-developed object scripting language used in millions of web pages and server applications worldwide.JavaScript is Netscape's cross-platform, object-based scripting language for client and server applications. JavaScript is most commonly used to add code to web page which is executed by the web browser in the client computer. In this use Netscape Navigator, Internet Explorer or some other Javascript capable web browser interprets Client-Side JavaScript statements embedded in an HTML page.Netscape's JavaScript is a superset of the ECMA-262 Revision 3 (ECMAScript) standard scripting language, with only mild differences from the published standard. Microsoft's implementation is somewhat different subset of that standard. Both Netscape and MSIE support JavaScript, with newer browsers supporting later versions of JavaScript (support for some other brosers exist or is coming).Client side Javascript can be used to implement menus, clocks, dynamic HTML, interactive web pages, some limited search engines, it is used in web advertising (some banners) etc.JavaScript can also be used to make server side scripts on Netscape web server.Server-Side JavaScript is compiled into platform-independent bytecode and used by Netscape Enterprise Server, enabling you to create server-based applications similar to Common Gateway Interface (CGI) programs.Even though the name is quite similar to to Sun's Java, the JavaScript iscompletely different technology than Java. Contrary to popular misconception, JavaScript is not "Interpretive Java". JavaScript and Java are similar in some ways but fundamentally different in others. In a nutshell, JavaScript is a dynamic scripting language supporting prototype based object construction. The basic syntax is intentionally similar to both Java and C++ to reduce the number of new concepts required to learn the language. JavaScript can function as both a procedural and an object oriented language.

General information

comp.lang.javascript meta FAQ Rate this link
JavaScript Basics Rate this link
JavaScript Frequently Asked Questions Rate this link
The JavaScript Weenie - JavaScript expert and clone articles Rate this link
Webmonkey Programming JavaScript Pages - lots of JavaScript information Rate this link

Javascript tutorials

How to Steal JavaScript - "Borrowing" JavaScript code is easy. Making it work on your own pages, however, can prove difficult. Nadav reveals what you need to know to make those pirated scripts sail. Rate this link
JavaScript-opas - JavaScript tutorial in Finnish Rate this link
Thau's Advanced JavaScript Tutorial - Take JavaScript to the next level with if-then-else statements, cookies, string handling, browser detection, preloaded images, debugging techniques, and a lot more. Rate this link
Thau's JavaScript Tutorial - Learn the basics including variables, if-then statements, link events, and image swaps in what's been called the best JavaScript tutorial on the Web. Rate this link

Javascript references

Coolnerds Electronic JavaScript Reference - reference with most important operators described Rate this link
Event Handlers - Event Handlers allow embedded scripting languages to trap events and actions that occur as a page is experienced by a reader Rate this link
JavaScript Guide - official guide from Rate this link
Official Netscape Javascript documentation Rate this link

Javascrit books

Complete Idiot's Guide to JavaScript, Second Edition - full book on-line Rate this link
DevGuru JavaScript Quick Reference - This is an extensive 214 page reference source that explains and gives comprehensive, working examples of code in a definitive manner for the JavaScript language (and hence, for the ECMAScript and JScript languages). All elements of the language are covered, including the events, functions, methods, objects, operators, properties, statements, and values. Rate this link

Example source code resources

Doc JavaScript - biweekly column on JavaScript, a programming language for web page's object manipulation Rate this link
JavaScript Clock Rate this link
Popup Closer Rate this link
Programming JavaScript Rate this link
The JavaScript Source - resource with tons of "cut & paste" Rate this link
Website Abstraction Free JavaScripts! Rate this link

Related technologies

An Introduction To DHTML - JavaScript and HTML make a nice couple Rate this link
ECMAScript - based on several originating technologies, the most well-known being JavaScript (Netscape Communications) and JScript (Microsoft Corporation) Rate this link

PHP

PHP is a server-side, cross-platform, HTML embedded scripting language. PHP is a scripting language, with a particular affinity for and emphasis on enhancing Web pages. It has a syntax very similar to C (with a smattering of Perl and shell), and includes lots and lots of functions for things like database access, dealing with CGI requests, and image creation and manipulation. PHP is a simple scripting language to implement server side fuctionalityto web servers which support that scripting language.PHP, which stands for PHP Hypertext Preprocessor, is a server-side embedded scripting language. In non-technical terms: a PHP processor is run on the server (Windows, or a flavor of UNIX). When a page is requested that contains PHP, the processor translates and executes all the commands in the page, and then outputs the result to the browser as regular HTML. Because this translation occurs on the server, a page written with PHP is viewable with any browser, on any operation system.One of the nice things about PHP - and one of the primary reasons for its popularity as a rapid application development (RAD) tool - is the fact that PHP code can be inserted into regular HTML markup to turn otherwise static HTML content into dynamic, intelligent Web pages. This feature makes it possible to quickly write PHP scripts that build Web pages on the fly from a database (or other external data source), and to create "smart" Web applications more efficiently than would otherwise be possible with traditional programming languages like Java or Perl. PHP is usually embedded directly into HTML. PHP code is separated from HTML by Start and End entities. When a document is parsed, the PHP processor only interprets the demarked areas, and outputs the results in the same position. The information on PHP program are not kept in memory between different page loads. All objects created by your script are destroyed at the end of execution. So, once your script is done loading the page, all data stored in memory is marked for garbage collection. If you want data that needs to persist between different page load have to store your data structures in a database, flat file, or any other location that is to your liking. With PHP is it pretty easy to access file system and different databases. However, this ease of use comes with a price: most PHP-based Web sites are often a mush of intermingled HTML markup and PHP function calls, making them hard to decipher and maintain. PHP also includes the ability to almost completely separate code from HTML. PHP was originally designed to be a templating system. The original version of PHP was built to allow you to embed code in HTML toeffectively produce a template.PHP has grown over the years beyond its original purpose, but it is still atemplating system. Any competent developer can separate business logic,presentational logic and presentation using PHP natively.

Template-Based Web Development With patTemplate (Part 2) Rate this link
Adding PHP to Apache on Linux Rate this link
Creating a PHP Form Mail Script - One of the great features of PHP is its form handling capability. Using HTML forms is a great way to receive feedback from your website visitors, and PHP has lots of built-in tools to work with the information gathered in forms. In this tutorial, we will create a simple PHP script to send email from a web browser. Rate this link
Include/SSI: Learn how to use the include() function using the power of PHP - One of the most important and essential functions used in PHP is the include() function. The include() function lets you call an external file from your server, There are several key factors why it is rated much higher than SSI. Besides files also full URLs are supported. You can even use the virtual() function to run anything that you would parse through Apache (like CGI and Perl in a PHP file). Rate this link
Introduction to Regular Expressions - If you think of cartoon language when you see a string like ^[grmbl]f{23}$, this is a good time to introduce yourself to regular expressions. A web developer's job can get quite a bit easier with this powerful tool for searching and replacing. PHP has two types of regular expressions available. The one type this tutorial covers follows the POSIX standard and is implemented through Henry Spencer's regex library. They're a bit different from Perl's implementation. Rate this link
Learning PHP: The What's and the Why's - The ability to create dynamic pages opens doors. PHP, which stands for PHP Hypertext Preprocessor, is a server-side embedded scripting language suitable for creating dynamic web pages. Rate this link
Mixing PHP and SSI - You might sometimes want to mix PHP and SSIs for some reason. You can call a .php file from SSI with few problems as long as you don't need certain environmental variables. You can to include SSI files to PHP pages using PHP virtual() function. Rate this link
PHP and Regular Expressions 101 - A regular expression is a specially formatted pattern that can be used to find instances of one string in another. Several programming languages including Visual Basic, Perl, JavaScript and PHP support regular expressions, and hopefully by the end of this primer you should be able to implement some basic regular expression functionality into your PHP pages. Rate this link
PHP FAQ Rate this link
PHP Builder - lots of information and samples about PHP Rate this link
PHP Developer Shed - PHP Help, PHP Programming, PHP Code, PHP Tutorials, and more! Dev Shed contains tutorials, articles, code samples, and other various resources. Our writings feature in-house authors, freelance writers, and several exclusive publishing relationships. Rate this link
PHP: Hypertext Processor - official home page Rate this link
PHP Manual Function Reference Rate this link
PHP Phanatics - Articles for advanced PHP developers with a focus on scripts that are effective, maintainable, and efficient Rate this link
PHP With HTML Forms Rate this link
phpWizard.net - Building Dynamic Websites with PHP. Some great PHP Components. Rate this link
Secure Programming for Linux and Unix HOWTO: PHP Rate this link
Suggested Methods of Using PHP Securely Rate this link
Template-Based Web Development With patTemplate (Part 1) - Templates separate presentation and layout information from program code, and a template engine to combine the two to create the final product. This two-tiered approach affords both developers and designers a fair degree of independence when it comes to maintaining a Web site, and can substantially reduce the time and effort required in the post-release phases of a development project. Rate this link
The PHP Resource - lots of PHP information Rate this link
The PHP Resource Index Rate this link
Tuning Apache and PHP for Speed on Unix - Here is a compilation of tips on how to optimise Apache on Linux for PHP and CGI programs. These tips can also apply to Perl and Python. Rate this link
Using PHP for Date Processing in Forms - One of the most time-consuming tasks of any web application developer is dealing with forms. Not only must you decide what information to place onto the form and design the HTML for it, but you also must validate the submitted form data and process it. Rate this link
Using PHP to Interpret Forms Rate this link
WebReference Articles on PHP Rate this link
ezSQL - A class that makes it ridiculously easy to use mySQL, Oracle8, Inerbase/Firebase, PostgreSQL, SQLite (PHP), SQLite (C++), MS-SQL database(s) within your PHP/C++ script. Includes lots of examples making it very easy to understand how to work with databases. Rate this link

Active Server Pages (ASP)

Other web scripting tools

Server side includes (SSI)

Adding some web server executable code to the web pages usingsome suitable tools. Those pieces of code within the pageare executed and the output of those scripts plus the originalHTML (without those executable codes) are sent to the web browser.

Apache module mod_include documentation - module needed to be able to use Server Side Includes Rate this link
Apache Tutorial: Introduction to Server Side Includes Rate this link
How do I enable SSI (parsed HTML) in Apache server Rate this link
Module mod_include - This module provides for documents with Server Side Includes (SSI) in Apache HTTP Server. Rate this link
Server Side Includes (SSI) - NCSA HTTPd allows users to create documents which provide simple information to clients on the fly Rate this link
Using Server Side Includes - article on using Server Side Includes to make adding dynamic content to your documents easy Rate this link

Web server information useful for scripting

Web server program is the software application running ona computer which acts like a web server. Web server is neededfor making hypertext and other documents available to Web browsers. Modern web server programs allows one computer with one IP address to show to the networks as many different servers with different names. The idea is that all those different names point in DNS to this same IP, but the server program distinquises what is the name of server the user wants to access (web browsers tell the name of server they want to access in their request, so the server gets to know it in this way). In case o ufer tries to access the server with only IP address, the server just gives out the default page (usually service provider main page). General rule of thumb for hardware needs of web servers.For scripts, the main bottleneck is the CPU. For static HTML/images, the bottleneck is RAM and the network. A slow 400 Mhz Pentium can easily saturate a T3 line (that's 45 Mbps) with static HTML pages.Pages which use buil-in scripting (for example ASP, PHP or similar)are generally 2-10 times slower than a static HTML page by web server (for example Apache).Sometimes you have to choose between UNIX/Linux operating systemand Windows operating system for web server applications.The operating system you run on your PC has absolutely no bearing on your dealings with a host's web server. Even those designers using Microsoft's FrontPage will have no problem publishing their site on any of suitable Unix servers. The foundation of the Unix operating system is its reliability. Built with stability and efficiency in mind, the OS uses text-based commands instead of a graphical user interface. The resulting operating system doesn't waste memory or disk space on bells and whistles. Since Unix is an open source operating system it is freely available to anyone who wants to add to or improve it. The majority of web server applications available on the Internet are intended for use with the Unix OS. This includes many of the scripts, written in languages such as Perl, C, C++, Java, and PHP, which are used to add functionality to web sites. Also database systems are noadays well supported, most often used database system for web applications on UNIX platforms seems to be MySQL. The majority of existing CGI programs (generally Perl scripts) that are available on the Internet have been developed on Unix platforms and hence tend to install and operate much more smoothly under Unix. There are hundreds of thousands of free scripts out there that you can download and setup on a Unix server within minutes. Most UNIX systems support interactive access, via telnet or SSH. Generally one is able to do many configuration/setup tasks themselves rather than having support team do them. Many new users simply aren't comfortable with Unix. There are also several types of server applications, mostly those authored by Microsoft, which cannot be supported by Unix. These include Microsoft's Active Server Pages (ASP), Microsoft Access and SQL databases, among others. Windows 2000 allows those with little or no experience in the fields of web development and hosting to get advanced features working very quickly. Like any Microsoft product, Windows 2000 Advanced Server is very resource-intensive, meaning it will require a generous allocation of resources and hardware if stability plays a key factor in your determinationWindows 2000 Server supports all common Microsoft products including Microsoft FrontPage and Microsoft Office, making some everyday tasks simpler to interact with your web server. Windows 2000 Server supports Active Server Pages (ASP) and Microsoft database systems (Access and SQL).

General web server information

How to Choose a Web Hosting Service (ISP) for your Business Web Pages - One of the most important decisions you'll make for your business Web site is selecting which Web hosting service will display your Web pages to the Internet. With a good Internet Service Provider (ISP) you don't have many problems; with a poor ISP you have much pain. So how do you go about shopping? Rate this link
Open-Source Web Servers: Performance on a Carrier-Class Linux Platform - Ibrahim tests the performance of three open-source web servers on a typical Ericsson Research Linux cluster platform and reports his results. The tested servers were Apache, Tomcat and Jigsaw. Rate this link
Serving Web Pages from a Dynamic IP Address - This document describes how to have your own Linux server on the Internet without renting a dedicated machine or a leased line. More technically, it's about setting up a fixed domain name for a dynamic IP address. Rate this link
Summary of Access Permissions in UNIX File System - Perhaps the most important thing to obtain a grasp of when dealing with web development, is the prevalence of the UNIX style file system in almost everything you will be doing. If you have ever used HTTP or FTP (web browsing or file transfers), you will have already encountered a good deal of unix file system concepts. Rate this link
The Netcraft Web Server Survey: Market Share for Top Servers Across All Domains August 1995 - October 2001 - The Netcraft Web Server Survey is a survey of Web Server software usage on Internet connected computers. We collect and collate as many hostnames providing an http service as we can find, and systematically poll each one with an HTTP request for the server name. Rate this link
Unix vs. Windows 2000 - There are many common mis-conceptions about which operating system is right for you Rate this link
Using the UNIX File System - Perhaps the most important thing to obtain a grasp of when dealing with web development, is the prevalence of the UNIX style file system in almost everything you will be doing. If you have ever used HTTP or FTP (web browsing or file transfers), you will have already encountered a good deal of unix file system concepts. Rate this link

Managing sessions

Many personalization techniques and e-commerce application rely on session tracking. Since HTTP is a stateless protocol, this is a limitation if you want to implement such finctionality.A session is defined as a set of interactions between the client's web browser and web server. A session starts when user first invoke site's URL and ends when they terminte their browser, user log outs (if site has suych option) or when session times out after a certain period of inactivity. Since HTTP is a stateless protocol, an external mechanism is needed to manage and save the state information between client/server web interactions. Most common mechanisms used to maintain state are cookies and hidden fields. Security is a concern for both hidden fields and cookies (storing sensitive information to those is not recommended). An alternative is to store a session ID (SID) to URL and save state information to the server side. Once you have a mechanism to identify a session, you must determine how to save the state information on the web server. If you are using ASP and JSP, you can use the built-in session object to store the information into memory. PHP has a similar mechanism called "session variables". There are several ways to save session data. By far, the most common approach is to make the data persistent in a relational database.The session support allows you to register arbitrary numbers of variables to be preserved across requests. A typical way to implement session handling is the following: A visitor accessing your web site is assigned an unique id, the so-called session id. This is either stored in a cookie on the user side or is propagated in the URL. Using sessions, does not mean, you can be absolutely sure, that the session data can only be viewed by that user. This is important to keep in mind, when storing and displaying sensitive information.Therefore, when dealing with sensitive information, there should always be additional methods to decide whether it is a valid session. Sessions are not reliable as a secure authentication mechanism.

User Authentication and Password Protection of Web Pages Rate this link

Web server software links

Apache - propably the best and most widely used free web server program with good documentation available on-line Rate this link
Boa Webserver - a single-tasking HTTP server that internally multiplexes all of the ongoing HTTP connections, and forks only for CGI programs Rate this link
GoAhead WebServer - free embedded web server Rate this link
Jigsaw - The W3C's Server - Jigsaw is a Web server platform, providing a sample HTTP 1.1 implementation and a variety of other features on top of an architecture implemented in Java. Rate this link
Red Hat TUX Web Server - TUX is a kernel-based, threaded, extremely high performance HTTP server for Linux. It is able to efficiently and safely serve both static and dynamic data. TUX moves the HTTP protocol stack to the kernel, and can handle requests for data with both kernel-space and user-space modules. Rate this link
Microsoft - Internet Information Server is a Commercial web server for Windows NT Rate this link
NCSA HTTPd - NCSA HTTPd is an HTTP/1.0 compatible server for making hypertext and other documents available to Web browsers. Rate this link
Netscape Web Server products - Commercial web server program available for many platforms. Rate this link
thttpd - tiny/turbo/throttling HTTP server is a simple, small, portable, fast, and secure HTTP server Rate this link

Using Apache web server software

Adding PHP to Apache on Linux Rate this link
Apache Core Features - These configuration parameters control the core Apache features, and are always available. Rate this link
Apache HTTP Server Version 1.3 Documentation Rate this link
Apache IP-based Virtual Host Support Rate this link
Apache name-based Virtual Host Support - The HTTP/1.1 protocol, and a common extension to HTTP/1.0, includes a method for the server to identify what name it is being addressed as, so you can run many different name web servers in same computer with one IP address Rate this link
A Users Guide to URL Rewriting with the Apache Webserver Rate this link
Hints and Tips for Apache - Information on file access restrictions, how to speed up Apache, rotating log files, about single configuration file and running user CGI programs. Rate this link
How do I make htaccess work? - Setting up user authentication takes two steps. First, you create a file containing the usernames and passwords. Second, you tell the server what resources are to be protected and which users are allowed (after entering a valid password) to access them. After this is done you will need to create a user database. Rate this link
Limit Your Pages to Rice: Introduction to WWW Access Control - For the Unix-based NCSA and Apache web servers , access control is set on a per-directory basis by putting a ".htaccess" file in the directory. Subdirectories inherit access control rules from their parent directories, and each web server has a default set of rules which apply to any directory that contains no ".htaccess" file anywhere in its path. There are two kinds of access control: IP-address-based and password-based. Rate this link
Securing Apache - Access Control - This article describes how to use ACLs (Access Control List) to protect data, and how to use them to securely implement HTTP PUT and discuss how to use modules to increase Apache's functionality. Rate this link
The Red Hat Linux Apache Centralized Knowledgebase - This document is a collection of information gathered from far and wide using the best information resource known: the Internet. It's an analysis of the top 25 problem types reported regarding the Apache web server and the solutions to those problems whenever possible. It also contains installation FAQs on PostgreSQL and MySQL. Rate this link
Tuning Apache and PHP for Speed on Unix - Here is a compilation of tips on how to optimise Apache on Linux for PHP and CGI programs. These tips can also apply to Perl and Python. Rate this link
Tuning Your Apache Web Server - Setting up and configuring your Web server is a reasonably simple process for which most installations provide scripts. Optimization involves tweaking your server to achieve maximum performance. This is not well documented and is anything but automated. This article provides some simple tactics you can employ to improve your Web server's performance, and where appropriate it points to tools and information sources that will help you achieve even greater performance gains. Rate this link
Using .htaccess Files with Apache - The .htaccess file allows customising Apache functionality at the directory level. This article shows how you can use per-directory configuration files, called .htaccess files, to customise Apache behaviour -- or allow your users to do so for their own documents. Rate this link
Using User Authentication - how to do it with Apache web server Rate this link
Virtual Host examples for common setups Rate this link

Using other web server software

Apache 1.3 URL Rewriting Guide - This document describes how one can use Apache's mod_rewrite to solve typical URL-based problems webmasters are usually confronted with in practice. Rate this link
NCSA HTTPd Tutorials Rate this link
User Authentication and Password Protection of Web Pages Rate this link
Web Hosting Tutorial Rate this link

Accessory software

Analog - Analog is the most popular logfile analyser in the world. Analog shows you the usage patterns on your web server. Analog is a program to measure the usage on your web server. It tells you which pages are most popular, which countries people are visiting from, which sites they tried to follow broken links from, and all sorts of other useful information. Analog is free software. Rate this link
The Webalizer - The Webalizer is a fast, free web server log file analysis program. It produces highly detailed, easily configurable usage reports in HTML format, for viewing with a standard web browser. The Webalizer is distributed under the GNU General Public License, complete source code is available, as well as binary distributions for some of the more popular platforms. Rate this link

Perl

Beginner's Guide to CGI Scripting with Perl Rate this link
CGI.pm - a Perl5 CGI Library Rate this link
Embedded Perl 5 Language (ePerl) - ePerl interprets an ASCII file bristled with Perl 5 program statements by evaluating the Perl 5 code while passing through the plain ASCII data. It can operate in various ways: As a stand-alone Unix filter or integrated Perl 5 module for general file generation tasks and as a powerful Webserver scripting language for dynamic HTML page programming. The ePerl program itself is a fully functional Perl interpreter because internally it makes use of the original Perl interpreter library (libperl.a). So ePerl is based on the installed Perl system. The only noticeable difference to the user between the program eperl and perl is the source file format they use. Rate this link
ePerl - This tool allows you to expand perl statements inside of text files. ePerl reads a text file and expands any perl code it sees between some simple delimiters. Rate this link
Getting Started With Perl - script language you should consider when automatically building web pages Rate this link
Comprehensive Perl Archive Network (CPAN) - Here you will find All Things Perl. Rate this link
How PERL Works Rate this link
Mail::Sendmail FAQ - This is a little FAQ for the Mail::Sendmail Perl 5 module. Rate this link
Mailtest - small Perl script to send mail Rate this link
Matt's Script Archive - lots of Perl scripts for WWW related scripting applications Rate this link
Perl for MS-DOS - download directory for DOS and Windows versions Rate this link
Perl Program to Generate and Process Form Rate this link
Perl Reference Page Rate this link
Quick Perl Reference Rate this link
The Fool's Guide to CGI.pm, the Perl module for CGI scripting - No, this guide is not for fools but by a fool, who had kept wondering how CGI.pm is supposed to work and finally got a clue. So here's how (I think) it works in simplest cases. I hope this might make some other people's learning curve a little better. Examples include a simple Perl program that processes a form. Rate this link
The Apache/Perl Integration Project Rate this link
www.perl.com - the source for Perl Rate this link

Python

Python is a powerful, modern, free, open source, general purpose,interpreted programming language developed by Guido van Rossum.It is used by many programmers around the world. Python is easy to learnand allows very rapid developments. Pythin canbe used as scripting languagefor example for writing CGI-scripts or it can be embedded to HTMLpage using Zope system.

Python CGI FAQ Rate this link
Common Gateway Interface in Pythin: An Interctive Instruction Rate this link
Python CGI FAQ Rate this link
Python Common Gateway Interface support Rate this link
Python Language Website Rate this link
Python Tutorial Rate this link
Python Web Application Tools Rate this link
Using Python for CGI programming - slide set Rate this link
Writing CGI Programs in Python - Python is a powerful, free, open source, general purpose, interpreted programming language. Python runs on a wide variety of platforms including Linux, Microsoft Windows (95/98/NT), Macintosh (including OS X), virtually every flavor of Unix, and many other platforms. Python is roughly comparable to Perl or Java, though it has several significant strengths (and a few disadvantages) over each. Python makes it very easy to write clean, maintainable, and powerful programs for a variety of tasks with minimum hassle. Rate this link
Zipe - Zope is the leading Open Source web application server. Zope enables teams to collaborate in the creation and management of dynamic web-based business applications such as intranets and portals. Rate this link

CGI scripts

CGI programs are typically programs which usually run for a very short time to server one user request. For theis reason they should be designed in such way that they start quickly. Memory leaks are seldom a problem in CGI programming because CGI applications rarely run long enough to be concerned with leaks.Usually CGI scripts are run at the same user ID as the web serverprocess is running or run anonymously with the user set to "nobody".This means that you need to have execute permission enabled for the public as well as the script owner (eg CHMOD 755 giving rwxr-xr-x) and any files written to by the script need to have write permission enabled for the public (eg CHMOD 666 giving rw-rw-rw-).In some times it is beneficial if CGI scriptsare run under the used ID of the account they reside in.Sometimes be useful to run CGI scripts with yourself identified as the user. CGI Wrappers are used for the purpose of running CGI scripts onthe script owner used ID. When running a script under your own userid (using the CGI wrapper), you can generally give sensitive permissions only to the file owner. eg A CGI script may run with permissions of 700 (rwx------) and may write to files with permissions of 744 (rwxr--r--).Please be aware that whilst running scripts under your userid can in some ways be used to improve security (by using more restrictive file permissions for example), it can also be a serious security risk unless your code is carefully written with security in mind. This is because a script running under your userid will be able to access more sensitive files and areas of the system than would be possible if running anonymously. CGI scripts should always be written to be as secure as possible and to minimise the possibility of successful system attacks by hackers, but this becomes even more important when running scripts under your own userid.

General information

CGI Programming 101 - class intended for anyone who knows HTML, but doesn't know Perl or CGI programming Rate this link
CGI Programming FAQ Rate this link
FastCGI - FastCGI is a language independent, scalable, open extension to CGI that provides high performance without the limitations of server specific APIs. Rate this link
Guide to Writing CGI Scripts in REXX and Perl - This Guide is aimed at people who wish to write their own WWW executable scripts using WWW's Common Gateway Interface (CGI). Though the main emphasis is on REXX many examples are also provided in Perl. Rate this link
How CGI Scripting Works Rate this link
How the web works: HTTP and CGI explained Rate this link
How to use your CGI-BIN - Information how to use ready made CGI scripts. Rate this link
Palvelinskriptit (server-side scripting) - information in Finnish Rate this link
Remotely Hosted CGI Scripts - some services you can use without making yuour own CGI scripts Rate this link
Secure Programming for Linux and Unix HOWTO - This book provides a set of design and implementation guidelines for writing secure programs for Linux and Unix systems. Such programs include application programs used as viewers of remote data, web applications (including CGI scripts), network servers, and setuid/setgid programs. Specific guidelines for C, C++, Java, Perl, PHP, Python, TCL, and Ada95 are included. Rate this link
Security with CGI Scripts - ook out for security holes as you write your script Rate this link
The CGI Resource CGI Tutorials Index Rate this link
Uusmediasovellusten tekniikat, CGI, Perl, SSI - link page to good documents for web technologies Rate this link

Information on developing CGI scripts

CGI Developer's Guide - two chapers of book on-line Rate this link
CGI Developer's Guide - on-line version of book Rate this link
CGI Made Really Easy - Writing CGI scripts to process Web forms Rate this link
CGI Programming FAQ Rate this link
CGI Programming Unleashed - book on-line Rate this link
CGI: Common Gateway Interface - resource page by W3C Rate this link
CGI Developer's Guide - full book on-line Rate this link
Common Gateway Interface RFC Project Page - effort to turn the de facto Common Gateway Interface "standard" into an actual Informational RFC Rate this link
The Common Gateway Interface (CGI) - very good pages about making CGI scripts Rate this link
The WWW Common Gateway Interface Version 1.1 Rate this link
Writing More Secure CGI Scripts - a few guidelines to help ensure your program does not come under attack. Rate this link

CGI scripts and script collections

CGI Scripts Link - links to CGI scripts collection sites Rate this link
HotScripts.com - web scripting resources Rate this link
lakeWeb's PERL Scripts - some free to use script written using Perl Rate this link
Programs and Scripts in Perl for Survey and Voting Rate this link
The CGI Resource - site which has a collection of useful CGI scripts Rate this link
The CGI esource Index - lots of CGI information and scripts Rate this link

Solving problems

The Idiot's Guide to Solving Perl CGI Problems Rate this link

Tools for web page preprocessing

Style sheets

Cascading Style Sheets (CSS) is a simple mechanism for adding style (e.g. fonts, colors, spacing) to Web documents.Style sheets describe how documents are presented on screens, in print, or perhaps how they are pronounced. W3C has actively promoted the use of style sheets on the Web since the Consortium was founded in 1994. By attaching style sheets to structured documents on the Web (e.g. HTML), authors and readers can influence the presentation of documents without sacrificing device-independence or adding new HTML tags. For style sheets to work, it is important that your markup (both HTML and CSS) is free of errors. CSS can be used to style HTML & XML documents.Style sheets offer precise control over the presentation of Web pages. Not only can Web designers specify the visual effects they want, but also aural style sheets give control over voice, pitch and other aspects of how the text will sound when rendered into speech. After the publication of two recommendations, CSS1and CSS2, W3C continues to evolve the Cascading Style Sheets (CSS) language to provide even richer stylistic control, while emphasizing the importance of the thorough implementation of CSS on browsers.

Adding a touch of style - This is a short guide to styling your Web pages. It will show you how to use W3C's Cascading Style Sheets language (CSS) as well as alternatives using HTML itself. The route will steer you clear of most of the problems caused by differerences between different brands and versions of browsers. Rate this link
Cascading Style Sheets Rate this link
Cascading Style Sheets - W3C page Rate this link
Cascading Style Sheets, level 2 - CSS2 Specification Rate this link
Cascading Style Sheet Resources Rate this link
Cascading Style Sheets - WDG page Rate this link
Colophon: Cascading Style Sheet choices Rate this link
Cascading Style Sheets, level 1 - W3C Recommendation 17 Dec 1996, revised 11 Jan 1999 Rate this link
CSS-artikkelisarja - large article series about styles sheets in Finnish Rate this link
CSSCheck - validation tool for style sheets Rate this link
CSS Properties - quick view to style sheets Rate this link
Effective Use of Style Sheets Rate this link
Learning CSS Rate this link
RichInStyle.com CSS bug table Rate this link
WebReview.com's Style Sheet Reference Guide - Trying to keep track of how well CSS works on different platforms can be a daunting task, this document helps it a little bit. Rate this link

Graphics in WWW

Graphics are the most visible part of the modern Web and arguably one of the primary reasons for it popularity and explosive growth. Successful use of graphics on the Web depends on interoperability across platforms, output resolutions, color spaces, and software products.The most commonly used graphics formats for web are GIF and JPEG pictures.W3C has also issued a Recommendation for Portable Network Graphics (PNG), a format for bitmapped images. The graphics pictures presented in bitmap format (GIF, JPEG or PNG) are usually optimized for around 75 dpi resolution, because the is the resolution in typical computer monitor used to view web pages.For vector graphics W3C is developing Scalable Vector Graphics (SVG), an open vector graphics format written in XML, and designed to work across platforms, output resolutions, color spaces, and a range of available bandwidths.

Color information

16 Basic Colors - with names and hexadecimal numbers Rate this link
216-color Websafe Palette Rate this link
A Standard Default Color Space for the Internet - sRGB - The aim of this color space is to complement the current color management strategies by enabling a third method of handling color in the operating systems, device drivers and the Internet that utilizes a simple and robust device independent color definition. Rate this link
Background and Foreground Control Rate this link
ColorCenter - on-line tool to test suitable color combinations for your web pages Rate this link
Death of the Websafe Color Palette? - what was once believed to display always correctly do not do it across all browsers, platforms, and color depths Rate this link
HTML 3.2 16 Standard Color Names & Numeric Color Value Rate this link
Text and Background - apply color principles in web design, select colors for the background, text, and links on the page via the attributes of BODY tag Rate this link
User-Interface Design Guidelins for Color Rate this link
Victor Engel's No Dither Netscape Color Palette Rate this link
Web Font Colors - page to help to determine the best colors for type on your web page Rate this link

Display properties

Pictures

All About GIF89a Rate this link
GIF Wizard - online utility will help you reduce the file size of your GIF files Rate this link
Grafiikkaformaatit ja niiden k?sittely - meta FAQ on graphics formats in Finnish Rate this link
Imaging for the Web - get the right format for you web page graphics Rate this link
The background FAQ - about web page background images Rate this link
Web Graphics on a Budget: Paint Shop Pro Tutorials Rate this link

Free web graphics

Flags of the Nations of the World - over 300 free national, state and flag images Rate this link
Free Web Graphics - bullets, buttons and divider lines Rate this link
Graphics for your pages - icons, buttons, paragraph dividers and dingbats Rate this link
Honest Har's Button and Graphics Zone!! Rate this link
MediaBuilder Free Image and Animated GIF Library - backgrounds, animations, borders, icons and lines Rate this link
The Under Construction Zone - A Library of Live Action Shareware to Animate Sites Under Construction Rate this link
Web Clip Art - free icons, borders and animations Rate this link
Yahoo Clip Art - links to many clipart sites Rate this link

Other free web page resources

Free Web Templates.com - free templates, free wallpapers, free textures, free buttons Rate this link
Pagekits - website templates, with all the code and images you need to get started in making your own web site Rate this link

Vector graphics

Scalable Vector Graphics is an XML grammar defined by the World Wide Web Consortium (W3C). It defines a language for describing two-dimensional graphics, consisting of vector graphic shapes, images and text.

Adobe SVG Viewer - Viewer plug-in for W3C Scalable Vector Graphics Rate this link
Batik SVG Toolkit - Batik is a Java(tm) technology based toolkit for applications that want to use images in the Scalable Vector Graphics (SVG) format for various purposes, such as viewing, generation or manipulation. Rate this link
Mozilla Scalable Vector Graphics (SVG) Page Rate this link
W3C Scalable Vector Graphics (SVG) - SVG is a language for describing two-dimensional graphics in XML. SVG allows for three types of graphic objects: vector graphic shapes (e.g., paths consisting of straight lines and curves), images and text. Rate this link

Making multimedia web pages

Animation on WWW

Create your own banners - links to banner creation information and tools Rate this link
GIF 89a - information about GIF format used for GIF animations Rate this link
GIF animation on the WWW - very complete page with tutorials and links to software, maybe you should check if there is Rate this link
Microsoft GIF Animator - free GIF animation pgoram for Windows 95 Rate this link
SwishZone.com - create Flash aminations without Flash Rate this link
VideoCraft GIF Animator - a commercial prigram for GIF animations you can try 30 days when you download it Rate this link

Video and web

Captioning for the Web Rate this link
Put Your Video On The Web For Free - basics of putting videos to web Rate this link
Video and Streaming Media - When building web sites remeber that most users still do not have sufficient bandwidth to receive streaming video in an acceptable quality and they won't get broadband for another four years. Rate this link

Audio and web

Designing Music-Related Web Sites - brief overview Rate this link

Webcams

A simple Webcam consists of a digital camera attached to your computer. A piece of software connects to the camera and grabs a frame from it periodically. . The software then turns that image into a normal JPG file and uploads it to your Web server. The JPG image can be placed on any Web page (for information on creating Web pages and adding JPG images. Putting a standard JPG image into a standard Web page is straightforward, but it has the disadvantage that web page users must manually refresh the image. Fortunately there are technologies for automatic image refreshing. With use of a meta tag, a JavaScript function or a Java applet, it is possible to create a system that automatically refreshes the image for your readers.

How Webcams Work - This article will look at the steps you can take to put up your own simple Web camera. Rate this link

Multimedia player plug-ins

NetShow - network video player from Rate this link
RealPlayer - for Rate this link

Multi langual web publishing

The main language in web is English. But there are also many pages published on other languages. And web sites that support many languages. Nowadays multilingual authoring is really very limited at present on the Web. There is the overall dominance of English, and the effect that most people just write Web pages in English due to that.There are big differences in the difficulties when authoring in a language other than English. After all, there are thousands of languages, with different writing systems, and different people. For Western European languages, the difficulties are relatively small. For languages like Japanese or Chinese, the character encoding issues are certainly much bigger, but generally solvable under suitable guidance. There are many things to consider:different writing systems and encodings are handled on the WWW. Quite often, a page in an "exotic" language works in a particular cultural environment where that language is widely spoken but fails in the WWW context. This is mostly not a problem of knowledge gap among authors; rather, the problems are in servers, browsers, and authoring software.But the technology is not completely ready yet. Today the the goal of making characters show up on absolutely everyone?s browser is fundamentally unrealistic. World is aiming to that, but there are things to do. Character problems are practically important of course, for quite a many languages. One thing which will help the situation is coding systems like Unicode, which allows much larger character sets than normal ASCII character set.Some very small languages have particular problems with character codes: they might contain characters that do not belong even to Unicode. What needs to be done in the authoring side is he integration of existing technologies into mainstream software. Real multilingualism on the Web requires adequate tools for that. . It?s not that much a problem of producing pages but maintaining them. You can always pay some money to someone to translate your pages. Then what? Tomorrow you need to change a factual statement somewhere. How do you make sure it gets correctly changed in all versions?? Dealing with just two languages can be real hard, even if you have people who know the languages.

Legal things to consider

General web legal topics

Can other people copy my email or news postings? Rate this link
Copyright Myths FAQ: 10 big myths about copyright explained Rate this link
Cybercrime - Computer Crime and Intellectual Property Section from the Department of Justice of USA Rate this link
Digitaaliseen tietoon liittyv? lains??d?nt? - course material in Finnish Rate this link
European Union Copyright Directive Rate this link
Tekij?noikeus: vastauksia usein esitettyihin kysymyksiin - text in Finnish Rate this link
Tekij?noikeuslaki - text in Finnish Rate this link
The Copyright Website Rate this link
The Internet backlash - We are witnessing a backlash against the progressive potential of the Internet. With a mixture of technological fixes and legal pressures, large institutions are trying extend copyright protection in order to regain control over the flows of information. Rate this link
Web Law FAQ Rate this link

Linking

Bechtold: The Link Controversity Page - intended to provide an overview of the legal problems of using hyperlinks, inline images and frames in the WWW Rate this link
Links and Law: Myths Rate this link
Link Law: The Emerging Law of Internet Hyperlinks Rate this link
May I freely link to the Web sites of others? Rate this link

Web page usability issues

Article: don't mess with links - By changing the appearance of your links, you may compromise the usability of your web pages. In this article, Lois discusses why, and how to mitigate this. Rate this link
Johdatus web- k?ytett?vyyteen - introduction to web usability in Finnish Rate this link
Netsurfing Without a Monitor - article from Rate this link
Poynton's Typography and Design Rate this link
The Alertbox: Current Issues in Web Usability Rate this link
useit.com - Jakob Nielsen's Usability Website Rate this link
Use of ALT texts in IMGs - The ALT text is meant to provide alternative text, primarily for use when the image is not being displayed. Rate this link

Useful tools

HTML Editors

Amaya - Amaya is a highly advanced and powerful Web client which acts as both a browser and an authoring tool. It has been designed with the primary purpose of being a testbed for experimenting with, testing and demonstrating new specifications and extensions of Web protocols and formats. Rate this link
Carl Davis's HTML Editor Reviews - page to help to select what tool to use Rate this link
Emacs html-helper-mode - html-helper-mode is an emacs mode to make editing HTML files easier. Rate this link
Evrsoft First Page III - free HTML/web authoring tool, designed for DHTML authoring in mind Rate this link
HTMLjive - HTMLjive is a HTML editor written in Javascript. It allows an intermediate Internet user to create a HTML document by simply loading a Web page. No other programs are needed. Rate this link
Mozilla - web browser software with XML and CSS Level 2 support, has also HTML editing tool Rate this link
Netscape Navigator - the most popular web browser, has also a version which has HTML page editor "Composer" built-in Rate this link

Processing and converting material

Arachnophilia - Powerful Web Site Workshop with many options Rate this link
ASM to HTML converter - Java program written to display PIC assembly files in web pages Rate this link
Converting UNIX to DOS and DOS to UNIX Rate this link
gohtml.com - free on-line service which converts many document formats to HTML Rate this link
HTMLDOC - Convert HTML files to PDF or PostScript Rate this link
HTML Tidy - processes hard to read HTML files to simpler and easier to manipulate manually Rate this link
JTidy - JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, JTidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM interface to the document that is being processed, which effectively makes you able to use JTidy as a DOM parser for real-world HTML. Rate this link
Weblint - free HTML validation program written in Perl Rate this link
Website META Language (WML) - off-line HTML generation toolkit for Unix Rate this link
wvWare - wv is a library which allows access to Microsoft Word files. It can load and parse the word 2000, 97, 95 and 6 file formats. wv allows other programs access to Word documents for the purpose of converting them to other formats (for example HTML). Rate this link

Web page upload and download

Expect FAQ - FAQ on a very useful tool program for automating interctive programs like FTP Rate this link
GNU Wget - free utility for retrieving files from the internet using HTTP and FTP Rate this link
Gnu Wget - a freely available network utility to retrieve files using HTTP and FTP Rate this link
Lynx - most popular text only browser, useful in scripting and HTML to text conversions Rate this link
WebReaper - web crawler or spider, which can work its way through a website, downloading pages, pictures and objects that it finds so that they can be viewed locally, without needing to be connected to the internet Rate this link

Off-line browsing programs

Teleport Pro Rate this link

On-line tools

WebCab.de: Fetch Page - tool for getting web pages from server and seeing debug information, this tool can imitate many browsers and even WAP Rate this link

Scripting languages

Perl - scripting laguage used very much in web projects Rate this link
PHP: Hypertext Preprocessor Rate this link

WWW based discussion boards

Discus - free discussion board script Rate this link
Ezboard - online community service Rate this link
Phorum - Phorum is a web based discussion software written in PHP. Unlike some popular web boards, Phorum utilizes a database to manage its messages. Rate this link
WWWBoard - free discussion group making script used in many sites Rate this link

Converting HTML to other formats

HTMLDOC - Convert HTML files to PDF or PostScript Rate this link
html2ps - HTML to PostScript converter that gives you nice printouts of HTML pages than many web browser's build-in printing functions Rate this link

Adobe Acrobat (pdf) file generators

HTMLDOC - Convert HTML files to PDF or PostScript Rate this link
FreePDF - Free PDF is a utility which allows you to create PDFs just by printing to a "Create PDF" printer, from any Windows 9x/ME application FOR FREE! Rate this link

Other utilities

LinkScan/QuickCheck - link checking program demonstration page Rate this link
Linux Online WWW Utilities - links to many useful utilities Rate this link
Urlchange - a small Python script that allows you to check if a list of URLs has changed Rate this link
Website META Language (WML) - free and extensible Webdesigner's off-line HTML generation toolkit for Unix Rate this link

Tips on using web tools

Tidy/Emacs combo tip - lets you highlight a region of edited text and run Tidy on it Rate this link

Web browsers

Browser programs

Amaya - a browser/authoring tool by W3C Rate this link
BrowseX - BrowseX is a free Open Source, cross-platform Web Browser, Mail Program, Talk/Chat client and more. There is a database interface, SQLite, and BrowseX is easily extensible via an API. BrowseX has been written primarily in C and Tcl and clearly demonstrates that Linux applications can indeed bridge to Windows. Rate this link
Galeon - Galeon is a GNOME Web browser based on gecko (the mozilla rendering engine). It's fast, it has a light interface, and it is fully standards-compliant. Rate this link
iCab - iCab is a web browser for the Macintosh. Rate this link
K-Meleon - K-Meleon is a lite Web browser based on gecko (the mozilla rendering engine). It's fast, has a minimal interface, and it is fully standards-compliant. K-Meleon is released under the GNU General Public License. Rate this link
Konqueror - an Open Source web browser with HTML4.0 compliance, supporting Java applets, JavaScript, CSS1 and (partially) CSS2, as well as Netscape plugins Rate this link
Links - Links is text WWW browser with tables. It runs on Unix and OS/2. Rate this link
Lynx - most popular text only browser Rate this link
Netscape Navigator - This was the the most popular web browser Rate this link
Microsoft Internet Explorer - standard web browser which comes with Windows Rate this link
Mozilla - web browser software with XML and CSS Level 2 support Rate this link
Opera - A small and fast web browser available for many operating systems, supports also WML Rate this link
Teleport Pro - offline browser Rate this link
ViewML - an extremely small yet fully functional browser for embedded systems Rate this link
W3M - A terminal based WWW browser Rate this link
X-Smiles - This is an open xml-browser for exotic devices. This browser supports xhtml basic + css, xforms, XML parsing, XSLT Transforms, SVG, ECMAScript, SMIL and SIP. Rate this link

Plug-ins

BrowserWatch - Plug-In Plaza! - site for helping to find plug-ins Rate this link

Web material filters

Muffin - Muffin is s world wide web filtering system written in Java. Muffin on Unix, Windows 95/NT, and Macintosh systems. Muffin supports HTTP/0.9, HTTP/1.0, HTTP/1.1, and SSL (https). Rate this link
Proxomitron - Proxomitron is an universal veb filter program. The Proxomitron is a Win32 program designed to run under the various flavors of Microsoft's Windows OS. It's been tested on Windows 95, 98, Millennium, NT, and 2000. Rate this link
The Internet Junkbuster Proxy - blocks unwanted banner ads and protects your privacy from cookies and other threats Rate this link
WebWasher - filter program for Web pages to avoid unwanted content and for faster loading times Rate this link

Other useful utilities

Web Secretary - detects changes based on content analysis and can email the changed page to you with the new contents highlighted! Rate this link
WebStripper - makes it easy to copy websites to your hard disk for you to browse offline later, free advertising supported software Rate this link
WGET - GNU Wget is a freely available network utility to retrieve files from the World Wide Web using HTTP, HTTPS and FTP. It works non-interactively, thus enabling work in the background, after having logged off. Read also Rate this link
WGET for Windows - Wget is a freely available network utility to retrieve files from the World Wide Web using HTTP and FTP, the two most widely used Internet protocols. It works non-interactively. Rate this link

Off-line browsing programs

Teleport Pro - Teleport Pro is an all-purpose high-speed tool for getting data from the Internet. Teleport Pro claims that it finds all of the files on all of the sites. Teleport Pro is sold as shareware, so you can download and try out a fully functional (but capacity-limited) copy for testing. Rate this link

Browser tips

How to change the default editor for use in viewing the source of web pages - This page tells how to change that editor when using IE running on Windows. Rate this link

Resource pages

BROWSERS.COM - download new releases of latest versions of popular browsers Rate this link

Web caching information

A Web cache sits between Web servers (or origin servers) and a clientand watches requests for HTML pages, images and files come by, saving a copy for itself. Then, if there is another request for the same object, it will use the copy that it has, instead of asking the origin server for it again.This is used to reduce latency and to reduce traffic.

Cacheability Engine Query - This service allows you to check Web pages to see how caches will handle them. Rate this link
Caching Tutorial for Web Authors and Webmasters - informational document to make the concepts involved in caching understandable and applicable in real-world situations Rate this link

Security issues

Web security is a complex topic, encompassing computer system security, network security, authentication services, message validation, personal privacy issues, and cryptography. Web is a complicated networked environment where there are lotsof computers, technology components and users interworking.This situation can cause lots of potential security risks.The security risk are related (but not limited to) informationsecurity, user authentication, web server security and the securityof the computer used for web browsing.One important feature for web sites is the ability to restrict access to part or all of the site. This is often used on subscription sites, such as online webzines or other members-only services. It's also used on administrative portions of web sites. There are two kinds of user authentication methods in use. One is HTTP authentication; this is actually done by the web server itself.With HTTP authentication, you can password-protect a directory and the files within.The other kind of authentication is done by forms and CGIs, and often uses cookies to track user sessions. When you use HTTPS to connect to a web server, the client and browser will negotiate a common protocol to secure the channel. Typical secure channel protocols used in use nowadays are PCT 1.0, SSL 3.0, and SSL 2.0.In cases where the server and client have multiple supported protocols in common, the web server will attempt to secure the channel with one of the protocols it supports (the order they are selected depends on web server preferences).The SSL protocol uses security keys to guarantee the security of the communication (makes sure that the communicating ends are what they say, data is encrypted with some key etc.). Right now, the only secure key distribution mechanism on the Internet is the SSL key mechanism, whereby a group of companies (around 5) with keys that got into the original Netscape release (and other web browsers) essentially rule the roost, because Joe Average has no idea how to install a new root key in his browser. There are also security things related to web servers itself. Most servers are launched as root so that they can open up the low numbered port 80 (the standard HTTP port) and write to the log files. They then wait for an incoming connection on port 80. As soon as they receive this connection, they fork a child process to handle the request and go back to listening. The child process, meanwhile, changes its effective user ID to the user "nobody" and then proceeds to process the remote request. All actions taken in response to the request, such as executing CGI scripts or parsing server-side includes, are done as the unprivileged "nobody" user.There are always software bugs andother problems which cause scurity risks. The idea is to minumize the security risks.Even though you can't make your server completely safe, you can increase its security significantly in a Unix environment by running it in a chroot environment. The chroot system command places the server in a "silver bubble" in such a way that it can't see any part of the file system beyond a directory tree that you have set aside for it. The directory you designate becomes the server's new root "/" directory. Anything above this directory is inaccessible

Do "Cookies" Pose any Security Risks? - document describes what web cookies are, how they affect to system security and your privacy Rate this link
How SSL Works - This document explains how Netscape uses RSA public key cryptography for Internet security. Netscape's implementation of the Secure Sockets Layer (SSL) protocol employs the techniques discussed in this document. Rate this link
Internet Explorer security - Georgi Guninski Security Research Rate this link
Protecting Confidential Documents at Your Site Rate this link
The World Wide Web Security FAQ Rate this link
User Authentication and Password Protection of Web Pages Rate this link
W3C Security Resources - This page contains links to various aspects of Web and Internet security. Rate this link

Web page announcing and searching

Anyone who has looked into implementing web search unctionality has probably found it's not as easy as it might seem. Various factors conspire to make searching difficult. The venerable and indispensable grep and its ilk are effective for scanning through lines of text. But grep, egrep and their relations won't do everything for you. They won't search across lines, they won't show search results in a ranked order and their linear search algorithms don't lend themselves to searching larger volumes of data. HTML doesn't help the situation either. Its display-oriented features, idiosyncratic grammar and bevy of formatting and entity tags make it fairly difficult to parse correctly. At the other end of the data storage spectrum is data slotted into a database. The ubiquitous example is that of the SQL database, which allows somewhat sophisticated search facilities but usually is not particularly fast for searching. There are various technologies that makes nowadays to search web effectively. There are many free to use web search engines, so the average web user does not have to worry about how search technology works internally. The problems in implementing the search is generally seen by the webmaster when he/she wants to implement a search engine to a web site.

General information on search engines

A promotion guide - information ad tips how to get more visitors to yur site Rate this link
Are You a Good Googler? - It's become the search engine of choice, so it pays to know its secrets. Sharpen your searches with these tricks. Rate this link
A Standard for Robot Exclusion - how to use robots.txt file in your web server to stop search engines to index some of your pages you don't want to get indexed Rate this link
Search Engine Forums - discussion forums on search engine topics Rate this link
Search Engine Watch - lots of information on search engines Rate this link
Web and Internet Search Engine FAQ Rate this link
Web and Internet Search Engine FAQ Rate this link
Web Search - Internet resources for searching Rate this link

Information on successfully adding web pages to search engines

Metadata and Resource Description - metadata is machine understandable information for the web Rate this link
Meta-tagien tekeminen - information on META tags in Finnish Rate this link
META Tags - information about the document, such as keywords for indexing, the name of the author, and so on Rate this link
META - Metadata - information about page description info you can add to your web pages Rate this link
Search Engine Watch - tips about Internet search engines & search engine submission Rate this link

Controlling search engines and robots

A Method for Web Robots Control - how to control how web robots index your site and how to avoid indexing Rate this link

Search engines you can add to your site

Atomz.com - site search service to quickly add a powerful search engine to your Web site with no hardware or software to install Rate this link
FreeFind.com - free search engine to add to a site Rate this link
ht://Dig Internet Search Engines - world wide web indexing and searching system for a small domain or intranet available under GNU General Public License Rate this link
Isearch - Isearch is an open source software package for indexing and searching text documents. It supports full text and field based searching, relevance ranked results and Boolean queries, and it can index many kinds of documents, including HTML, mail folders, list digests, and text with SGML-style mark up. Isearch also includes CGI tools to provide a web interface for searching. Rate this link
Sitelevel - free site search service, includes banner advertising Rate this link
WebGlimpse - powerful indexing and query system for UNIX that allows you to search through all your files very quickly Rate this link
Matt's Script Archive - includes also search scripts Rate this link
Using SWISH-E To Index Your Site - information on a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other text files Rate this link

HTTP protocol information

HTTP has been in use by the World Wide Web since 1990 and its use has increased steadily over the years, mainly because it has proven useful as a generic middleware protocol.

Basic HTTP as defined in 1992 - very basic protocol which is easy to understand, later version are basically just extended version of this basic protocol Rate this link
HTTP - Hypertext Transfer Protocol - overview materials related to the W3C HTTP activity Rate this link
HTTP Specifications and Drafts Rate this link
HTTP Status Code Definitions Rate this link

Web publishing magazines

Free web hosting

The following web sites provide free web hosting services.Usually the free web hosting services will add advertisementsto the pages in them. For more details check what those serviceproviders have to offer.

Freedom2Suft - free web hosting with MySQL and PostgreSQL databases and PHP support Rate this link
free.prohosting.com Rate this link
homepages.go.com Rate this link
HyperMart.net Rate this link
Pinoysa Rate this link
www.50megs.com Rate this link
www.angelfire.com Rate this link
www.crosswinds.net Rate this link
www.geocities.com Rate this link
pSend.com Rate this link
Spaceports - PHP scripting and MySQL database support Rate this link
www.tripod.com Rate this link
www.virtualave.net Rate this link
www.xoom.com Rate this link
www.crosswinds.net Rate this link
www.mysitespace.com Rate this link

Non-conventional ways to access web

Accessing the Internet by E-Mail FAQ Rate this link
FAQ: Can someone without access to the Web get the same information by e-mail? Rate this link
Ruel's Set-Top Page - information on accessing Web using TV Rate this link

Tools for webmasters and web publishers

ht://Dig WWW Search Engine Software - for a small domain or intranet, released under the terms and conditions of the GNU Public License version 2.0 Rate this link
Net Freebies - links to hundreds of free services for webmasters Rate this link
Web Page Tools - HTML help and utilities, graphics, editors, etc. Rate this link
WebMacro - free open-source HTML template engine and back end servlet development framework Rate this link
Z Object Publishing Environment (ZOPE) - free, Open Source(tm) web application platform used for building dynamic web sites Rate this link

Web statistics

Analog - The most popular web server logfile analyser in the world Rate this link
Free counter web page - free web counters Rate this link
Why web usage statistics are (worse than) meaningless - Web usage statistics, such as those produced by programs such as analog cannot be used to make any inferences about the number of people who have read it. Rate this link

Page for information about making WWW pages

Foreword on web publishing

HTML Specifications

Reference material

Introductory material on HTML

Tips on writing HTML documents

Browser specific extensions

General information

XML Resource pages

Applications for XML

Using XML with your programs and applications

XML Tools

SMIL information

General information

SMIL Browsers

Other XML software

Character set problems related to Finland

General guides

Usability design

Site accessability

Use of frames

Use of tables

Handling errors

Multimedia

Specific site design tips

Bad things to avoid

Useful sites for more information

Why to validate

Page validators

Link checkers

General information

From form to mail

Information on writing form handling scripts

General information

Javascript tutorials

Javascript references

Javascrit books

Example source code resources

Related technologies

General web server information

Using other web server software

Accessory software

General information

Information on developing CGI scripts

CGI scripts and script collections

Solving problems

Display properties

Pictures

Video and web

Audio and web

Multimedia player plug-ins

General web legal topics

Linking

HTML Editors

Processing and converting material

Web page upload and download

Off-line browsing programs

On-line tools

Scripting languages

WWW based discussion boards

Converting HTML to other formats

Adobe Acrobat (pdf) file generators

Other utilities

Browser programs

Plug-ins

Web material filters

Other useful utilities

Off-line browsing programs

Browser tips

Resource pages

General information on search engines

Information on successfully adding web pages to search engines

Controlling search engines and robots

Search engines you can add to your site

Other useful search engine links