| Document: FTS-0005 | Version: 003 | Date: February 7, 1996 | Maintainer: David Nugent, 3:632/348@fidonet The Distribution Nodelist Originally by Ben Baker Amended by Rick Moore, 1:115/333@FidoNet, February 5, 1989 Amended by David Nugent, 3:632/348@FidoNet, February 27, 1996 | Copyright 1986-1996 by the FidoNet Technical Standards Committee. All rights reserved. Duplication and/or distribution permitted for non-commercial purposes only. This document supersedes and replaces the document known under | the names of FSC002, FSC-0002, and FTS-0002. Significant changes, | which excludes mere formatting changes, to the previous version | of this document have been "redlined" (marked with a vertical | bar in the leftmost column). This document defines the format and content of the nodelist for the Public FidoNet Network (PFN) as published on Friday of each | week. This format is historically known as the "St. Louis nodelist | format". The PFN is an international network of independently owned electronic mail systems, most with interlocking electronic bulletin board systems. The distribution nodelist, or simply "nodelist", is the glue which holds the network together. It is the PFN's "phone book" and it defines the top-level network | structure and is the means by which FidoNet retains its integrity | as a point-to-point mail network. | THE NODELIST The nodelist is published as an ASCII text file named NODELIST.nnn, where nnn is a three digit number representing the | day-of-year of the Friday publication date, with zeros filling | positions to the left if necessary. This file is packed into a | archive file named NODELIST.?nn, where 'nn' are the last two | digits of day-of-year, and the character at the position of the | '?' indicating the type of compression used. Conventions as to | which compression method is used for the distributed nodelist is | a matter of local policy and is usually determined by each zone's | Zone Coordinator. As stated above, NODELIST.nnn is an ASCII text file. It contains two kinds of lines; comment lines and data lines. Each line is terminated with an ASCII carriage return and line feed character sequence, and contains no trailing white-space (spaces, tabs, | etc.). The file is terminated with a DOS end-of-file character | (character value 26 decimal, or "control-Z"). Comment lines contain a semicolon (;) in the first character position followed by zero or more alphabetic characters called "interest flags". A program which processes the nodelist may use comment interest flags to determine the disposition of a comment line. The remainder of a comment line (with one exception, | treated below) is free-form ASCII text. There are five types of | comments flags: | ;S This is of particular interest to Sysops | ;U This is of particular interest to BBS users | ;F This should appear in any formatted "Fido List" | ;A This is of general interest (shorthand for ;SUF) | ;E This is an error message inserted by the nodelist generator | ; This comment may be ignored by a nodelist processor The first line of a nodelist is a special comment line containing identification data for the particular edition of the nodelist. The following is an example of the first line of a nodelist: ;A FidoNet Nodelist for Friday, July 3, 1987 -- Day number 184 : 15943 This line contains the general interest flag, the day, date, and | three-digit (zero-filled) day-of-year number of publication, and ends with a 5 digit decimal number with leading zeros, if necessary. This number is the decimal representation of a check value derived as follows: Beginning with the first character of the second line, a 16 bit cyclic redundancy check (CRC) is calculated for the entire file, including carriage return and line feed characters, but not including the terminating EOF character. The check polynomial used is the same one used for many file transfer protocols: 2**16 + 2**12 + 2**5 + 2**0 The CRC may be used to verify that the file has not been edited. The importance of this will become evident in the discussion of NODEDIFF, below. CRC calculation techniques are well documented | in various technical references, and will not be treated further here. The content of the remaining comments in the nodelist are intended to be informative. Beyond the use of interest flags for distribution, a processing program need not have any interest in them. A nodelist data line contains eight variable length "fields" separated by commas (,). No space characters are allowed in a data line, and underscore characters are used in lieu of spaces. The term "alphanumeric character" is defined as the portion of the ASCII character set from 20 hex through 7E hex, inclusive. The following discussion defines the contents of each field in a data line. Field 1: Keyword The keyword field may be empty, or may contain one of the following: Zone Begins the definition of a geographic zone and define its coordinator. All the data lines following a line with the "Zone" keyword down to, but not including, the next occurrence of a "Zone" keyword, are regions, networks, and | nodes within the defined zone. Node entries defined | immediately after the "Zone" keyword and before the next | region or host entry are known as zone adminstrative nodes. | These are allocated by the Zone Coordinator for use by nodes | in the entire zone; for example, mail gateways between | FidoNet zones. Region Begins the definition of a geographic region and defines its coordinator. All the data lines following a line with the "Region" keyword down to, but not including, the next occurrence of a "Zone", "Region", or "Host" keyword, are independent nodes within the defined region. Host Begins the definition of a local network and defines its | network coordinator. All the data lines following a line with the Host keyword down to, but not including, the next occurrence of a "Zone", "Region", or "Host" keyword, are local nodes, members of the defined local network. Hub Begins the definition of a routing sub-unit within a multi-level local network. The hub is the routing focal point for nodes listed below it until the next occurrence of a "Zone", "Region", "Host", or "Hub" keyword. The hub entry MUST be a redundant entry, with a unique number, for one of the nodes listed below it, within its hub segment. This is necessary because some nodelist processors eliminate these entries in all but the local network. Pvt Defines a private node with unlisted number. Private nodes are only allowed as members of local networks. Hold Defines a node which is temporarily down. Mail may be sent to it and is held by its host or coordinator. Down Defines a node which is not operational. Mail may NOT be sent to it. This keyword may not be used for longer than two weeks on any single node, at which point the "down" node is to be removed from the nodelist.Go Back| The field contains no text (not the sequence " "), | and defines a normal node entry. | Only one of these may be used in any individual data line. Field 2: Zone/Region/Net/Node number This field contains only numeric digits and is a number in the range of 0 to 32767. If the line had the "Zone", "Region", or "Host" keyword, the number is the zone, net, or region number, and the node has an implied node number of 0. Otherwise, the number is the node number. The zone number, region or net number, and the node number, taken together, constitute a node's FidoNet address. Zone numbers must be unique. Region or net numbers must be unique within their zone, hub numbers unique be within their net, node | numbers unique within their net (and region, for regional | independent nodes, zone for zone administrative entries). Duplicate node numbers under different hubs within the same net are not allowed. Field 3: Node name This field may contain any alphanumeric characters other than commas and spaces. Underscores are used to represent spaces, and a comma delimits the end of the field. This is the name by which the node is known, usually as determined by the node or the coordinator responsible for compiling the segment. Field 4: Location This field may contain any alphanumeric characters other than commas and spaces. Underscores are used to represent spaces. This field contains the location of the node. It is usually expressed as the primary local location (town, suburb, city, etc.) plus the identifier of the regional geopolitical administrative district (state, province, department, county, etc.). Wherever possible, standard postal abbreviations for the major regional district should be used (IL, BC, NSW, etc.). Field 5: Sysop name This field may contain any alphanumeric characters other than commas and spaces. Underscores are used to represent spaces. This is the name of the system operator. Field 6: Phone number This field contains at least three and usually four numeric sub-fields separated by dashes (-). The fields are country code, city or area code, exchange code, and number. The various parts of the phone number are frequently used to derive cost and routing information, as well as what number is to be dialed. A typical example of the data in a phone number field is 1-800-555-1212, corresponding to country 1 (USA), area 800 (inbound WATS), exchange 555, and number 1212. Alternatively, this field may contain the notation "-Unpublished-" in the case of a private node. In this case, the | keyword "Pvt" must appear at the start of the line. Field 7: Baud rate This field contains one of the values: 300, 1200, 2400, 9600, | 19200, or 38400. | This baud rate is indicative only of the maximum baud rate that | may be expected when connecting to a node and is generally of use | only where a calling node needs to adjust the baud rate used to | dial to the caller's modem speed in order to achieve a | connection, a requirement that with modem technology available in | 1996 is rarely if ever needed. This information is largely | superseded by modem protocol flags (see next section) where any | two nodes using a common protocol may have other expectations | with regards to actual transfer rates. Use of the baud rate field | alone is therefore depreciated. Field 8 - Flags This optional field contains data about the specific operation of the node, such as file requests, modem protocol supported, etc. Any text following the seventh comma on a data line is taken collectively to be the flags field. The required format is zero or more sub-fields, separated by commas. Each sub-field consists of a flag, possibly followed by a value. The following flags define special operating conditions: Flag Meaning CM Node accepts mail 24 hours a day MO Node does not accept human callers | LO Node accepts calls only from valid listed node | numbers in the current FidoNet nodelist The following flags define modem protocols supported: Flag Meaning | V21 ITU-T V21 300 bps full duplex | V22 ITU-T V22 1200 bps full duplex | V29 ITU-T V29 9600 bps half duplex | V32 ITU-T V32 9600 bps full duplex | V32b ITU-T V32bis 14400 bps full duplex | V33 ITU-T V33 | V34 ITU-T V34 28800 bps full duplex H96 Hayes V9600 HST USR Courier HST up to 9600 | H14 USR Courier HST up to 14400 | H16 USR Courier HST up to 16800 MAX Microcom AX/96xx series PEP Packet Ensemble Protocol | CSP Compucom Speedmodem | ZYX Zyxel series | VFC V.Fast Class | V32T V.32 Terbo NOTE: Many V22 modems also support Bell 212A. If no modem flag is given, Bell 212A is assumed for 1200 bps systems, ITU-T V22bis is assumed for 2400 bps systems. The following flags define type of error correction available. A separate error correction flag should not be used when the error correction type can be determined by the modem flag. For | instance, a modem flag of HST implies MNP, V32b implies V32 and | V42b implies V42. Therefore MNP+HST, H14+MNP, H16+MNP, V32+V32b | and V42+V42b flag pairs are redundant and should not be used. Flag Meaning MNP Microcom Networking Protocol error correction | V42 ITU-T LAP-M error correction w/fallback to MNP 1-4 | V42b ITU-T LAP-M error correction w/fallback to MNP 1-5 The following flags define the type(s) of compression of mail packets supported. Flag Meaning MN No compression supported | NOTE: While FidoNet nodes usually exchange mail | using a variety of different file compression | formats negotiated between individual systems, the | presence of this flag indicates the INABILITY TO | RECEIVE MAIL compressed using the SEA ARC version 5 | compression format and/or named according to the | ARCmail 0.6 mail bundle naming method. This is, by | convention, the most common mail compression format | in use within FidoNet. The presence of this flag | would normally indicate that all mail should be sent | uncompressed unless there is some overriding | arrangement with the receiving system. The following flags indicate the types of file and file update requests supported. Flag Meaning XA Bark and WaZOO file/update requests XB Bark file/update requests, WaZOO file requests | XC Bark file requests, WaZOO file file/update XP Bark file/update requests XR Bark and WaZOO file requests XW WaZOO file requests | XX WaZOO file/update requests The following flag defines gateways to other domains (mail networks). Flag Meaning Gx..x Gateway to domain 'x..x', where 'x..x` is a string of alphanumeric characters. NOTE: Valid values for 'x..x' are assigned by the FidoNet | International Coordinator or the person appointed as | Internetworking Coordinator by the FidoNet | International Coordinator. Current valid values of 'x..x' may usually be found in the notes at the end | of the current FidoNet nodelist. The most common | gateway flag is "GUUCP", to denote a gateway to the | Internet mail system that gates on behalf of the | fidonet.org internet domain. The following flags define the dedicated mail periods supported. They have the form "#nn" or "!nn" where nn is the UTC hour the mail period begins, '#' indicates Bell 212A compatibility, and '!' indicates incompatibility with Bell 212A. Flag Meaning | #01 Zone 5 mail hour (01:00 - 02:00 UTC) #02 Zone 2 mail hour (02:30 - 03:30 UTC) | #03 Zone 4 mail hour (08:00 - 09:00 UTC) #09 Zone 1 mail hour (09:00 - 10:00 UTC) #18 Zone 3 mail hour (18:00 - 19:00 UTC) | #20 Zone 6 mail hour (20:00 - 21:00 UTC) NOTE: When applicable, the mail period flags may be strung together with no intervening commas, e.g.. "#02#09" | or "!02!09". Only mail hours other than that standard within a node's zone should be given. Since observance of mail hour within one's zone is mandatory, it should not be indicated. The following flag defines user-specific values. If present, this flag MUST be the last flag present in a nodelist entry. Flag Meaning Ux..x A user-specified string, which may contain any alphanumeric character except blanks. This string may contain one to thirty-two characters of information that may be used to add user-defined data to a specific nodelist entry. | NOTE: Ux..x flags are the mechanism by which new flags may | be experimentally introduced into the nodelist for a | trial period to assess their worth. They are | therefore of a temporary nature, and after their | introduction they are eventually either promoted | to a non-U flag or dropped from use altogether. The FTSC recognizes that the FidoNet International Coordinator is the ultimate authority over what appears in the FidoNet nodelist. Also, FTSC is by definition a deliberative body, and adding or changing a flag may take a considerable amount of time. Therefore, the FidoNet International Coordinator may temporarily make changes or additions to the flags as defined in this document. The FidoNet International Coordinator will then consult with FTSC over the changes needed to this document to reflect these temporary changes. The following are examples of nodelist data lines: Host,102,SOCALNET,Los_Angeles_CA,Richard_Martz,1-213-874-9484,2400,XP ,101,Rainbow_Data,Culver_City_CA,Don_Brauns,1-213-204-2996,2400, | THE NODEDIFF | With more than thirty-five thousand nodes as of this date (1996), | the nodelist, even in archive form, is a document of substantial | size. Since distribution of the nodelist occurs via electronic file transfer, this file is NOT routinely distributed. Instead, | when a new nodelist is prepared weekly, it is compared with the previous week's nodelist, and a file containing only the differences is created and distributed. | The distribution difference file, called NODEDIFF.nnn, where nnn is the day-of-year of publication, is actually an editing script which will transform the previous week's nodelist into the current nodelist. A definition of its format follows: The first line of NODEDIFF.nnn is an exact copy of the first line | of LAST WEEK'S nodelist (i.e. the first line of the nodelist to | which the current difference file applies). This is used as a first-level confidence check to insure that the correct file is being edited. The second and subsequent lines are editing commands and data. There are three editing commands and all have the same format: <command><number> <command> is a 1 letter command, one of A, C, or D. <number> is a decimal number greater than zero, and defines the number of lines to be operated on by the command. Each command appears on a line by itself. The commands have the following meanings: Ann Add the following nn lines to the output file. Cnn Copy nn unchanged lines from the input to the output file. Dnn Delete (or skip) nn lines from the input file. The following illustrate how the first few lines of a | hypothetical NODEDIFF.213 might look: ;A Friday, July 25, 1986 -- Day number 206 : 27712 D2 A2 ;A Friday, August 1, 1986 -- Day number 213 : 05060 ;A C5 This fragment illustrates all three editing commands. The first line is the first line from the previous nodelist, NODELIST.206. The next line says "delete the first two lines" from NODELIST.206. These are the identification line and the line following it. The next command says "add the next two lines" to NODELIST.213 at the "current" location. The two data lines are followed by a command which says "copy five unchanged lines" from NODELIST.206 to NODELIST.213. Notice that the first line added | will ALWAYS contain the new nodelist CRC, so that the software | applying the changes to the old nodelist may check the result of | its editing. Since only the differences will be distributed, it is important to insure the accuracy of the newly created nodelist. This is the function of the CRC mentioned above. It is sufficient for a program designed to perform the above edits to pick the CRC value from the first line added to the output file, then compute the CRC of the rest of the output file. If the two CRCs do not agree, one of the input files has been corrupted. If they do agree, the probability is very high (but not 100%) that the output file is accurate. For actual distribution, NODEDIFF.nnn is packed into an archive | file named NODEDIFF.?nn, where 'nn' are the last two digits of | day-of-year, and '?' indicates the compression format used. | NODELIST COMPILATION | This section is included for tutorial reasons and is not intended | as a definition of any specific method by which FidoNet MUST | compile its weekly nodelist. It merely represents an attempt to | document the method by which it currently does so. It is intended | to be explanatory, and seeks to answer commonly asked questions, | such as how the nodelist is compiled and where the information | comes from, why the nodelists used in different FidoNet zones are | not the same document, and why the difference file generated for | use in one FidoNet zone cannot be applied to the nodelist | generated for use in a different zone, even though the week | numbers match. | Nodelists are compiled via a distributed method, which follows | the same structure as the FidoNet coordinator hierarchy. At the | lowest level, network coordinators maintain a list of the nodes | in their network and are responsible for the addition, removal | and correction of individual node's listings in their "segment" | (as portions of the full nodelist are called). In some larger | networks, it is common for this job to be shared with hub | coordinators appointed by the net coordinator, though the | responsibility for those hub segments still remains with the | network coordinator. | At a nominated day during the week, before the regional level | segment is submitted to the zone coordinator, individual net | coordinators submit their segments to the regional coordinator | who subsequently compiles these segments and transmits the merged | copy to the zone coordinator. These are combined by the zone | coordinator with the separate segments of other zones and | compiled into that zone's version of the world nodelist. This | world nodelist is then compared with the previous week's version, | a difference file is generated and subsequently distributed | throughout the zone. | In some cases, in the interest of saving in transmission times | and therefore costs, the compilation process itself may be better | served by the submission of DIFFERENCE FILES rather than full | net- or region-level segments. Each coordinator therefore retains | a copy of the previously submitted segments and applies | difference files to those to derive the new one. This process is | exactly identical to the NODEDIFF/NODELIST scenario described | earlier in this document, with the same first line and CRC | validation method used to guard the integrity of the nodelist | segments. | For a number of reasons, it is important that publication of the | nodelist be as timely as possible. These reasons include: the | nodelist is a definitive list of valid FidoNet addresses that may | receive mail, and must therefore be as correct and up-to-date as | possible to save nodes the unnecessary expense of mail routed to | possibly non-existing addresses; the nodelist contains the list | of telephone numbers that may be called by any user of the | FidoNet nodelist and should therefore be accurate so as not to | unduly annoy owners of those phone numbers should a listed node | go down and an unsuspecting telephone subscriber inherit the same | telephone number. | Given this constraint, the expense of international calls and the | fact that FidoNet is a worldwide network that exists in many time | zones, it may be unreasonable to expect the compilation of the | nodelist to be delayed until each zone coordinator can transmit | their most up-to-date zone segment to a central authority for | compilation and subsequent redistribution in any week. For the | sake of expedience, each zone instead maintains its own separate | world nodelist which contains a compilation of the current zone's | latest segments and including the most current copy to hand of | all other FidoNet zone's segments. The zone level nodelist | generated each week by each zone coordinator is then transmitted | to all other zone coordinators for inclusion into their separate | world nodelist as timing permits. | In theory, then, the only difference between nodelists | distributed in each zone in any week are accounted for by timing | differences in the exchange of each zone's separate segment. In | practice, other constraints may interfere with timeliness, such | as the difficulty and expense of international telephonic | communications. Also, another point of variance is introduced by | the fact that each zone usually includes its own zone segment | first into its world nodelist to assist - amongst other things - | software that uses the nodelist for index generation. Some | software in common use in FidoNet indexes the nodelist according | to its sequential order (e.g. version 5 and 6 compiled nodelist | formats), and including the current zone first before others will | have a beneficial effect on software performance.