| Document: FTS-0005
 | Version:  003
 | Date:     February 7, 1996
 | Maintainer: David Nugent, 3:632/348@fidonet


                       The Distribution Nodelist

                        Originally by Ben Baker
       Amended by Rick Moore, 1:115/333@FidoNet, February 5, 1989
     Amended by David Nugent, 3:632/348@FidoNet, February 27, 1996


|   Copyright 1986-1996 by the FidoNet Technical Standards Committee.
    All rights reserved. Duplication and/or distribution permitted
    for non-commercial purposes only.

    This document supersedes and replaces the document known under
|   the names of FSC002, FSC-0002, and FTS-0002. Significant changes,
|   which excludes mere formatting changes, to the previous version
|   of this document have been "redlined" (marked with a vertical
|   bar in the leftmost column).

    This document defines the format and content of the nodelist for
    the Public FidoNet Network (PFN) as published on Friday of each
|   week. This format is historically known as the "St. Louis nodelist
|   format".

    The PFN is an international network of independently owned
    electronic mail systems, most with interlocking electronic
    bulletin board systems. The distribution nodelist, or simply
    "nodelist", is the glue which holds the network together. It is
    the PFN's "phone book" and it defines the top-level network
|   structure and is the means by which FidoNet retains its integrity
|   as a point-to-point mail network.


| THE NODELIST

    The nodelist is published as an ASCII text file named
    NODELIST.nnn, where nnn is a three digit number representing the
|   day-of-year of the Friday publication date, with zeros filling
|   positions to the left if necessary. This file is packed into a
|   archive file named NODELIST.?nn, where 'nn' are the last two
|   digits of day-of-year, and the character at the position of the
|   '?' indicating the type of compression used. Conventions as to
|   which compression method is used for the distributed nodelist is
|   a matter of local policy and is usually determined by each zone's
|   Zone Coordinator.

    As stated above, NODELIST.nnn is an ASCII text file. It contains
    two kinds of lines; comment lines and data lines. Each line is
    terminated with an ASCII carriage return and line feed character
    sequence, and contains no trailing white-space (spaces, tabs,
|   etc.). The file is terminated with a DOS end-of-file character
|   (character value 26 decimal, or "control-Z").

    Comment lines contain a semicolon (;) in the first character
    position followed by zero or more alphabetic characters called
    "interest flags". A program which processes the nodelist may use
    comment interest flags to determine the disposition of a comment
    line. The remainder of a comment line (with one exception,
|   treated below) is free-form ASCII text. There are five types of
|   comments flags:

|       ;S This is of particular interest to Sysops
|       ;U This is of particular interest to BBS users
|       ;F This should appear in any formatted "Fido List"
|       ;A This is of general interest (shorthand for ;SUF)
|       ;E This is an error message inserted by the nodelist generator
|       ; This comment may be ignored by a nodelist processor

    The first line of a nodelist is a special comment line containing
    identification data for the particular edition of the nodelist.
    The following is an example of the first line of a nodelist:

    ;A FidoNet Nodelist for Friday, July 3, 1987 -- Day number 184 : 15943

    This line contains the general interest flag, the day, date, and
|   three-digit (zero-filled) day-of-year number of publication, and
    ends with a 5 digit decimal number with leading zeros, if
    necessary. This number is the decimal representation of a check
    value derived as follows:

        Beginning with the first character of the second line, a
        16 bit cyclic redundancy check (CRC) is calculated for the
        entire file, including carriage return and line feed
        characters, but not including the terminating EOF
        character. The check polynomial used is the same one used
        for many file transfer protocols:

                    2**16 + 2**12 + 2**5 + 2**0

    The CRC may be used to verify that the file has not been edited.
    The importance of this will become evident in the discussion of
    NODEDIFF, below. CRC calculation techniques are well documented
|   in various technical references, and will not be treated further
    here.

    The content of the remaining comments in the nodelist are
    intended to be informative. Beyond the use of interest flags for
    distribution, a processing program need not have any interest in
    them.

    A nodelist data line contains eight variable length "fields"
    separated by commas (,). No space characters are allowed in a
    data line, and underscore characters are used in lieu of spaces.
    The term "alphanumeric character" is defined as the portion of
    the ASCII character set from 20 hex through 7E hex, inclusive.
    The following discussion defines the contents of each field in a
    data line.


  Field 1: Keyword

    The keyword field may be empty, or may contain one of the
    following:

    Zone

        Begins the definition of a geographic zone and define its
        coordinator. All the data lines following a line with the
        "Zone" keyword down to, but not including, the next
        occurrence of a "Zone" keyword, are regions, networks, and
|       nodes within the defined zone.  Node entries defined
|       immediately after the "Zone" keyword and before the next
|       region or host entry are known as zone adminstrative nodes.
|       These are allocated by the Zone Coordinator for use by nodes
|       in the entire zone; for example, mail gateways between
|       FidoNet zones.

    Region

        Begins the definition of a geographic region and defines
        its coordinator. All the data lines following a line with
        the "Region" keyword down to,  but not including,  the
        next occurrence of a "Zone",  "Region",  or "Host"
        keyword, are independent nodes within the defined region.

    Host

        Begins the definition of a local network and defines its
|       network coordinator. All the data lines following a line
        with the Host keyword down to, but not including, the
        next occurrence of a "Zone", "Region",  or "Host" keyword,
        are local nodes, members of the defined local network.

    Hub

        Begins the definition of a routing sub-unit within a
        multi-level local network. The hub is the routing focal
        point for nodes listed below it until the next occurrence
        of a "Zone", "Region", "Host", or "Hub" keyword. The hub
        entry MUST be a redundant entry, with a unique number, for
        one of the nodes listed below it, within its hub segment.
        This is necessary because some nodelist processors
        eliminate these entries in all but the local network.

    Pvt

        Defines a private node with unlisted number. Private nodes
        are only allowed as members of local networks.

    Hold

        Defines a node which is temporarily down. Mail may be sent
        to it and is held by its host or coordinator.

    Down

        Defines a node which is not operational. Mail may NOT be
        sent to it. This keyword may not be used for longer than
        two weeks on any single node, at which point the "down"
        node is to be removed from the nodelist.

    

|       The field contains no text (not the sequence ""),
|       and defines a normal node entry.

|   Only one of these may be used in any individual data line.


  Field 2: Zone/Region/Net/Node number

    This field contains only numeric digits and is a number in the
    range of 0 to 32767. If the line had the "Zone", "Region", or
    "Host" keyword, the number is the zone, net, or region number,
    and the node has an implied node number of 0. Otherwise, the
    number is the node number. The zone number, region or net number,
    and the node number, taken together, constitute a node's FidoNet
    address.

    Zone numbers must be unique. Region or net numbers must be unique
    within their zone, hub numbers unique be within their net, node
|   numbers unique within their net (and region, for regional
|   independent nodes, zone for zone administrative entries). Duplicate
    node numbers under different hubs within the same net are not
    allowed.


  Field 3: Node name

    This field may contain any alphanumeric characters other than
    commas and spaces. Underscores are used to represent spaces, and
    a comma delimits the end of the field. This is the name by which
    the node is known, usually as determined by the node or the
    coordinator responsible for compiling the segment.


  Field 4: Location

    This field may contain any alphanumeric characters other than
    commas and spaces. Underscores are used to represent spaces. This
    field contains the location of the node. It is usually expressed
    as the primary local location (town, suburb, city, etc.) plus the
    identifier of the regional geopolitical administrative district
    (state, province, department, county, etc.). Wherever possible,
    standard postal abbreviations for the major regional district
    should be used (IL, BC, NSW, etc.).


  Field 5: Sysop name

    This field may contain any alphanumeric characters other than
    commas and spaces. Underscores are used to represent spaces. This
    is the name of the system operator.


  Field 6: Phone number

    This field contains at least three and usually four numeric
    sub-fields separated by dashes (-). The fields are country code,
    city or area code, exchange code, and number. The various parts
    of the phone number are frequently used to derive cost and
    routing information, as well as what number is to be dialed. A
    typical example of the data in a phone number field is
    1-800-555-1212, corresponding to country 1 (USA), area 800
    (inbound WATS), exchange 555, and number 1212.

    Alternatively, this field may contain the notation
    "-Unpublished-" in the case of a private node. In this case, the
|   keyword "Pvt" must appear at the start of the line.


  Field 7: Baud rate

    This field contains one of the values: 300, 1200, 2400, 9600,
|   19200, or 38400.

|   This baud rate is indicative only of the maximum baud rate that
|   may be expected when connecting to a node and is generally of use
|   only where a calling node needs to adjust the baud rate used to
|   dial to the caller's modem speed in order to achieve a
|   connection, a requirement that with modem technology available in
|   1996 is rarely if ever needed. This information is largely
|   superseded by modem protocol flags (see next section) where any
|   two nodes using a common protocol may have other expectations
|   with regards to actual transfer rates. Use of the baud rate field
|   alone is therefore depreciated.


  Field 8 - Flags

    This optional field contains data about the specific operation of
    the node, such as file requests, modem protocol supported, etc.
    Any text following the seventh comma on a data line is taken
    collectively to be the flags field. The required format is zero
    or more sub-fields, separated by commas. Each sub-field consists
    of a flag, possibly followed by a value.

    The following flags define special operating conditions:

       Flag    Meaning

       CM      Node accepts mail 24 hours a day
       MO      Node does not accept human callers
|      LO      Node accepts calls only from valid listed node
|              numbers in the current FidoNet nodelist


    The following flags define modem protocols supported:

       Flag    Meaning

|      V21     ITU-T V21      300 bps full duplex
|      V22     ITU-T V22     1200 bps full duplex
|      V29     ITU-T V29     9600 bps half duplex
|      V32     ITU-T V32     9600 bps full duplex
|      V32b    ITU-T V32bis 14400 bps full duplex
|      V33     ITU-T V33
|      V34     ITU-T V34    28800 bps full duplex

       H96     Hayes V9600
       HST     USR Courier HST up to 9600
|      H14     USR Courier HST up to 14400
|      H16     USR Courier HST up to 16800
       MAX     Microcom AX/96xx series
       PEP     Packet Ensemble Protocol
|      CSP     Compucom Speedmodem
|      ZYX     Zyxel series
|      VFC     V.Fast Class
|      V32T    V.32 Terbo

       NOTE:   Many V22 modems also support Bell 212A.

    If no modem flag is given, Bell 212A is assumed for 1200 bps
    systems, ITU-T V22bis is assumed for 2400 bps systems.


    The following flags define type of error correction available. A
    separate error correction flag should not be used when the error
    correction type can be determined by the modem flag. For
|   instance, a modem flag of HST implies MNP, V32b implies V32 and
|   V42b implies V42. Therefore MNP+HST, H14+MNP, H16+MNP, V32+V32b
|   and V42+V42b flag pairs are redundant and should not be used.

        Flag    Meaning

        MNP     Microcom Networking Protocol error correction
|       V42     ITU-T LAP-M error correction w/fallback to MNP 1-4
|       V42b    ITU-T LAP-M error correction w/fallback to MNP 1-5


    The following flags define the type(s) of compression of mail
    packets supported.

        Flag    Meaning

        MN      No compression supported

|       NOTE:   While FidoNet nodes usually exchange mail
|               using a variety of different file compression
|               formats negotiated between individual systems, the
|               presence of this flag indicates the INABILITY TO
|               RECEIVE MAIL compressed using the SEA ARC version 5
|               compression format and/or named according to the
|               ARCmail 0.6 mail bundle naming method. This is, by
|               convention, the most common mail compression format
|               in use within FidoNet. The presence of this flag
|               would normally indicate that all mail should be sent
|               uncompressed unless there is some overriding
|               arrangement with the receiving system.

    The following flags indicate the types of file and file update
    requests supported.

        Flag    Meaning

        XA      Bark and WaZOO file/update requests
        XB      Bark file/update requests, WaZOO file requests
|       XC      Bark file requests, WaZOO file file/update
        XP      Bark file/update requests
        XR      Bark and WaZOO file requests
        XW      WaZOO file requests
|       XX      WaZOO file/update requests


    The following flag defines gateways to other domains (mail
    networks).

        Flag    Meaning

        Gx..x   Gateway to domain 'x..x', where 'x..x` is a string
                of alphanumeric characters.

        NOTE:   Valid values for 'x..x' are assigned by the FidoNet
|               International Coordinator or the person appointed as
|               Internetworking Coordinator by the FidoNet
|               International Coordinator. Current valid values of
                'x..x' may usually be found in the notes at the end
|               of the current FidoNet nodelist. The most common
|               gateway flag is "GUUCP", to denote a gateway to the
|               Internet mail system that gates on behalf of the
|               fidonet.org internet domain.


    The following flags define the dedicated mail periods supported.
    They have the form "#nn" or "!nn" where nn is the UTC hour the mail
    period begins, '#' indicates Bell 212A compatibility, and '!'
    indicates incompatibility with Bell 212A.

        Flag    Meaning

|       #01     Zone 5 mail hour (01:00 - 02:00 UTC)
        #02     Zone 2 mail hour (02:30 - 03:30 UTC)
|       #03     Zone 4 mail hour (08:00 - 09:00 UTC)
        #09     Zone 1 mail hour (09:00 - 10:00 UTC)
        #18     Zone 3 mail hour (18:00 - 19:00 UTC)
|       #20     Zone 6 mail hour (20:00 - 21:00 UTC)

        NOTE:   When applicable, the mail period flags may be strung
                together with no intervening commas, e.g.. "#02#09"
|               or "!02!09". Only mail hours other than that
                standard within a node's zone should be given. Since
                observance of mail hour within one's zone is
                mandatory, it should not be indicated.


    The following flag defines user-specific values. If present,
    this flag MUST be the last flag present in a nodelist entry.

        Flag    Meaning

        Ux..x   A user-specified string, which may contain any
                alphanumeric character except blanks. This string
                may contain one to thirty-two characters of
                information that may be used to add user-defined
                data to a specific nodelist entry.

|       NOTE:   Ux..x flags are the mechanism by which new flags may
|               be experimentally introduced into the nodelist for a
|               trial period to assess their worth. They are
|               therefore of a temporary nature, and after their
|               introduction they are eventually either promoted
|               to a non-U flag or dropped from use altogether.

    The FTSC recognizes that the FidoNet International Coordinator is
    the ultimate authority over what appears in the FidoNet nodelist.
    Also, FTSC is by definition a deliberative body, and adding or
    changing a flag may take a considerable amount of time.
    Therefore, the FidoNet International Coordinator may temporarily
    make changes or additions to the flags as defined in this
    document. The FidoNet International Coordinator will then consult
    with FTSC over the changes needed to this document to reflect
    these temporary changes.


    The following are examples of nodelist data lines:

    Host,102,SOCALNET,Los_Angeles_CA,Richard_Martz,1-213-874-9484,2400,XP
    ,101,Rainbow_Data,Culver_City_CA,Don_Brauns,1-213-204-2996,2400,


| THE NODEDIFF

|   With more than thirty-five thousand nodes as of this date (1996),
|   the nodelist, even in archive form, is a document of substantial
|   size. Since distribution of the nodelist occurs via electronic
    file transfer, this file is NOT routinely distributed. Instead,
|   when a new nodelist is prepared weekly, it is compared with the
    previous week's nodelist, and a file containing only the
    differences is created and distributed.

|   The distribution difference file, called NODEDIFF.nnn, where nnn
    is the day-of-year of publication, is actually an editing script
    which will transform the previous week's nodelist into the
    current nodelist. A definition of its format follows:

    The first line of NODEDIFF.nnn is an exact copy of the first line
|   of LAST WEEK'S nodelist (i.e. the first line of the nodelist to
|   which the current difference file applies). This is used as a
    first-level confidence check to insure that the correct file is
    being edited. The second and subsequent lines are editing
    commands and data.

    There are three editing commands and all have the same format:

  <command><number>

    <command> is a 1 letter command, one of A, C, or D.

    <number> is a decimal number greater than zero, and defines the
    number of lines to be operated on by the command. Each command
    appears on a line by itself. The commands have the following
    meanings:

        Ann     Add the following nn lines to the output file.
        Cnn     Copy nn unchanged lines from the input to the output
                file.
        Dnn     Delete (or skip) nn lines from the input file.

    The following illustrate how the first few lines of a
|   hypothetical NODEDIFF.213 might look:

        ;A Friday, July 25, 1986 -- Day number 206 : 27712
        D2
        A2
        ;A Friday, August 1, 1986 -- Day number 213 : 05060
        ;A
        C5

    This fragment illustrates all three editing commands. The first
    line is the first line from the previous nodelist, NODELIST.206.
    The next line says "delete the first two lines" from
    NODELIST.206. These are the identification line and the line
    following it. The next command says "add the next two lines" to
    NODELIST.213 at the "current" location. The two data lines are
    followed by a command which says "copy five unchanged lines" from
    NODELIST.206 to NODELIST.213. Notice that the first line added
|   will ALWAYS contain the new nodelist CRC, so that the software
|   applying the changes to the old nodelist may check the result of
|   its editing.

    Since only the differences will be distributed, it is important
    to insure the accuracy of the newly created nodelist. This is the
    function of the CRC mentioned above. It is sufficient for a
    program designed to perform the above edits to pick the CRC value
    from the first line added to the output file, then compute the
    CRC of the rest of the output file. If the two CRCs do not agree,
    one of the input files has been corrupted. If they do agree, the
    probability is very high (but not 100%) that the output file is
    accurate.

    For actual distribution, NODEDIFF.nnn is packed into an archive
|   file named NODEDIFF.?nn, where 'nn' are the last two digits of
|   day-of-year, and '?' indicates the compression format used.


| NODELIST COMPILATION

|   This section is included for tutorial reasons and is not intended
|   as a definition of any specific method by which FidoNet MUST
|   compile its weekly nodelist. It merely represents an attempt to
|   document the method by which it currently does so. It is intended
|   to be explanatory, and seeks to answer commonly asked questions,
|   such as how the nodelist is compiled and where the information
|   comes from, why the nodelists used in different FidoNet zones are
|   not the same document, and why the difference file generated for
|   use in one FidoNet zone cannot be applied to the nodelist
|   generated for use in a different zone, even though the week
|   numbers match.

|   Nodelists are compiled via a distributed method, which follows
|   the same structure as the FidoNet coordinator hierarchy. At the
|   lowest level, network coordinators maintain a list of the nodes
|   in their network and are responsible for the addition, removal
|   and correction of individual node's listings in their "segment"
|   (as portions of the full nodelist are called). In some larger
|   networks, it is common for this job to be shared with hub
|   coordinators appointed by the net coordinator, though the
|   responsibility for those hub segments still remains with the
|   network coordinator.

|   At a nominated day during the week, before the regional level
|   segment is submitted to the zone coordinator, individual net
|   coordinators submit their segments to the regional coordinator
|   who subsequently compiles these segments and transmits the merged
|   copy to the zone coordinator. These are combined by the zone
|   coordinator with the separate segments of other zones and
|   compiled into that zone's version of the world nodelist. This
|   world nodelist is then compared with the previous week's version,
|   a difference file is generated and subsequently distributed
|   throughout the zone.

|   In some cases, in the interest of saving in transmission times
|   and therefore costs, the compilation process itself may be better
|   served by the submission of DIFFERENCE FILES rather than full
|   net- or region-level segments. Each coordinator therefore retains
|   a copy of the previously submitted segments and applies
|   difference files to those to derive the new one. This process is
|   exactly identical to the NODEDIFF/NODELIST scenario described
|   earlier in this document, with the same first line and CRC
|   validation method used to guard the integrity of the nodelist
|   segments.

|   For a number of reasons, it is important that publication of the
|   nodelist be as timely as possible. These reasons include: the
|   nodelist is a definitive list of valid FidoNet addresses that may
|   receive mail, and must therefore be as correct and up-to-date as
|   possible to save nodes the unnecessary expense of mail routed to
|   possibly non-existing addresses; the nodelist contains the list
|   of telephone numbers that may be called by any user of the
|   FidoNet nodelist and should therefore be accurate so as not to
|   unduly annoy owners of those phone numbers should a listed node
|   go down and an unsuspecting telephone subscriber inherit the same
|   telephone number.

|   Given this constraint, the expense of international calls and the
|   fact that FidoNet is a worldwide network that exists in many time
|   zones, it may be unreasonable to expect the compilation of the
|   nodelist to be delayed until each zone coordinator can transmit
|   their most up-to-date zone segment to a central authority for
|   compilation and subsequent redistribution in any week. For the
|   sake of expedience, each zone instead maintains its own separate
|   world nodelist which contains a compilation of the current zone's
|   latest segments and including the most current copy to hand of
|   all other FidoNet zone's segments. The zone level nodelist
|   generated each week by each zone coordinator is then transmitted
|   to all other zone coordinators for inclusion into their separate
|   world nodelist as timing permits.

|   In theory, then, the only difference between nodelists
|   distributed in each zone in any week are accounted for by timing
|   differences in the exchange of each zone's separate segment. In
|   practice, other constraints may interfere with timeliness, such
|   as the difficulty and expense of international telephonic
|   communications. Also, another point of variance is introduced by
|   the fact that each zone usually includes its own zone segment
|   first into its world nodelist to assist - amongst other things -
|   software that uses the nodelist for index generation. Some
|   software in common use in FidoNet indexes the nodelist according
|   to its sequential order (e.g. version 5 and 6 compiled nodelist
|   formats), and including the current zone first before others will
|   have a beneficial effect on software performance.

BackGo Back