Google


     fetchStatHTTP, fetchListHTTP, fetchXGetFTP, fetchGetFTP, fetchPutFTP,
     fetchStatFTP, fetchListFTP -- file transfer functions


LIBRARY

     library ``libfetch''


SYNOPSIS

     #include <sys/param.h>
     #include <stdio.h>
     #include <fetch.h>

     struct url *
     fetchMakeURL(const char *scheme, const char *host, int port,
         const char *doc, const char *user, const char *pwd);

     struct url *
     fetchParseURL(const char *URL);

     void
     fetchFreeURL(struct url *u);

     FILE *
     fetchXGetURL(const char *URL, struct url_stat *us, const char *flags);

     FILE *
     fetchGetURL(const char *URL, const char *flags);

     FILE *
     fetchPutURL(const char *URL, const char *flags);

     int
     fetchStatURL(const char *URL, struct url_stat *us, const char *flags);

     struct url_ent *
     fetchListURL(const char *URL, const char *flags);

     FILE *
     fetchXGet(struct url *u, struct url_stat *us, const char *flags);

     FILE *
     fetchGet(struct url *u, const char *flags);

     FILE *
     fetchPut(struct url *u, const char *flags);

     int
     fetchStat(struct url *u, struct url_stat *us, const char *flags);

     struct url_ent *
     fetchList(struct url *u, const char *flags);

     FILE *
     FILE *
     fetchXGetHTTP(struct url *u, struct url_stat *us, const char *flags);

     FILE *
     fetchGetHTTP(struct url *u, const char *flags);

     FILE *
     fetchPutHTTP(struct url *u, const char *flags);

     int
     fetchStatHTTP(struct url *u, struct url_stat *us, const char *flags);

     struct url_ent *
     fetchListHTTP(struct url *u, const char *flags);

     FILE *
     fetchXGetFTP(struct url *u, struct url_stat *us, const char *flags);

     FILE *
     fetchGetFTP(struct url *u, const char *flags);

     FILE *
     fetchPutFTP(struct url *u, const char *flags);

     int
     fetchStatFTP(struct url *u, struct url_stat *us, const char *flags);

     struct url_ent *
     fetchListFTP(struct url *u, const char *flags);


DESCRIPTION

     These functions implement a high-level library for retrieving and upload-
     ing files using Uniform Resource Locators (URLs).

     fetchParseURL() takes a URL in the form of a null-terminated string and
     splits it into its components function according to the Common Internet
     Scheme Syntax detailed in RFC1738.  A regular expression which produces
     this syntax is:

         <scheme>:(//(<user>(:<pwd>)?@)?<host>(:<port>)?)?/(<document>)?

     If the URL does not seem to begin with a scheme name, the following syn-
     tax is assumed:

         ((<user>(:<pwd>)?@)?<host>(:<port>)?)?/(<document>)?

     Note that some components of the URL are not necessarily relevant to all
     URL schemes.  For instance, the file scheme only needs the <scheme> and
     <document> components.

     fetchMakeURL() and fetchParseURL() return a pointer to a url structure,
     which is defined as follows in <fetch.h>:
     };

     The pointer returned by fetchMakeURL() or fetchParseURL() should be freed
     using fetchFreeURL().

     fetchXGetURL(), fetchGetURL(), and fetchPutURL() constitute the recom-
     mended interface to the fetch library.  They examine the URL passed to
     them to determine the transfer method, and call the appropriate lower-
     level functions to perform the actual transfer.  fetchXGetURL() also
     returns the remote document's metadata in the url_stat structure pointed
     to by the us argument.

     The flags argument is a string of characters which specify transfer
     options.  The meaning of the individual flags is scheme-dependent, and is
     detailed in the appropriate section below.

     fetchStatURL() attempts to obtain the requested document's metadata and
     fill in the structure pointed to by its second argument.  The url_stat
     structure is defined as follows in <fetch.h>:

     struct url_stat {
         off_t        size;
         time_t       atime;
         time_t       mtime;
     };

     If the size could not be obtained from the server, the size field is set
     to -1.  If the modification time could not be obtained from the server,
     the mtime field is set to the epoch.  If the access time could not be
     obtained from the server, the atime field is set to the modification
     time.

     fetchListURL() attempts to list the contents of the directory pointed to
     by the URL provided.  If successful, it returns a malloced array of
     url_ent structures.  The url_ent structure is defined as follows in
     <fetch.h>:

     struct url_ent {
         char         name[MAXPATHLEN];
         struct url_stat stat;
     };

     The list is terminated by an entry with an empty name.

     The pointer returned by fetchListURL() should be freed using free().

     fetchXGet(), fetchGet(), fetchPut() and fetchStat() are similar to
     fetchXGetURL(), fetchGetURL(), fetchPutURL() and fetchStatURL(), except
     that they expect a pre-parsed URL in the form of a pointer to a struct
     url rather than a string.

     All of the fetchXGetXXX(), fetchGetXXX() and fetchPutXXX() functions
     fetchPutFile() accepts the `a' (append to file) flag.  If that flag is
     specified, the data written to the stream returned by fetchPutFile() will
     be appended to the previous contents of the file, instead of replacing
     them.


FTP SCHEME

     fetchXGetFTP(), fetchGetFTP() and fetchPutFTP() implement the FTP proto-
     col as described in RFC959.

     If the `p' (passive) flag is specified, a passive (rather than active)
     connection will be attempted.

     If the `l' (low) flag is specified, data sockets will be allocated in the
     low (or default) port range instead of the high port range (see ip(4)).

     If the `d' (direct) flag is specified, fetchXGetFTP(), fetchGetFTP() and
     fetchPutFTP() will use a direct connection even if a proxy server is
     defined.

     If no user name or password is given, the fetch library will attempt an
     anonymous login, with user name "anonymous" and password "anony-
     mous@<hostname>".

     By default, fetch changes directories one at a time for the best compati-
     bility with a wide variety of servers.  If it is compiled with the
     `FTP_COMBINE_CWDS' flag, it will first try a single change directory com-
     mand for the entire path and fall back to the default scheme if an error
     occurred.  This can save latency when used with a slow network link.


HTTP SCHEME

     The fetchXGetHTTP(), fetchGetHTTP() and fetchPutHTTP() functions imple-
     ment the HTTP/1.1 protocol.  With a little luck, there is even a chance
     that they comply with RFC2616 and RFC2617.

     If the `d' (direct) flag is specified, fetchXGetHTTP(), fetchGetHTTP()
     and fetchPutHTTP() will use a direct connection even if a proxy server is
     defined.

     Since there seems to be no good way of implementing the HTTP PUT method
     in a manner consistent with the rest of the fetch library, fetchPutHTTP()
     is currently unimplemented.


AUTHENTICATION

     Apart from setting the appropriate environment variables and specifying
     the user name and password in the URL or the struct url, the calling pro-
     gram has the option of defining an authentication function with the fol-
     lowing prototype:

     int myAuthMethod(struct url *u)

     The callback function should fill in the user and pwd fields in the pro-
     vided struct url and return 0 on success, or any other value to indicate

     All other functions return a stream pointer which may be used to access
     the requested document, or NULL if an error occurred.

     The following error codes are defined in <fetch.h>:

     [FETCH_ABORT]       Operation aborted

     [FETCH_AUTH]        Authentication failed

     [FETCH_DOWN]        Service unavailable

     [FETCH_EXISTS]      File exists

     [FETCH_FULL]        File system full

     [FETCH_INFO]        Informational response

     [FETCH_MEMORY]      Insufficient memory

     [FETCH_MOVED]       File has moved

     [FETCH_NETWORK]     Network error

     [FETCH_OK]          No error

     [FETCH_PROTO]       Protocol error

     [FETCH_RESOLV]      Resolver error

     [FETCH_SERVER]      Server error

     [FETCH_TEMP]        Temporary error

     [FETCH_TIMEOUT]     Operation timed out

     [FETCH_UNAVAIL]     File is not available

     [FETCH_UNKNOWN]     Unknown error

     [FETCH_URL]         Invalid URL

     The accompanying error message includes a protocol-specific error code
     and message, e.g. "File is not available (404 Not Found)"


ENVIRONMENT

     FETCH_BIND_ADDRESS  Specifies a hostname or IP address to which sockets
                         used for outgoing connections will be bound.

     FTP_LOGIN           Default FTP login if none was provided in the URL.

     FTP_PASSIVE_MODE    If set to anything but `no', forces the FTP code to
                         will be used for FTP requests, even if the HTTP_PROXY
                         variable is set.

     ftp_proxy           Same as FTP_PROXY, for compatibility.

     HTTP_AUTH           Specifies HTTP authorization parameters as a colon-
                         separated list of items.  The first and second item
                         are the authorization scheme and realm respectively;
                         further items are scheme-dependent.  Currently, only
                         basic authorization is supported.

                         Basic authorization requires two parameters: the user
                         name and password, in that order.

                         This variable is only used if the server requires
                         authorization and no user name or password was speci-
                         fied in the URL.

     HTTP_PROXY          URL of the proxy to use for HTTP requests.  The docu-
                         ment part is ignored.  Only HTTP proxies are sup-
                         ported for HTTP requests.  If no port number is spec-
                         ified, the default is 3128.

                         Note that this proxy will also be used for FTP docu-
                         ments, unless the FTP_PROXY variable is set.

     http_proxy          Same as HTTP_PROXY, for compatibility.

     HTTP_PROXY_AUTH     Specifies authorization parameters for the HTTP proxy
                         in the same format as the HTTP_AUTH variable.

                         This variable is used if and only if connected to an
                         HTTP proxy, and is ignored if a user and/or a pass-
                         word were specified in the proxy URL.

     HTTP_REFERER        Specifies the referrer URL to use for HTTP requests.
                         If set to ``auto'', the document URL will be used as
                         referrer URL.

     HTTP_USER_AGENT     Specifies the User-Agent string to use for HTTP
                         requests.  This can be useful when working with HTTP
                         origin or proxy servers that differentiate between
                         user agents.

     NETRC               Specifies a file to use instead of ~/.netrc to look
                         up login names and passwords for FTP sites.  See
                         ftp(1) for a description of the file format.  This
                         feature is experimental.


EXAMPLES

     To access a proxy server on proxy.example.com port 8080, set the
     HTTP_PROXY environment variable in a manner similar to this:


SEE ALSO

     fetch(1), ftpio(3), ip(4)

     J. Postel and J. K. Reynolds, File Transfer Protocol, October 1985,
     RFC959.

     P. Deutsch, A. Emtage, and A. Marine., How to Use Anonymous FTP, May
     1994, RFC1635.

     T. Berners-Lee, L. Masinter, and M. McCahill, Uniform Resource Locators
     (URL), December 1994, RFC1738.

     R. Fielding, J. Gettys, J. Mogul, H. Frystyk, L. Masinter, P. Leach, and
     T. Berners-Lee, Hypertext Transfer Protocol -- HTTP/1.1, January 1999,
     RFC2616.

     J. Franks, P. Hallam-Baker, J. Hostetler, S. Lawrence, P. Leach, A.
     Luotonen, and L. Stewart, HTTP Authentication: Basic and Digest Access
     Authentication, June 1999, RFC2617.


HISTORY

     The fetch library first appeared in FreeBSD 3.0.


AUTHORS

     The fetch library was mostly written by Dag-Erling Smrgrav
     <des@FreeBSD.org> with numerous suggestions from Jordan K. Hubbard
     <jkh@FreeBSD.org>, Eugene Skepner <eu@qub.com> and other FreeBSD develop-
     ers.  It replaces the older ftpio library written by Poul-Henning Kamp
     <phk@FreeBSD.org> and Jordan K. Hubbard <jkh@FreeBSD.org>.

     This manual page was written by Dag-Erling Smrgrav <des@FreeBSD.org>.


BUGS

     Some parts of the library are not yet implemented.  The most notable
     examples of this are fetchPutHTTP(), fetchListHTTP(), fetchListFTP() and
     FTP proxy support.

     There is no way to select a proxy at run-time other than setting the
     HTTP_PROXY or FTP_PROXY environment variables as appropriate.

     libfetch does not understand or obey 305 (Use Proxy) replies.

     Error numbers are unique only within a certain context; the error codes
     used for FTP and HTTP overlap, as do those used for resolver and system
     errors.  For instance, error code 202 means "Command not implemented,
     superfluous at this site" in an FTP context and "Accepted" in an HTTP
     context.

     fetchStatFTP() does not check that the result of an MDTM command is a
     valid date.

     The man page is incomplete, poorly written and produces badly formatted

Man(1) output converted with man2html