Provided by: libcurl4-doc_8.12.1-3ubuntu1_all bug

NAME

       curl_url_set - set a URL part

SYNOPSIS

       #include <curl/curl.h>

       CURLUcode curl_url_set(CURLU *url,
                              CURLUPart part,
                              const char *content,
                              unsigned int flags);

DESCRIPTION

       The  url  handle  to  work  on,  passed  in as the first argument, must be a handle previously created by
       curl_url(3) or curl_url_dup(3).

       This function sets or updates individual URL components, or parts, held by  the  URL  object  the  handle
       identifies.

       The part argument should identify the particular URL part (see list below) to set or change, with content
       pointing  to  a null-terminated string with the new contents for that URL part. The contents should be in
       the form and encoding they would use in a URL: URL encoded.

       When setting a part in the URL object that was previously already set, it  replaces  the  data  that  was
       previously stored for that part with the new content.

       The  caller  does  not  have  to  keep content around after a successful call as this function copies the
       content.

       Setting a part to a NULL pointer removes that part's contents from the CURLU handle.

       This function has an 8 MB maximum length limit for all  provided  input  strings.   In  the  real  world,
       excessively long fields in URLs cause problems even if this function accepts them.

       When  setting  or updating contents of individual URL parts, curl_url_set(3) might accept data that would
       not be otherwise possible to set in the string when it gets populated as a result of a  full  URL  parse.
       Beware. If done so, extracting a full URL later on from such components might render an invalid URL.

       The flags argument is a bitmask with independent features.

PARTS

       CURLUPART_URL
              Allows  the  full URL of the handle to be replaced. If the handle already is populated with a URL,
              the new URL can be relative to the previous.

              When successfully setting a new URL, relative or absolute, the handle contents  is  replaced  with
              the components of the newly set URL.

              Pass  a  pointer  to  a  null-terminated  string  to the url parameter. The string must point to a
              correctly formatted "RFC 3986+" URL or be a NULL pointer. The  URL  parser  only  understands  and
              parses  the subset of URLS that are "hierarchical" and therefore contain a :// separator - not the
              ones that are normally specified with only a colon separator.

              By default this API only parses URLs using schemes for protocols that are supported  built-in.  To
              make   libcurl   parse   URLs   generically   even  for  schemes  it  does  not  know  about,  the
              CURLU_NON_SUPPORT_SCHEME  flags   bit   must   be   set.    Otherwise,   this   function   returns
              CURLUE_UNSUPPORTED_SCHEME for URL schemes it does not recognize.

              Unless CURLU_NO_AUTHORITY is set, a blank hostname is not allowed in the URL.

              When a full URL is set (parsed), the hostname component is stored URL decoded.

       CURLUPART_SCHEME
              Scheme cannot be URL decoded on set. libcurl only accepts setting schemes up to 40 bytes long.

       CURLUPART_USER
              If only the user part is set and not the password, the URL is represented with a blank password.

       CURLUPART_PASSWORD
              If only the password part is set and not the user, the URL is represented with a blank user.

       CURLUPART_OPTIONS
              The  options field is an optional field that might follow the password in the userinfo part. It is
              only recognized/used when parsing URLs for the  following  schemes:  pop3,  smtp  and  imap.  This
              function however allows users to independently set this field.

       CURLUPART_HOST
              The  hostname.  If  it  is International Domain Name (IDN) the string must then be encoded as your
              locale says or UTF-8 (when WinIDN is used). If it is a  bracketed  IPv6  numeric  address  it  may
              contain a zone id (or you can use CURLUPART_ZONEID).

              Note  that  if  you  set  an  IPv6 address, it gets ruined and causes an error if you also set the
              CURLU_URLENCODE flag.

              Unless CURLU_NO_AUTHORITY is set, a blank hostname is not allowed to set.

       CURLUPART_ZONEID
              If the hostname is a numeric IPv6 address, this field can also be set.

       CURLUPART_PORT
              The port number cannot be URL encoded on set. The given port number is provided as  a  string  and
              the decimal number in it must be between 0 and 65535. Anything else returns an error.

       CURLUPART_PATH
              If a path is set in the URL without a leading slash, a slash is prepended automatically.

       CURLUPART_QUERY
              The  query  part  gets  spaces  converted  to  pluses  when  asked  to  URL encode on set with the
              CURLU_URLENCODE bit.

              If used together with the CURLU_APPENDQUERY bit, the provided part is appended on the end  of  the
              existing query.

              The question mark in the URL is not part of the actual query contents.

       CURLUPART_FRAGMENT
              The hash sign in the URL is not part of the actual fragment contents.

FLAGS

       The flags argument is zero, one or more bits set in a bitmask.

       CURLU_APPENDQUERY
              Can  be used when setting the CURLUPART_QUERY component. The provided new part is then appended at
              the end of the existing query - and if the previous part did not end with  an  ampersand  (&),  an
              ampersand gets inserted before the new appended part.

              When  CURLU_APPENDQUERY  is  used  together  with CURLU_URLENCODE, the first '=' symbol is not URL
              encoded.

       CURLU_NON_SUPPORT_SCHEME
              If set, allows curl_url_set(3) to set a non-supported scheme. It then of course cannot know if the
              provided scheme is a valid one or not.

       CURLU_URLENCODE
              When set, curl_url_set(3) URL encodes the part on entry, except for scheme, port and URL.

              When setting the path component with URL encoding enabled, the slash character is skipped.

              The query part gets space-to-plus converted before the URL conversion is applied.

              This URL encoding is charset unaware and converts the input in a byte-by-byte manner.

       CURLU_DEFAULT_SCHEME
              If set, allows the URL to be set without a scheme and then sets that to the default scheme: HTTPS.
              Overrides the CURLU_GUESS_SCHEME option if both are set.

       CURLU_GUESS_SCHEME
              If set, allows the URL to be set without a scheme and it instead "guesses" which scheme  that  was
              intended  based  on  the  hostname. If the outermost subdomain name matches DICT, FTP, IMAP, LDAP,
              POP3  or  SMTP  then  that  scheme  is  used,  otherwise  it  picks  HTTP.  Conflicts   with   the
              CURLU_DEFAULT_SCHEME option which takes precedence if both are set.

              If  guessing  is  not  allowed and there is no default scheme set, trying to parse a URL without a
              scheme returns error.

              If the scheme ends up set as a result of guessing, i.e. it is not actually present in  the  parsed
              URL, it can later be figured out by using the CURLU_NO_GUESS_SCHEME flag when subsequently getting
              the URL or the scheme with curl_url_get(3).

       CURLU_NO_AUTHORITY
              If  set, skips authority checks. The RFC allows individual schemes to omit the host part (normally
              the only mandatory part of the authority), but libcurl cannot know whether this is  permitted  for
              custom  schemes.  Specifying the flag permits empty authority sections, similar to how file scheme
              is handled.

       CURLU_PATH_AS_IS
              When set for CURLUPART_URL, this skips the normalization of the path. That is the procedure  where
              libcurl  otherwise  removes  sequences  of  dot-slash  and  dot-dot  etc. The same option used for
              transfers is called CURLOPT_PATH_AS_IS(3).

       CURLU_ALLOW_SPACE
              If set, the URL parser allows space (ASCII 32) where possible. The URL syntax  does  normally  not
              allow spaces anywhere, but they should be encoded as %20 or '+'. When spaces are allowed, they are
              still  not  allowed  in  the  scheme.  When space is used and allowed in a URL, it is stored as-is
              unless CURLU_URLENCODE is also set, which then makes libcurl URL encode the space  before  stored.
              This  affects  how the URL is constructed when curl_url_get(3) is subsequently used to extract the
              full URL or individual parts. (Added in 7.78.0)

       CURLU_DISALLOW_USER
              If set, the URL parser does not accept embedded credentials for  the  CURLUPART_URL,  and  instead
              returns CURLUE_USER_NOT_ALLOWED for such URLs.

PROTOCOLS

       This functionality affects all supported protocols

EXAMPLE

       int main(void)
       {
         CURLUcode rc;
         CURLU *url = curl_url();
         rc = curl_url_set(url, CURLUPART_URL, "https://example.com", 0);
         if(!rc) {
           /* change it to an FTP URL */
           rc = curl_url_set(url, CURLUPART_SCHEME, "ftp", 0);
         }
         curl_url_cleanup(url);
       }

AVAILABILITY

       Added in curl 7.62.0

RETURN VALUE

       Returns   a   CURLUcode   error  value,  which  is  CURLUE_OK  (0)  if  everything  went  fine.  See  the
       libcurl-errors(3) man page for the full list with descriptions.

       The input string passed to curl_url_set(3) must be shorter  than  eight  million  bytes.  Otherwise  this
       function returns CURLUE_MALFORMED_INPUT.

       If this function returns an error, no URL part is set.

SEE ALSO

       CURLOPT_CURLU(3),      curl_url(3),      curl_url_cleanup(3),      curl_url_dup(3),      curl_url_get(3),
       curl_url_strerror(3)

libcurl                                            2025-03-05                                    curl_url_set(3)