Very simple and well tested C library for parsing URLs with zero-copy and no
mallocs. The library does not do any validation on the URL, neither before nor
after it is parsed. The different parts are parsed by searching for special
characters like :
and /
. For a URL should be able to be parsed by yuarel,
it has to be constructed in one of the following formats:
Absolute URL: scheme ":" [ "//" ] [ username ":" password "@" ] host [ ":" port ] [ "/" ] [ path ] [ "?" query ] [ "#" fragment ]
Relative URL: path [ "?" query ] [ "#" fragment ]
Parts within [
and ]
are optional. A minimal URL could look like this:
a:b
or /
Due to the fact that the library isn't copying any strings and instead points
to the parts in the URL string, the first /
in the path will be replaced with
a null terminator. Therefore, the first slash will be missing in the path.
$ make && make check && sudo make install
Compile the example in examples/
:
$ make examples
Run the example program:
$ ./simple
The struct that holds the parsed URL looks like this:
struct yuarel {
char *scheme; /* scheme, without ":" and "//" */
char *username; /* username, default: NULL */
char *password; /* password, default: NULL */
char *host; /* hostname or IP address */
int port; /* port, default: 0 */
char *path; /* path, without leading "/", default: NULL */
char *query; /* query, default: NULL */
char *fragment; /* fragment, default: NULL */
};
The struct that holds a parsed query string parameter looks like this:
struct yuarel_param {
char *key;
char *val;
};
int yuarel_parse(struct yuarel *url, char *url_str)
struct yuarel *url
: a pointer to the struct where to store the parsed values.
char *url_str
: a pointer to the url to be parsed (null terminated).
Note that the url string will be modified by the function.
Returns 0 on success, otherwise -1.
int yuarel_split_path(char *path, char **parts, int max_parts)
No data is copied, the slashed are used as null terminators and then
pointers to each path part will be stored in parts
.
char *path
: the path to split. The string will be modified.
char **parts
: a pointer to an array of (char *)
where to store the result.
int max_parts
: max number of parts to parse.
Note that the path string will be modified by the function.
Returns the number of parsed items. -1 on error.
int yuarel_parse_query(char *query, char delimiter, struct yuarel_param *params, int max_params)
char *query
: the query string to parse. The string will be modified.
char delimiter
: the character that separates the key/value pairs from eachother.
struct yuarel_param *params
: an array of (struct yuarel_param)
where to store the result.
int max_values
: max number of parameters to parse.
The query string should be a null terminated string of parameters separated by a delimiter. Each parameter are checked for the equal sign character. If it appears in the parameter, it will be used as a null terminator and the part that comes after it will be the value of the parameter.
No data are copied, the equal sign and delimiters are used as null terminators and then pointers to each parameter key and value will be stored in the yuarel_param struct.
Note that the query string will be modified by the function.
Returns the number of parsed items. -1 on error.
Compile with -lyuarel
.
#include <stdlib.h>
#include <stdio.h>
#include <yuarel.h>
int main(void)
{
int p;
struct yuarel url;
char *parts[3];
char url_string[] = "http://localhost:8989/path/to/test?query=yes#frag=1";
if (-1 == yuarel_parse(&url, url_string)) {
fprintf(stderr, "Could not parse url!\n");
return 1;
}
printf("scheme:\t%s\n", url.scheme);
printf("host:\t%s\n", url.host);
printf("port:\t%d\n", url.port);
printf("path:\t%s\n", url.path);
printf("query:\t%s\n", url.query);
printf("fragment:\t%s\n", url.fragment);
if (3 != yuarel_split_path(url.path, parts, 3)) {
fprintf(stderr, "Could not split path!\n");
return 1;
}
printf("path parts: %s, %s, %s\n", parts[0], parts[1], parts[2]);
printf("Query string parameters:\n");
p = yuarel_parse_query(url.query, '&', params, 3);
while (p-- > 0) {
printf("\t%s: %s\n", params[p].key, params[p].val);
}
}