detect man page

detect, detect_r — Detecting character set and measuring accuracy of charset

Synopsis

#include <chardet.h>

short detect (char * inbuf, DetectObj ** outbuf);

short detect_r (char * inbuf, size_t inlen, DetectObj ** outbuf);

Description

Storing charset and accuracy of inbuf to outbuf

The detect API is deprecated becase this api is not binary safe. Use or replace to detect_r api.

Arguments

inbuf
input string for detecting
inlen
length of input string for detecting
outbuf

Stroing inforamtion of inbuf. The structure of outbuf is follows.

typedef struct DetectObject {
	char * encoding;
	float confidence;
} DetectObj;

The outbuf variable is must initialized by detect_obj_init API before calling this detect api.

Return Value

Returns following condition as case by case.

CHARDET_SUCCESS
Detecting success
CHARDET_NO_RESULT
Detection failure
CHARDET_NULL_OBJECT
Don't initializing outbuf with chardet_obj_init
CHARDET_OUT_OF_MEMORY
Occuring out of memory at internal API

Example

#include <chardet.h>

int main (void) {
	DetectObj *obj;
	char * checkstr = "안녕하세요";

	if ( (obj = detect_obj_init ()) == NULL ) {
		fprintf (stderr, "Memory Allocation failed\n");
		return CHARDET_MEM_ALLOCATED_FAIL;
	}

	//switch (detect (checkstr, &obj))
	switch (detect_r (checkstr, strlen (checkstr), &obj))
	{
		case CHARDET_OUT_OF_MEMORY :
			fprintf (stderr, "On handle processing, occured out of memory\n");
			detect_obj_free (&obj);
			return CHARDET_OUT_OF_MEMORY;
		case CHARDET_NULL_OBJECT :
			fprintf (stderr,
					"2st argument of chardet() is must memory allocation "
					"with detect_obj_init API\n");
			return CHARDET_NULL_OBJECT;
	}

	printf ("encoding: %s, confidence: %f\n", obj->encoding, obj->confidence);
	detect_obj_free (&obj);

    return 0;
}

Authors

JoungKyun.Kim <http://oops.org>

Bug Reports

Use QnA board on http://oops.org

See Also

detect_handledata(3), detect_obj_init(3), detect_obj_free(3)

Referenced By

detect_obj_init(3).

2015-12-11 libchardet manuals