detect - Man Page

Detecting character set and measuring accuracy of charset

Synopsis

#include <chardet.h>

short detect (char * inbuf, DetectObj ** outbuf);

short detect_r (char * inbuf, size_t inlen, DetectObj ** outbuf);

Description

Storing charset and accuracy of inbuf to outbuf

The detect API is deprecated becase this api is not binary safe. Use or replace to detect_r api.

Arguments

inbuf

input string for detecting

inlen

length of input string for detecting

outbuf

Stroing inforamtion of inbuf. The structure of outbuf is follows.

	typedef struct DetectObject {
		char * encoding;
		float confidence;
	} DetectObj;

The outbuf variable is must initialized by detect_obj_init API before calling this detect api.

Return Value

Returns following condition as case by case.

CHARDET_SUCCESS

Detecting success

CHARDET_NO_RESULT

Detection failure

CHARDET_NULL_OBJECT

Don't initializing outbuf with chardet_obj_init

CHARDET_OUT_OF_MEMORY

Occuring out of memory at internal API

Example

#include <chardet.h>

int main (void) {
	DetectObj *obj;
	char * checkstr = "안녕하세요";

	if ( (obj = detect_obj_init ()) == NULL ) {
		fprintf (stderr, "Memory Allocation failed\n");
		return CHARDET_MEM_ALLOCATED_FAIL;
	}

	//switch (detect (checkstr, &obj))
	switch (detect_r (checkstr, strlen (checkstr), &obj))
	{
		case CHARDET_OUT_OF_MEMORY :
			fprintf (stderr, "On handle processing, occured out of memory\n");
			detect_obj_free (&obj);
			return CHARDET_OUT_OF_MEMORY;
		case CHARDET_NULL_OBJECT :
			fprintf (stderr,
					"2st argument of chardet() is must memory allocation "
					"with detect_obj_init API\n");
			return CHARDET_NULL_OBJECT;
	}

	printf ("encoding: %s, confidence: %f\n", obj->encoding, obj->confidence);
	detect_obj_free (&obj);

    return 0;
}

Authors

JoungKyun.Kim <http://oops.org>

Bug Reports

Use QnA board on http://oops.org

See Also

detect_handledata(3), detect_obj_init(3), detect_obj_free(3)

Referenced By

detect_obj_init(3).

2015-12-11 libchardet manuals