UTF-8

Meaning of UTF-8 in English

< character > (UCS transformation format 8) An ASCII -compatible multibyte Unicode and UCS encoding, used by Java and Plan 9 .

The Unicode character set occupies a 16-bit code space. The most obvious Unicode encoding (known as UCS-2) consists of a sequence of 16-bit words. Such strings can contain bytes like '\0' or '/' which have a special meaning in filenames and other C library function parameters. In addition, the majority of Unix tools expects ASCII files and can't read 16-bit words as characters without major modifications. For these reasons, UCS-2 is not a suitable external encoding of Unicode in filenames, text files, environment variables, etc.

The ISO 10646 Universal Character Set (UCS), a superset of Unicode, occupies a 31-bit code space and the obvious UCS-4 encoding for it (a sequence of 32-bit words) has the same problems.

The UTF-8 encoding of Unicode and UCS avoids the problems of fixed-length Unicode encodings because an ASCII file encoded in UTF is exactly same as the original ASCII file and all non-ASCII characters are guaranteed to have the most significant bit set (bit 0x80). This means that normal tools for text searching etc. work as expected.

UTF-8 is defined in RFC 2279 .

["File System Safe UCS Transformation Format (FSS_UTF)", X/Open Preliminary Specification, X/Open Company Ltd., Document Number: P316. This information also appears in ISO/IEC 10646, Annex P].

Plan 9 UTF manual entry .

(1998-07-29)

FOLDOC computer English dictionary. Английский словарь по компьютерам FOLDOC. 2012

UTF-8 — UTF-8; Юникод
Американский Англо-Русский словарь
UTF-8 — UTF-8
Русско-Американский Английский словарь
UTF-8 — Набор знаков для протоколов, допускающих выход за рамки ASCII. Протокол UTF-8 обеспечивает поддержку расширенного набора знаков ASCII и трансляцию UCS-2, …
Russian-English Edic
UTF-8 — (UCS transformation format 8) преобразование UCS, формат 8; код UTF-8 ASCII-совместимый многобайтовый код, применяемый в языке Java и операционной системе Plan …
Англо-Русский толковый словарь терминов и сокращений по ВТ, Интернету и программированию
8 — 8 /eɪt/ BrE AmE written informal a way of writing parts of words that sound like ‘-ate’, ‘-eat’, or ‘-ait’, …
Longman Dictionary of Contemporary English
8 — ) - Internet symbol for someone wearing glasses
Cambridge English vocab
UTF — UCS transformation format
FOLDOC Computer English Dictionary
.UTF — AOL Updating Files
Computer Abbreviations English vocabulary
.8 — A86 Assembler Program File
Computer Abbreviations English vocabulary
UTF — Ucs Transformation Format
Computer Acronyms English vocab
UTF — Use The Force For more possible definitions for UTF , click here ©1988-2002, All Rights Reserved, AcronymFinder.com
Most Common Acronyms and Abbreviations English vocabulary
8-;) — Tongue in cheek
English Glossary of Computer and Internet Terms
8-) — sunglasses
English Glossary of Computer and Internet Terms
8)~~* — frog
English Glossary of Computer and Internet Terms
UTF — сокр. [ultrathin foil] сверхтонкая фольга
Большой Англо-Русский словарь
UTF — сокр. [ultrathin foil] сверхтонкая фольга
Большой Англо-Русский словарь
UTF — сокр. от ultrathin foil сверхтонкая фольга
Большой Англо-Русский политехнический словарь
UTF — сокр. от ultrathin foil сверхтонкая фольга
Большой Англо-Русский политехнический словарь - РУССО
8 — как часть акронима; используется для обозначения звукосочетания eɪt см. тж. acronym I h8 it = I hate it — Не …
Англо-Русский словарь по общей лексике
8 — как часть акронима; используется для обозначения звукосочетания [ўЎ«] см. тж. acronym I h8 it - I hate it — Не выношу этого; …
Англо-Русский словарь общей лексики
:-8( — (смайл) снисходительный пристальный взгляд
Англо-Русский словарь по компьютерам
:-)-8 — (смайл) я - большая девочка
Англо-Русский словарь по компьютерам
8:-) — (смайл) я - маленькая девочка
Англо-Русский словарь по компьютерам
8-) — (смайл) я ношу солнечные очки
Англо-Русский словарь по компьютерам
8-# — (смайл) смерть
Англо-Русский словарь по компьютерам
8- — (смайл) неопределенность
Англо-Русский словарь по компьютерам
8 — (смайл) бесконечность
Англо-Русский словарь по компьютерам
)8-) — (смайл) улыбка аквалангиста с большим лицом
Англо-Русский словарь по компьютерам
UTF — Unicode Transformation Format
Russian-English Edic
UTF — сокр. [ultrathin foil] сверхтонкая фольга
Новый большой Англо-Русский словарь
UTF — сокр. [ultrathin foil] сверхтонкая фольга
Новый большой Англо-Русский словарь
UTF-8; ЮНИКОД — UTF-8
Русско-Американский Английский словарь
STATISTICS: UZBEKISTAN
Britannica English vocabulary
STATISTICS: UNITED STATES
Britannica English vocabulary
STATISTICS: UNITED KINGDOM
Britannica English vocabulary
STATISTICS: UGANDA
Britannica English vocabulary
STATISTICS: SWEDEN
Britannica English vocabulary
STATISTICS: SPAIN
Britannica English vocabulary
STATISTICS: RUSSIA
Britannica English vocabulary
STATISTICS: JAPAN
Britannica English vocabulary
STATISTICS: INDIA
Britannica English vocabulary
STATISTICS: FRANCE
Britannica English vocabulary
STATISTICS: CHINA
Britannica English vocabulary
STATISTICS: CANADA
Britannica English vocabulary
STATISTICS: BARBADOS
Britannica English vocabulary
STATISTICS: AUSTRALIA
Britannica English vocabulary
CHARACTER — colon epsilon SYN ampersand Mu tilde alphanumeric Device Control 4 code position asterisk elvish character Device Control 2 File Separator …
FOLDOC Computer English Dictionary
CHARACTER SET — < character > 1. A particular mapping between characters and byte strings , i.e. the combination of a particular character …
FOLDOC Computer English Dictionary
CHARACTER ENCODING — < character > (Or "character encoding scheme") A mapping of binary values to code positions and back; generally a 1:1 …
FOLDOC Computer English Dictionary
UCS TRANSFORMATION FORMAT — < standard , character > (UTF) A set of standard character encodings in accordance with ISO 10646 . One of …
FOLDOC Computer English Dictionary
TLAS
FOLDOC Computer English Dictionary
RFC 2279 — < standard > The RFC defining UTF-8 . rfc 2279 . (1998-07-29)
FOLDOC Computer English Dictionary
.DN8 — Blue Martini UTF-8 Encoded DNA File (eCRM)
Computer Abbreviations English vocabulary
CLASSIC CAR — A vehicle that is generally considered to be one of the finest models ever built. Unlike antique cars, classic cars …
English Dictionary of Automotive Terms
ЮНИКОД — Стандарт кодировки знаков, разработанный организацией Unicode Consortium, который позволяет представить знаки практически всех письменных языков. Набор знаков в кодировке Юникод …
Russian-English Edic
DN8 — Blue Martini UTF-8 Encoded DNA File (eCRM)
Russian-English Edic

FOLDOC Computer English Dictionary

← UTF UTOPIST→