An alphabet is a standard set of letters (basic written symbols or graphemes) which is used to write one or more languages based on the general principle that the letters represent phonemes (basic significant sounds) of the spoken language. This is in contrast to other types of writing systems, such as syllabaries (in which each character represents a syllable) and logographies (in which each character represents a word, morpheme or semantic unit). The use of alphabets supports efforts to achieve universal literacy, which is a high priority in contemporary society, through the greater ease of learning a limited number of letters compared to the large numbers of symbols involved in logographies.
A true alphabet has letters for the vowels of a language as well as the consonants. The first "true alphabet" in this sense is believed to be the Greek alphabet, which is a modified form of the Phoenician alphabet. In other types of alphabet either the vowels are not indicated at all, as was the case in the Phoenician alphabet (such systems are known as abjads), or else the vowels are shown by diacritics or modification of consonants, as in the devanagari used in India and Nepal (these systems are known as abugidas or alphasyllabaries).
There are dozens of alphabets in use today, the most popular being the Latin alphabet (which was derived from the Greek). Many languages use modified forms of the Latin alphabet, with additional letters formed using diacritical marks. While most alphabets have letters composed of lines (linear writing), there are also exceptions such as the alphabets used in Braille and Morse code.
Alphabets are usually associated with a standard ordering of their letters. This makes them useful for purposes of collation, specifically by allowing words to be sorted in alphabetical order. It also means that their letters can be used as an alternative method of "numbering" ordered items, in such contexts as numbered lists.
The English word alphabet came into Middle English from the Late Latin word alphabetum, which in turn originated in the Greek ἀλφάβητος (alphabētos), from alpha and beta, the first two letters of the Greek alphabet. Alpha and beta in turn came from the first two letters of the Phoenician alphabet, and originally meant ox and house respectively.
The history of alphabetic writing goes back to the consonantal writing system used for Semitic languages in the Levant in the second millennium B.C.E. Most or nearly all alphabetic scripts used throughout the world today ultimately go back to this Semitic proto-alphabet. Its first origins can be traced back to a Proto-Sinaitic script developed in Ancient Egypt to represent the language of Semitic-speaking workers in Egypt. This script was partly influenced by the older Egyptian hieratic, a cursive script related to Egyptian hieroglyphs. 
Although the following description presents the evolution of scripts in a linear fashion, this is a simplification. For example, the Manchu alphabet, descended from the abjads of West Asia, was also influenced by Korean hangul, which was either independent (the traditional view) or derived from the abugidas of South Asia. Georgian apparently derives from the Aramaic family, but was strongly influenced in its conception by Greek. The Greek alphabet, itself ultimately a derivative of hieroglyphs through that first Semitic alphabet, later adopted an additional half dozen demotic hieroglyphs when it was used to write Coptic Egyptian.
By 2700 B.C.E. the ancient Egyptians had developed a set of some 22 hieroglyphs to represent the individual consonants of their language, plus a 23rd that seems to have represented word-initial or word-final vowels. These glyphs were used as pronunciation guides for logograms, to write grammatical inflections, and, later, to transcribe loan words and foreign names. However, although alphabetic in nature, the system was not used for purely alphabetic writing. That is, while capable of being used as an alphabet, it was in fact always used with a strong logographic component, presumably due to strong cultural attachment to the complex Egyptian script.
The Middle Bronze Age scripts of Egypt have yet to be deciphered. However, they appear to be at least partially, and perhaps completely, alphabetic. The oldest examples are found as graffiti from central Egypt and date to around 1800 B.C.E. These inscriptions, according to Gordon J. Hamilton, help to show that the most likely place for the alphabet’s invention was in Egypt proper.
The first purely alphabetic script is thought to have been developed by 2000 B.C.E. for Semitic workers in central Egypt. Over the next five centuries it spread north, and all subsequent alphabets around the world have either descended from it, or been inspired by one of its descendants, with the possible exception of the Meroitic alphabet, a third century B.C.E. adaptation of hieroglyphs in Nubia to the south of Egypt.
The apparently "alphabetic" system known as the Proto-Sinaitic script appears in Egyptian turquoise mines in the Sinai peninsula dated to the fifteenth century B.C.E., apparently left by Canaanite workers. An even earlier version of this first alphabet was discovered at Wadi el-Hol and dated to circa 1800 B.C.E. This alphabet showed evidence of having been adapted from specific forms of Egyptian hieroglyphs dated to circa 2000 B.C.E., suggesting that the first alphabet had been developed around that time. Based on letter appearances and names, it is believed to be based on Egyptian hieroglyphs. This script had no characters representing vowels. An alphabetic cuneiform script with 30 signs including three which indicate the following vowel was invented in Ugarit before the fifteenth century B.C.E. This script was not used after the destruction of Ugarit.
This Semitic script did not restrict itself to the existing Egyptian consonantal signs, but incorporated a number of other Egyptian hieroglyphs, for a total of perhaps thirty, and used Semitic names for them. However, by the time the script was inherited by the Canaanites, it was purely alphabetic. For example, the hieroglyph originally representing "house" stood only for b.
The Proto-Sinaitic script eventually developed into the Phoenician alphabet, which is conventionally called "Proto-Canaanite" before 1050 B.C.E. The oldest text in Phoenician script is an inscription on the sarcophagus of King Ahiram. This script is the parent script of all western alphabets. By the tenth century two other forms can be distinguished namely Canaanite and Aramaic, which then gave rise to Hebrew. The South Arabian alphabet, a sister script to the Phoenician alphabet, is the script from which the Ge'ez alphabet (an abugida) is descended.
The Proto-Sinatic or Proto Canaanite script and the Ugaritic script were the first scripts with limited number of signs, in contrast to the other widely used writing systems at the time, Cuneiform, Egyptian hieroglyphs, and Linear B. The Phoenician script was probably the first phonemic script and it contained only about two dozen distinct letters, making it a script simple enough for common traders to learn. Another advantage of Phoenician was that it could be used to write down many different languages, since it recorded words phonemically.
The script was spread by the Phoenicians across the Mediterranean. In Greece, it was modified to add the vowels, giving rise to the ancestor of all alphabets in the West. The Greeks took letters which did not represent sounds that existed in Greek, and changed them to represent the vowels. The syllabical Linear B script which was used by the Mycenaean Greeks from the sixteenth century B.C.E. had 87 symbols including 5 vowels. In its early years, there were many variants of the Greek alphabet, a situation which caused many different alphabets to evolve from it.
The Phoenician and Aramaic alphabets, like their Egyptian prototype, represented only consonants, a system called an abjad. The Aramaic alphabet, which evolved from the Phoenician in the seventh century B.C.E. as the official script of the Persian Empire, appears to be the ancestor of nearly all the modern alphabets of Asia:
A true alphabet has letters for the vowels of a language as well as the consonants. The first "true alphabet" in this sense is believed to be the Greek alphabet which was modified from the Phoenician alphabet to include vowels.
The Greek alphabet was then carried over by Greek colonists to the Italian peninsula, where it gave rise to a variety of alphabets used to write the Italic languages. One of these became the Latin alphabet, which was spread across Europe as the Romans expanded their empire. Even after the fall of the Roman state, the alphabet survived in intellectual and religious works. It eventually became used for the descendant languages of Latin (the Romance languages) and then for most of the other languages of Europe.
By at least the eighth century B.C.E. the Greeks had borrowed the Phoenician alphabet and adapted it to their own language. The letters of the Greek alphabet are the same as those of the Phoenician alphabet, and both alphabets are arranged in the same order. However, whereas separate letters for vowels would have actually hindered the legibility of Egyptian, Phoenician, or Hebrew, their absence was problematic for Greek, where vowels played a much more important role. The Greeks chose Phoenician letters representing sounds that did not exist in Greek to represent their vowels. For example, the Greeks had no glottal stop or h, so the Phoenician letters ’alep and he became Greek alpha and e (later renamed epsilon), and stood for the vowels /a/ and /e/ rather than the Phoenician consonants. This provided for five or six (depending on dialect) of the twelve Greek vowels, and so the Greeks eventually created digraphs and other modifications, such as ei, ou, and o (which became omega), or in some cases simply ignored the deficiency, as in long a, i, u.
Several varieties of the Greek alphabet developed. One, known as Western Greek or Chalcidian, was west of Athens and in southern Italy. The other variation, known as Eastern Greek, was used in present-day Turkey, and the Athenians, and eventually the rest of the world that spoke Greek, adopted this variation. After first writing right to left, the Greeks eventually chose to write from left to right, unlike the Phoenicians who wrote from right to left.
A tribe known as the Latins, who became known as the Romans, also lived in the Italian peninsula like the Western Greeks. From the Etruscans, a tribe living in the first millennium B.C.E. in central Italy, and the Western Greeks, the Latins adopted writing in about the fifth century. In adopted writing from these two groups, the Latins dropped four characters from the Western Greek alphabet. They also adapted the Etruscan letter F, pronounced 'w,' giving it the 'f' sound, and the Etruscan S, which had three zigzag lines, was curved to make the modern S. To represent the G sound in Greek and the K sound in Etruscan, the Gamma was used. These changes produced the modern alphabet without the letters G, J, U, W, Y, and Z, as well as some other differences.
Over the few centuries after Alexander the Great conquered the Eastern Mediterranean and other areas in the third century B.C.E., the Romans began to borrow Greek words, so they had to adapt their alphabet again in order to write these words. From the Eastern Greek alphabet, they borrowed Y and Z, which were added to the end of the alphabet because the only time they were used was to write Greek words.
When the Anglo-Saxon language began to be written using Roman letters after Britain was invaded by the Normans in the eleventh century further modifications were made: W was placed in the alphabet by V. U developed when people began to use the rounded U when they meant the vowel u and the pointed V when the meant the consonant V. J began as a variation of I, in which a long tail was added to the final I when there were several in a row. People began to use the J for the consonant and the I for the vowel by the fifteenth century, and it was fully accepted in the mid-seventeenth century.
Some adaptations of the Latin alphabet are augmented with ligatures, such as æ in Old English and Icelandic and Ȣ in Algonquian; by borrowings from other alphabets, such as the thorn þ in Old English and Icelandic, which came from the Futhark runes; and by modifying existing letters, such as the eth ð of Old English and Icelandic, which is a modified d. Other alphabets only use a subset of the Latin alphabet, such as Hawaiian, and Italian, which uses the letters j, k, x, y and w only in foreign words.
Another notable script is Elder Futhark, which is believed to have evolved out of one of the Old Italic alphabets. Elder Futhark gave rise to a variety of alphabets known collectively as the Runic alphabets. The Runic alphabets were used for Germanic languages from 100 C.E. to the late Middle Ages. Its usage is mostly restricted to engravings on stone and jewelry, although inscriptions have also been found on bone and wood. These alphabets have since been replaced with the Latin alphabet, except for decorative usage for which the runes remained in use until the twentieth century.
The Old Hungarian script is a contemporary writing system of the Hungarians. It was in use during the entire history of Hungary, albeit not as an official writing system. From the nineteenth century it once again became more popular.
The Glagolitic alphabet was the initial script of the liturgical language Old Church Slavonic and became, together with the Greek uncial script, the basis of the Cyrillic script. Cyrillic is one of the most widely used modern alphabetic scripts, and is notable for its use in Slavic languages and also for other languages within the former Soviet Union. Cyrillic alphabets include the Serbian, Macedonian, Bulgarian, and Russian alphabets. The Glagolitic alphabet is believed to have been created by Saints Cyril and Methodius, while the Cyrillic alphabet was invented by the Bulgarian scholar Clement of Ohrid, who was their disciple. They feature many letters that appear to have been borrowed from or influenced by the Greek alphabet and the Hebrew alphabet.
Beyond the logographic Chinese writing, many phonetic scripts are in existence in Asia. The Arabic alphabet, Hebrew alphabet, Syriac alphabet, and other abjads of the Middle East are developments of the Aramaic alphabet, but because these writing systems are largely consonant-based they are often not considered true alphabets.
Most alphabetic scripts of India and Eastern Asia are descended from the Brahmi script, which is often believed to be a descendant of Aramaic.
Zhuyin (sometimes called Bopomofo) is a semi-syllabary used to phonetically transcribe Mandarin Chinese in the Republic of China. After the later establishment of the People's Republic of China and its adoption of Hanyu Pinyin, the use of Zhuyin today is limited, but it is still widely used in Taiwan where the Republic of China still governs. Zhuyin developed out of a form of Chinese shorthand based on Chinese characters in the early 1900s and has elements of both an alphabet and a syllabary. Like an alphabet the phonemes of syllable initials are represented by individual symbols, but like a syllabary the phonemes of the syllable finals are not; rather, each possible final (excluding the medial glide) is represented by its own symbol. For example, luan is represented as ㄌㄨㄢ (l-u-an), where the last symbol ㄢ represents the entire final -an. While Zhuyin is not used as a mainstream writing system, it is still often used in ways similar to a romanization system—that is, for aiding in pronunciation and as an input method for Chinese characters on computers and cellphones.
In Korea, the Hangul alphabet was created by Sejong the Great Hangul is a unique alphabet: it is a featural alphabet, where many of the letters are designed from a sound's place of articulation (for example P to look like the widened mouth, L to look like the tongue pulled in); its design was planned by the government of the day; and it places individual letters in syllable clusters with equal dimensions (one syllable always takes up one type-space no matter how many letters get stacked into building that one sound-block).
European alphabets, especially Latin and Cyrillic, have been adapted for many languages of Asia. Arabic is also widely used, sometimes as an abjad (as with Urdu and Persian) and sometimes as a complete alphabet (as with Kurdish and Uyghur).
The term "alphabet" is used by linguists and paleographers in both a wide and a narrow sense. In the wider sense, an alphabet is a script that is segmental at the phoneme level—that is, it has separate glyphs for individual sounds and not for larger units such as syllables or words. In the narrower sense, some scholars distinguish "true" alphabets from two other types of segmental script, abjads and abugidas. These three differ from each other in the way they treat vowels: abjads have letters for consonants and leave most vowels unexpressed; abugidas are also consonant-based, but indicate vowels with diacritics to or a systematic graphic modification of the consonants. In alphabets in the narrow sense, on the other hand, consonants and vowels are written as independent letters. The earliest known alphabet in the wider sense is the Wadi el-Hol script, believed to be an abjad, which through its successor Phoenician is the ancestor of modern alphabets, including Arabic, Greek, Latin (via the Old Italic alphabet), Cyrillic (via the Greek alphabet) and Hebrew (via Aramaic).
Examples of present-day abjads are the Arabic and Hebrew scripts; true alphabets include Latin, Cyrillic, and Korean hangul; and abugidas are used to write Tigrinya, Amharic, Hindi, and Thai. The Canadian Aboriginal syllabics are also an abugida rather than a syllabary as their name would imply, since each glyph stands for a consonant which is modified by rotation to represent the following vowel. (In a true syllabary, each consonant-vowel combination would be represented by a separate glyph.)
All three types may be augmented with syllabic glyphs. Ugaritic, for example, is basically an abjad, but has syllabic letters for /ʔa, ʔi, ʔu/. (These are the only time vowels are indicated.) Cyrillic is basically a true alphabet, but has syllabic letters for /ja, je, ju/ (я, е, ю); Coptic has a letter for /ti/. Devanagari is typically an abugida augmented with dedicated letters for initial vowels, though some traditions use अ as a zero consonant as the graphic base for such vowels.
The boundaries between the three types of segmental scripts are not always clear-cut. For example, Sorani Kurdish is written in the Arabic script, which is normally an abjad. However, in Kurdish, writing the vowels is mandatory, and full letters are used, so the script is a true alphabet. Other languages may use a Semitic abjad with mandatory vowel diacritics, effectively making them abugidas. On the other hand, the Phagspa script of the Mongol Empire was based closely on the Tibetan abugida, but all vowel marks were written after the preceding consonant rather than as diacritic marks. Although short a was not written, as in the Indic abugidas, one could argue that the linear arrangement made this a true alphabet. Conversely, the vowel marks of the Tigrinya abugida and the Amharic abugida (ironically, the original source of the term "abugida") have been so completely assimilated into their consonants that the modifications are no longer systematic and have to be learned as a syllabary rather than as a segmental script. Even more extreme, the Pahlavi abjad eventually became logographic. (See below.)
Thus the primary classification of alphabets reflects how they treat vowels. For tonal languages, further classification can be based on their treatment of tone, though names do not yet exist to distinguish the various types. Some alphabets disregard tone entirely, especially when it does not carry a heavy functional load, as in Somali and many other languages of Africa and the Americas. Such scripts are to tone what abjads are to vowels. Most commonly, tones are indicated with diacritics, the way vowels are treated in abugidas. This is the case for Vietnamese (a true alphabet) and Thai (an abugida). In Thai, tone is determined primarily by the choice of consonant, with diacritics for disambiguation. In the Pollard script, an abugida, vowels are indicated by diacritics, but the placement of the diacritic relative to the consonant is modified to indicate the tone. More rarely, a script may have separate letters for tones, as is the case for Hmong and Zhuang. For most of these scripts, regardless of whether letters or diacritics are used, the most common tone is not marked, just as the most common vowel is not marked in Indic abugidas; in Zhuyin not only is one of the tones unmarked, but there is a diacritic to indicate lack of tone, like the virama of Indic.
The number of letters in an alphabet can be quite small. The Book Pahlavi script, an abjad, had only twelve letters at one point, and may have had even fewer later on. Today the Rotokas alphabet has only twelve letters. (The Hawaiian alphabet is sometimes claimed to be as small, but it actually consists of 18 letters, including the ʻokina and five long vowels.) While Rotokas has a small alphabet because it has few phonemes to represent (just eleven), Book Pahlavi was small because many letters had been conflated—that is, the graphic distinctions had been lost over time, and diacritics were not developed to compensate for this as they were in Arabic, another script that lost many of its distinct letter shapes. For example, a comma-shaped letter represented g, d, y, k, or j. However, such apparent simplifications can perversely make a script more complicated. In later Pahlavi papyri, up to half of the remaining graphic distinctions of these twelve letters were lost, and the script could no longer be read as a sequence of letters at all, but instead each word had to be learned as a whole—that is, they had become logograms as in Egyptian Demotic. The alphabet in the Polish language contains 32 letters.
The largest segmental script is probably an abugida, Devanagari. When written in Devanagari, Vedic Sanskrit has an alphabet of 53 letters, including the visarga mark for final aspiration and special letters for kš and jñ, though one of the letters is theoretical and not actually used. The Hindi alphabet must represent both Sanskrit and modern vocabulary, and so has been expanded to 58 with the khutma letters (letters with a dot added) to represent sounds from Persian and English.
The largest known abjad is Sindhi, with 51 letters. The largest alphabets in the narrow sense include Kabardian and Abkhaz (for Cyrillic), with 58 and 56 letters, respectively, and Slovak (for the Latin script), with 46. However, these scripts either count di- and tri-graphs as separate letters, as Spanish did with ch and ll until recently, or uses diacritics like Slovak č. The largest true alphabet where each letter is graphically independent is probably Georgian, with 41 letters.
Syllabaries typically contain 50 to 400 glyphs, and the glyphs of logographic systems typically number from the many hundreds into the thousands. Thus a simple count of the number of distinct symbols is an important clue to the nature of an unknown script.
The Phoenician letter names, in which each letter was associated with a word that begins with that sound, continue to be used to varying degrees in Samaritan, Aramaic, Syriac, Hebrew, Greek and Arabic. The names were abandoned in Latin, which instead referred to the letters by adding a vowel (usually e) before or after the consonant (the exception is zeta, which was retained from Greek). In Cyrillic originally the letters were given names based on Slavic words; this was later abandoned as well in favor of a system similar to that used in Latin.
When an alphabet is adopted or developed for use in representing a given language, an orthography generally comes into being, providing rules for the spelling of words in that language. In accordance with the principle on which alphabets are based, these rules will generally map letters of the alphabet to the phonemes (significant sounds) of the spoken language. In a perfectly phonemic orthography there would be a consistent one-to-one correspondence between the letters and the phonemes, so that a writer could predict the spelling of a word given its pronunciation, and a speaker could predict the pronunciation of a word given its spelling. However this ideal is not normally achieved in practice; some languages (such as Spanish and Finnish) come close to it, while others (such as English) deviate from it to a much larger degree.
Languages may fail to achieve a one-to-one correspondence between letters and sounds in any of several ways:
National languages generally elect to address the problem of dialects by simply associating the alphabet with the national standard. However, with an international language with wide variations in its dialects, such as English, it would be impossible to represent the language in all its variations with a single phonetic alphabet.
Some national languages like Finnish, Turkish, Serbo-Croatian (Serbian, Croatian and Bosnian), and Bulgarian have a very regular spelling system with a nearly one-to-one correspondence between letters and phonemes. Strictly speaking, these national languages lack a word corresponding to the verb "to spell" (meaning to split a word into its letters), the closest match being a verb meaning to split a word into its syllables. Similarly, the Italian verb corresponding to 'spell (out)', compitare, is unknown to many Italians because the act of spelling itself is rarely needed since Italian spelling is highly phonemic. In standard Spanish, it is possible to tell the pronunciation of a word from its spelling, but not vice versa; this is because certain phonemes can be represented in more than one way, but a given letter is consistently pronounced. French, with its silent letters and its heavy use of nasal vowels and elision, may seem to lack much correspondence between spelling and pronunciation, but its rules on pronunciation, though complex, are consistent and predictable with a fair degree of accuracy.
At the other extreme are languages such as English, where the spelling of many words simply has to be memorized as they do not correspond to sounds in a consistent way. For English, this is partly because the Great Vowel Shift occurred after the orthography was established, and because English has acquired a large number of loanwords at different times, retaining their original spelling at varying levels. Even English has general, albeit complex, rules that predict pronunciation from spelling, and these rules are successful most of the time; rules to predict spelling from the pronunciation have a higher failure rate.
Sometimes, countries have the written language undergo a spelling reform to realign the writing with the contemporary spoken language. These can range from simple spelling changes and word forms to switching the entire writing system itself, as when Turkey switched from the Arabic alphabet to a Turkish alphabet of Latin origin.
The sounds of speech of all languages of the world can be written by a rather small universal phonetic alphabet. A standard for this is the International Phonetic Alphabet.
Alphabets often come to be associated with a standard ordering of their letters, which can then be used for purposes of collation – namely for the listing of words and other items in what is called alphabetical order. Thus, the basic ordering of the Latin alphabet (ABCDEFGHIJKLMNOPQRSTUVWXYZ), for example, is well established, although languages using this alphabet have different conventions for their treatment of modified letters (such as the French é, à, and ô) and of certain combinations of letters (multigraphs). Some alphabets, such as Hanunoo, are learned one letter at a time, in no particular order, and are not used for collation where a definite order is required.
It is unknown if the earliest alphabets had a defined sequence. However, the order of the letters of the alphabet is attested from the fourteenth century B.C.E. Tablets discovered in Ugarit, located on Syria’s northern coast, preserve the alphabet in two sequences. One, the ABGDE order later used in Phoenician, has continued with minor changes in Hebrew, Greek, Armenian, Gothic, Cyrillic, and Latin; the other, HMĦLQ, was used in southern Arabia and is preserved today in Ethiopic. Both orders have therefore been stable for at least 3000 years.
The Brahmic family of alphabets used in India abandoned the inherited order for one based on phonology: The letters are arranged according to how and where they are produced in the mouth. This organization is used in Southeast Asia, Tibet, Korean hangul, and even Japanese kana, which is not an alphabet. The historical order was also abandoned in Runic and Arabic, although Arabic retains the traditional "abjadi order" for numbering.
All links retrieved August 16, 2016.
New World Encyclopedia writers and editors rewrote and completed the Wikipedia article in accordance with New World Encyclopedia standards. This article abides by terms of the Creative Commons CC-by-sa 3.0 License (CC-by-sa), which may be used and disseminated with proper attribution. Credit is due under the terms of this license that can reference both the New World Encyclopedia contributors and the selfless volunteer contributors of the Wikimedia Foundation. To cite this article click here for a list of acceptable citing formats.The history of earlier contributions by wikipedians is accessible to researchers here:
Note: Some restrictions may apply to use of individual images which are separately licensed.