Zipf's law applied to word and letter frequencies

Full metadata record
Author(s): dc.creator: Zenil, Hector
Date accepted:
Date available:
Date submitted:
Full source of the material: dc.identifier
Source: dc.identifier.uri
Description: dc.description: The frequency of words and letters in bodies of text has been studied extensively for several purposes, cryptography among them. This Demonstration analyzes several texts, including fragments of popular works in several languages, and shows the distribution of frequencies sorted from most common to least common. Plotting word frequencies illustrates Zipf's law, a phenomenological law about the frequencies of ranked data, primarily in linguistic corpora. It says that the most frequent word occurs approximately twice as often as the second most frequent word, which in turn occurs approximately twice as often as the fourth most frequent word; that is, frequency is roughly inversely proportional to rank. When a log-log plot of frequency against rank approximates a straight line, the data follows this law. Random bodies of text have also been shown to exhibit a Zipf-like word frequency distribution, which suggests that the law is a statistical phenomenon rather than something specific to linguistics. The term has therefore come to refer to any of a family of related power-law probability distributions.
Description: dc.description: Curricular Component::Secondary Education::Mathematics
Language: dc.language: en
Publisher: dc.publisher: Wolfram Demonstrations Project
Relation: dc.relation: ZipfsLawAppliedToWordAndLetterFrequencies.nbp
Rights: dc.rights: Demonstration freeware using MathematicaPlayer
Keywords: dc.subject: Data analysis
Keywords: dc.subject: Basic Education::Secondary Education::Mathematics::Data analysis and probability
Title: dc.title: Zipf's law applied to word and letter frequencies
Appears in collections: Repositório Institucional - MEC BIOE

There are no files associated with this item.
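
The rank-frequency analysis described in the record's description can be sketched in a few lines of Python. This is only an illustration, not the Demonstration's own code. The function names `rank_frequencies` and `loglog_slope`, the simple tokenizing pattern, and the synthetic sample text are all assumptions made for the example. Under Zipf's law the fitted slope of log(frequency) against log(rank) should come out near -1.

```python
import math
import re
from collections import Counter


def rank_frequencies(text):
    """Return word counts sorted from most common to least common."""
    # Hypothetical tokenizer: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return [count for _, count in Counter(words).most_common()]


def loglog_slope(counts):
    """Least-squares slope of log(count) against log(rank), ranks from 1.

    Needs at least two ranks; a slope near -1 is what Zipf's law predicts.
    """
    xs = [math.log(rank) for rank in range(1, len(counts) + 1)]
    ys = [math.log(count) for count in counts]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic sample (an assumption, not one of the Demonstration's texts):
# the word at rank r appears round(120 / r) times, an idealized Zipf profile.
sample = " ".join(
    " ".join(["w" + chr(96 + r)] * round(120 / r)) for r in range(1, 9)
)

counts = rank_frequencies(sample)
slope = loglog_slope(counts)
print(counts)
print(slope)
```

With a real text, the same two calls apply to the text's contents; the fit is usually closest to a straight line over the middle ranks, so the slope for a short fragment may deviate noticeably from -1.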