Topzle Topzle

Benford's law

Updated: Wikipedia source

Benford's law

Benford's law, also known as the Newcomb–Benford law, the law of anomalous numbers, or the first-digit law, is an observation that in many real-life sets of numerical data, the leading digit is likely to be small. In sets that obey the law, the number 1 appears as the leading significant digit about 30% of the time, while 9 appears as the leading significant digit less than 5% of the time. Uniformly distributed digits would each occur about 11.1% of the time. Benford's law also makes predictions about the distribution of second digits, third digits, digit combinations, and so on. Benford's law may be derived by assuming the dataset values are uniformly distributed on a logarithmic scale. The graph to the right shows Benford's law for base 10. Although a decimal base is most common, the result generalizes to any integer base greater than 2. Further generalizations published in 1995 included analogous statements for both the nth leading digit and the joint distribution of the leading n digits, the latter of which leads to a corollary wherein the significant digits are shown to be a statistically dependent quantity. It has been shown that this result applies to a wide variety of data sets, including electricity bills, street addresses, stock prices, house prices, population numbers, death rates, lengths of rivers, and physical and mathematical constants. Like other general principles about natural data—for example, the fact that many data sets are well approximated by a normal distribution—there are illustrative examples and explanations that cover many of the cases where Benford's law applies, though there are many other cases where Benford's law applies that resist simple explanations. Benford's law tends to be most accurate when values are distributed across multiple orders of magnitude, especially if the process generating the numbers is described by a power law (which is common in nature). The law is named after physicist Frank Benford, who stated it in 1938 in an article titled "The Law of Anomalous Numbers", although it had been previously stated by Simon Newcomb in 1881. The law is similar in concept, though not identical in distribution, to Zipf's law.

Tables

· Definition
1
1
mw- d
1
⁠ P ( d ) {\displaystyle P(d)} ⁠
30.1%
2
2
mw- d
2
⁠ P ( d ) {\displaystyle P(d)} ⁠
17.6%
3
3
mw- d
3
⁠ P ( d ) {\displaystyle P(d)} ⁠
12.5%
4
4
mw- d
4
⁠ P ( d ) {\displaystyle P(d)} ⁠
9.7%
5
5
mw- d
5
⁠ P ( d ) {\displaystyle P(d)} ⁠
7.9%
6
6
mw- d
6
⁠ P ( d ) {\displaystyle P(d)} ⁠
6.7%
7
7
mw- d
7
⁠ P ( d ) {\displaystyle P(d)} ⁠
5.8%
8
8
mw- d
8
⁠ P ( d ) {\displaystyle P(d)} ⁠
5.1%
9
9
mw- d
9
⁠ P ( d ) {\displaystyle P(d)} ⁠
4.6%
mw- d
⁠ P ( d ) {\displaystyle P(d)} ⁠
Relative size of ⁠ P ( d ) {\displaystyle P(d)} ⁠
1
30.1%
2
17.6%
3
12.5%
4
9.7%
5
7.9%
6
6.7%
7
5.8%
8
5.1%
9
4.6%
· Examples
Count
Count
Leading digit
Count
m
Share
m
Count
ft
Share
1
1
Leading digit
1
m
23
m
39.7 %
ft
15
ft
25.9 %
Per Benford's law
30.1 %
2
2
Leading digit
2
m
12
m
20.7 %
ft
8
ft
13.8 %
Per Benford's law
17.6 %
3
3
Leading digit
3
m
6
m
10.3 %
ft
5
ft
8.6 %
Per Benford's law
12.5 %
4
4
Leading digit
4
m
5
m
8.6 %
ft
7
ft
12.1 %
Per Benford's law
9.7 %
5
5
Leading digit
5
m
2
m
3.4 %
ft
9
ft
15.5 %
Per Benford's law
7.9 %
6
6
Leading digit
6
m
5
m
8.6 %
ft
4
ft
6.9 %
Per Benford's law
6.7 %
7
7
Leading digit
7
m
1
m
1.7 %
ft
3
ft
5.2 %
Per Benford's law
5.8 %
8
8
Leading digit
8
m
4
m
6.9 %
ft
6
ft
10.3 %
Per Benford's law
5.1 %
9
9
Leading digit
9
m
0
m
0 %
ft
1
ft
1.7 %
Per Benford's law
4.6 %
Leading digit
m
ft
Per Benford's law
Count
Share
Count
Share
1
23
39.7 %
15
25.9 %
30.1 %
2
12
20.7 %
8
13.8 %
17.6 %
3
6
10.3 %
5
8.6 %
12.5 %
4
5
8.6 %
7
12.1 %
9.7 %
5
2
3.4 %
9
15.5 %
7.9 %
6
5
8.6 %
4
6.9 %
6.7 %
7
1
1.7 %
3
5.2 %
5.8 %
8
4
6.9 %
6
10.3 %
5.1 %
9
0
0 %
1
1.7 %
4.6 %
· Examples
Count
Count
Leading digit
Count
Occurrence
Share
1
1
Leading digit
1
Occurrence
29
Occurrence
30.2 %
Per Benford's law
30.1 %
2
2
Leading digit
2
Occurrence
17
Occurrence
17.7 %
Per Benford's law
17.6 %
3
3
Leading digit
3
Occurrence
12
Occurrence
12.5 %
Per Benford's law
12.5 %
4
4
Leading digit
4
Occurrence
10
Occurrence
10.4 %
Per Benford's law
9.7 %
5
5
Leading digit
5
Occurrence
7
Occurrence
7.3 %
Per Benford's law
7.9 %
6
6
Leading digit
6
Occurrence
6
Occurrence
6.3 %
Per Benford's law
6.7 %
7
7
Leading digit
7
Occurrence
5
Occurrence
5.2 %
Per Benford's law
5.8 %
8
8
Leading digit
8
Occurrence
5
Occurrence
5.2 %
Per Benford's law
5.1 %
9
9
Leading digit
9
Occurrence
5
Occurrence
5.2 %
Per Benford's law
4.6 %
Leading digit
Occurrence
Per Benford's law
Count
Share
1
29
30.2 %
30.1 %
2
17
17.7 %
17.6 %
3
12
12.5 %
12.5 %
4
10
10.4 %
9.7 %
5
7
7.3 %
7.9 %
6
6
6.3 %
6.7 %
7
5
5.2 %
5.8 %
8
5
5.2 %
5.1 %
9
5
5.2 %
4.6 %
Kuiper
Kuiper
⍺ Test
Kuiper
0.10
1.191
0.05
1.321
0.01
1.579
Kolmogorov–Smirnov
Kolmogorov–Smirnov
⍺ Test
Kolmogorov–Smirnov
0.10
1.012
0.05
1.148
0.01
1.420
⍺ Test
0.10
0.05
0.01
Kuiper
1.191
1.321
1.579
Kolmogorov–Smirnov
1.012
1.148
1.420
Leemis's m
Leemis's m
⍺Statistic
Leemis's m
0.10
0.851
0.05
0.967
0.01
1.212
Cho & Gaines's d
Cho & Gaines's d
⍺Statistic
Cho & Gaines's d
0.10
1.212
0.05
1.330
0.01
1.569
⍺Statistic
0.10
0.05
0.01
Leemis's m
0.851
0.967
1.212
Cho & Gaines's d
1.212
1.330
1.569
· Generalization to digits beyond the first
1st
1st
Digit
1st
0
1
30.1%
2
17.6%
3
12.5%
4
9.7%
5
7.9%
6
6.7%
7
5.8%
8
5.1%
9
4.6%
2nd
2nd
Digit
2nd
0
12.0%
1
11.4%
2
10.9%
3
10.4%
4
10.0%
5
9.7%
6
9.3%
7
9.0%
8
8.8%
9
8.5%
3rd
3rd
Digit
3rd
0
10.2%
1
10.1%
2
10.1%
3
10.1%
4
10.0%
5
10.0%
6
9.9%
7
9.9%
8
9.9%
9
9.8%
Digit
0
1
2
3
4
5
6
7
8
9
1st
30.1%
17.6%
12.5%
9.7%
7.9%
6.7%
5.8%
5.1%
4.6%
2nd
12.0%
11.4%
10.9%
10.4%
10.0%
9.7%
9.3%
9.0%
8.8%
8.5%
3rd
10.2%
10.1%
10.1%
10.1%
10.0%
10.0%
9.9%
9.9%
9.9%
9.8%

References

  1. They should strictly be bars but are shown as lines for clarity.
  2. Berger and Hill describe this argument but say it "still leaves open the question of why it is reasonable to assume that
  3. In general, the sequence k1, k2, k3, etc., satisfies Benford's law exactly, under the condition that log10 k is an irrat
  4. The Mathematical Intelligencer
    https://doi.org/10.1007/s00283-010-9182-3
  5. MathWorld, A Wolfram web resource
    http://mathworld.wolfram.com/BenfordsLaw.html
  6. Statistical Science
    https://projecteuclid.org/euclid.ss/1177009869
  7. Paul H. Kvam, Brani Vidakovic, Nonparametric Statistics with Applications to Science and Engineering, p. 158.
  8. Stat. Methods Appl
    https://arxiv.org/abs/1909.07527
  9. The American Mathematical Monthly
    https://arxiv.org/abs/1907.08894
  10. Proc. Am. Philos. Soc.
    https://ui.adsabs.harvard.edu/abs/1938PAPhS..78..551B
  11. American Journal of Mathematics
    https://ui.adsabs.harvard.edu/abs/1881AmJM....4...39N
  12. PLOS ONE
    https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2866333
  13. Benford's Law: Theory and Applications
    https://books.google.com/books?id=J_NnBgAAQBAJ&pg=309
  14. Maxwell Consulting, LLC
    http://www.maxwell-consulting.com/Benford_Logarithmic_Transformation.pdf
  15. Transformation Invariance of Benford Variables and their Numerical Modeling
  16. American Mathematical Monthly
    https://doi.org/10.2307%2F2319349
  17. Statistical Science
    http://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=1042&context=rgp_rsr
  18. Proceedings of the American Mathematical Society
    https://doi.org/10.1090%2FS0002-9939-1995-1233974-8
  19. The Scientist and Engineer's Guide to Digital Signal Processing
    http://www.dspguide.com/ch34.htm
  20. The American Statistician
    https://www.stat.auckland.ac.nz/~fewster/RFewster_Benford.pdf
  21. Arno Berger and Theodore P. Hill, Benford's Law Strikes Back: No Simple Explanation in Sight for Mathematical Gem, 2011.
    http://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=1074&context=rgp_rsr
  22. Results in Physics
    https://doi.org/10.1016%2Fj.rinp.2015.11.010
  23. Transactions of the American Mathematical Society
    https://doi.org/10.1090%2FS0002-9947-1970-0259068-3
  24. Entropy in Dynamical Systems
    https://books.google.com/books?id=avUGMc787v8C&pg=PA106
  25. Ergodic Theory, Entropy
    https://doi.org/10.1007%2FBFb0066096
  26. Journal of Mathematical Imaging and Vision
    https://ui.adsabs.harvard.edu/abs/2001JMIV...14...73J
  27. American Journal of Physics
    https://arxiv.org/abs/1604.05715
  28. Physica A
    https://arxiv.org/abs/cond-mat/9808305
  29. PLOS ONE
    https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2866333
  30. American Scientist
    http://people.math.gatech.edu/~hill/publications/PAPER%20PDFS/TheFirstDigitPhenomenonAmericanScientist1996.pdf
  31. Journal of Applied Probability
    https://web.archive.org/web/20160304125725/http://lmrs.univ-rouen.fr/Persopage/Delarue/Publis/PDF/uniform_distribution_to_Benford_law.pdf
  32. Ann. Math. Statist
    http://projecteuclid.org/DPubS?service=UI&version=1.0&verb=Display&handle=euclid.aoms/1177704862
  33. mathworld.wolfram.com
    https://mathworld.wolfram.com/BenfordsLaw.html
  34. Imperial College of London
    https://wwwf.imperial.ac.uk/~nadams/classificationgroup/Benfords-Law.pdf
  35. Probability Surveys
    https://projecteuclid.org/download/pdfview_1/euclid.ps/1311860830
  36. The American Statistician
    https://doi.org/10.1080%2F00031305.1972.10478934
  37. Radio Lab
    https://www.wnycstudios.org/story/91699-from-benford-to-erdos
  38. Walter R. Mebane, Jr., "Election Forensics: Vote Counts and Benford's Law" (July 18, 2006).
    http://www-personal.umich.edu/~wmebane/pm06.pdf
  39. "Election forensics", The Economist (February 22, 2007).
    https://www.economist.com/science-and-technology/2007/02/22/election-forensics
  40. Political Analysis
    https://www.cambridge.org/core/journals/political-analysis/article/benfords-law-and-the-detection-of-election-fraud/3B1D64E822371C461AF3C61CE91AAF6D
  41. Political Analysis
    https://www.cambridge.org/core/journals/political-analysis/article/comment-on-benfords-law-and-the-detection-of-election-fraud/BC29680D8B5469A54C7C9D865029FE7C
  42. Stephen Battersby Statistics hint at fraud in Iranian election New Scientist 24 June 2009
    https://www.newscientist.com/article/mg20227144.000-statistics-hint-at-fraud-in-iranian-election.html
  43. Walter R. Mebane, Jr., "Note on the presidential election in Iran, June 2009" (University of Michigan, June 29, 2009), p
    http://www-personal.umich.edu/~wmebane/note22jun2009.pdf
  44. Journal of Applied Statistics
    https://arxiv.org/abs/0906.2789
  45. Bernd Beber and Alexandra Scacco, "The Devil Is in the Digits: Evidence That Iran's Election Was Rigged", The Washington
    https://www.washingtonpost.com/wp-dyn/content/article/2009/06/20/AR2009062000004.html
  46. Mark J. Nigrini, Benford's Law: Applications for Forensic Accounting, Auditing, and Fraud Detection (Hoboken, NJ: Wiley,
  47. Walter R. Mebane, Jr., "Election Forensics: The Second-Digit Benford's Law Test and Recent American Presidential Electio
    http://www-personal.umich.edu/~wmebane/fraud06.pdf
  48. Jahrbücher für Nationalökonomie und Statistik
    https://doi.org/10.1515%2Fjbnst-2011-5-610
  49. Reuters
    https://www.reuters.com/article/uk-factcheck-benford/fact-check-deviation-from-benfords-law-does-not-prove-election-fraud-idUSKBN27Q3AI
  50. "Benford's law and the 2020 US presidential election: nothing out of the ordinary"
    https://physicsworld.com/a/benfords-law-and-the-2020-us-presidential-election-nothing-out-of-the-ordinary/
  51. William Goodman, The promises and pitfalls of Benford's law, Significance, Royal Statistical Society (June 2016), p. 38.
    https://rss.onlinelibrary.wiley.com/doi/pdf/10.1111/j.1740-9713.2016.00919.x
  52. The Guardian
    https://www.theguardian.com/commentisfree/2011/sep/16/bad-science-dodgy-stats
  53. International Journal of Research in Marketing
    https://doi.org/10.1016%2Fj.ijresmar.2005.09.002
  54. PLOS ONE
    https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3356352
  55. J Appl Stat
    https://ui.adsabs.harvard.edu/abs/2007JApSt..34..321D
  56. Journal of Informetrics
    https://www.sciencedirect.com/science/article/pii/S1751157721000341
  57. PLOS ONE
    https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11952760
  58. Journal of the Royal Statistical Society, Series B
    https://doi.org/10.1111%2Fj.2517-6161.1970.tb00821.x
  59. Benford's Law, families of distributions and a test basis
    http://cep.lse.ac.uk/_new/publications/series.asp?prog=CEP
  60. The American Statistician
    https://doi.org/10.1080%2F00031305.2000.10474554
  61. The American Statistician
    https://doi.org/10.1198%2F000313007X223496
  62. J. Forensic & Investigative Accounting
  63. Statistics & Probability Letters
    https://www.researchgate.net/publication/312481284
  64. The Fibonacci Quarterly
    https://doi.org/10.1080%2F00150517.1981.12430109
  65. The Fibonacci Quarterly
    https://doi.org/10.1080%2F00150517.1967.12431312
  66. Sankhya B
  67. American Mathematical Monthly
    https://doi.org/10.2307%2F2319349
  68. phys.org
    https://phys.org/news/2009-05-pattern-prime.html
  69. The American Statistician
    https://doi.org/10.1198%2F000313007X223496
  70. J. Clin. Pathol.
    https://doi.org/10.1136%2Fjcp.2008.061721
  71. Singleton, Tommie W. (May 1, 2011). "Understanding and Applying Benford's Law", ISACA Journal, Information Systems Audit
    https://www.isaca.org/resources/isaca-journal/past-issues/2011/understanding-and-applying-benfords-law
  72. J Forensic Accounting
  73. Electronic Communications in Probability
    https://arxiv.org/abs/0705.4488
  74. The American Mathematical Monthly
    http://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=1041&context=rgp_rsr
  75. Scott, P.D.; Fasli, M. (2001) "Benford's Law: An empirical investigation and a novel explanation" Archived 13 December 2
    http://dces.essex.ac.uk/technical-reports/2001/CSM-349.pdf
  76. Journal of Forensic and Investigative Accounting
    https://web.archive.org/web/20181007005451/http://www.bus.lsu.edu/accounting/faculty/lcrumbley/jfia/Articles/Abstracts/abs_2010v2n2a7.pdf
Image
Source:
Tip: Wheel or +/− to zoom, drag to pan, Esc to close.