University of California, Berkeley Statistics Major

University of California, Berkeley Statistics Major The undergraduate major at Berkeley provides a systematic and thorough grounding in applied and theoretical statistics, and in probability.

The Undergraduate Student Learning Goals can be found here: http://www.stat.berkeley.edu/529.

03/10/2022

Make your gift TODAY

The Statistics Graduation Video Celebration for the Class of 2021 will be streamed on YouTube  https://www.youtube.com/w...
05/21/2021

The Statistics Graduation Video Celebration for the Class of 2021 will be streamed on YouTube https://www.youtube.com/watch?v=tecRyt0BHZY TODAY, May 21st, 2021, at 4 PM PDT with Special Guest message!

Join us in celebrating the Class of 2021!

03/12/2021

Help the Statistics Department unlock our the pledge donation of $5000! There are less than 5 hours left!

05/07/2018

Four outstanding students have been recognized as the 2018 University Medal runners-up

One of our department's founding faculty...
02/13/2018

One of our department's founding faculty...

David Blackwell was a gifted statistician who, despite racial prejudice, taught countless of students about the ‘beauty’ of math.

10/23/2017

Today we feature a guest post by Bin Yu of UC Berkeley for :

Over the last few centuries, statistics has evolved from gathering demographic information about people to an awesome field of helping data-driven decisions in industry and data-driven knowledge generation in sciences and social sciences. For example, clinical trials are conducted by medical doctors and biostatisticians to provide evidence for the U.S. Food and Drug Administration to decide on a new drug release to save lives; supervised learning in machine learning (a frontier field of both statistics and computer science) is playing an important role from cancer diagnosis to self-driving car design.

The burgeoning field of data science is a re-merging of computational and statistical/inferential thinking. It heavily relies on statistics as well as computer science, mathematics, and domain knowledge to solve data problems. In 1890, the volume of US census data drove statistician Herman Hollerith to invent the Hollerith Tabulating Machine. His company with other three companies later formed IBM. It is timely to see an integration of ideas and concepts of statistics and computer science in two new undergraduate data science courses at Berkeley (http://data8.org/, http://www.ds100.org/).

The stability principle has emerged as a central principle of data science that builds on stability of knowledge on one hand, and on the other, connects to statistical inference or uncertainty assessment (Yu, “Stability”, Bernoulli, 2013). It is a minimum requirement for reproducibility and interpretability. In a nutshell, it makes it self-evident that data-driven decisions and knowledge should be stable relative to appropriate perturbations in data, models, methods algorithms, and ad-hoc human decisions in the data analysis cycle. It helps prevent p-hacking, model-hacking*, and false discoveries. It can be employed as early as the phase of exploratory data analysis and data visualization. Meaningful data patterns (e.g. a linear trend) should persist by using 80% of the data deemed as an appropriate perturbation. An appropriate data perturbation means a sample similar to the original data set, with similarity decided using information from domain knowledge and data collection process. If not, further investigations are warranted before the discovery of a linear trend is claimed to be a data result or discovery. This principle is conceptually simple to use and easily understood by data scientists and consumers of data results alike. Give it a try!

*Footnote: model-hacking is defined by the author as the phenomenon that one tries a large number of models (or algorithms) to find a desirable data result. It is a form of taking noise as results.



Learn more about Dr. Yu’s work at:
https://projecteuclid.org/download/pdfview_1/euclid.bj/1377612862

http://www.nasonline.org/member-directory/members/20022958.html

05/19/2017

Yeah, it really did... Congratulations to our Statistics Majors!

Address

367 Evans Hall, MC#3860
Berkeley, CA
94720

Alerts

Be the first to know and let us send you an email when University of California, Berkeley Statistics Major posts news and promotions. Your email address will not be used for any other purpose, and you can unsubscribe at any time.

Contact The University

Send a message to University of California, Berkeley Statistics Major:

Share