Learning Bayesian Statistics

De: Alexandre Andorra
  • Sumário

  • Are you a researcher or data scientist / analyst / ninja? Do you want to learn Bayesian inference, stay up to date or simply want to understand what Bayesian inference is? Then this podcast is for you! You'll hear from researchers and practitioners of all fields about how they use Bayesian statistics, and how in turn YOU can apply these methods in your modeling workflow. When I started learning Bayesian methods, I really wished there were a podcast out there that could introduce me to the methods, the projects and the people who make all that possible. So I created "Learning Bayesian Statistics", where you'll get to hear how Bayesian statistics are used to detect black matter in outer space, forecast elections or understand how diseases spread and can ultimately be stopped. But this show is not only about successes -- it's also about failures, because that's how we learn best. So you'll often hear the guests talking about what *didn't* work in their projects, why, and how they overcame these challenges. Because, in the end, we're all lifelong learners! My name is Alex Andorra by the way, and I live in Estonia. By day, I'm a data scientist and modeler at the https://www.pymc-labs.io/ (PyMC Labs) consultancy. By night, I don't (yet) fight crime, but I'm an open-source enthusiast and core contributor to the python packages https://docs.pymc.io/ (PyMC) and https://arviz-devs.github.io/arviz/ (ArviZ). I also love https://www.pollsposition.com/ (election forecasting) and, most importantly, Nutella. But I don't like talking about it – I prefer eating it. So, whether you want to learn Bayesian statistics or hear about the latest libraries, books and applications, this podcast is for you -- just subscribe! You can also support the show and https://www.patreon.com/learnbayesstats (unlock exclusive Bayesian swag on Patreon)!
    Copyright Alexandre Andorra
    Exibir mais Exibir menos
Episódios
  • #115 Using Time Series to Estimate Uncertainty, with Nate Haines
    Sep 17 2024

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • My Intuitive Bayes Online Courses
    • 1:1 Mentorship with me

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • State space models and traditional time series models are well-suited to forecast loss ratios in the insurance industry, although actuaries have been slow to adopt modern statistical methods.
    • Working with limited data is a challenge, but informed priors and hierarchical models can help improve the modeling process.
    • Bayesian model stacking allows for blending together different model predictions and taking the best of both (or all if more than 2 models) worlds.
    • Model comparison is done using out-of-sample performance metrics, such as the expected log point-wise predictive density (ELPD). Brute leave-future-out cross-validation is often used due to the time-series nature of the data.
    • Stacking or averaging models are trained on out-of-sample performance metrics to determine the weights for blending the predictions. Model stacking can be a powerful approach for combining predictions from candidate models. Hierarchical stacking in particular is useful when weights are assumed to vary according to covariates.
    • BayesBlend is a Python package developed by Ledger Investing that simplifies the implementation of stacking models, including pseudo Bayesian model averaging, stacking, and hierarchical stacking.
    • Evaluating the performance of patient time series models requires considering multiple metrics, including log likelihood-based metrics like ELPD, as well as more absolute metrics like RMSE and mean absolute error.
    • Using robust variants of metrics like ELPD can help address issues with extreme outliers. For example, t-distribution estimators of ELPD as opposed to sample sum/mean estimators.
    • It is important to evaluate model performance from different perspectives and consider the trade-offs between different metrics. Evaluating models based solely on traditional metrics can limit understanding and trust in the model. Consider additional factors such as interpretability, maintainability, and productionization.
    • Simulation-based calibration (SBC) is a valuable tool for assessing parameter estimation and model correctness. It allows for the interpretation of model parameters and the identification of coding errors.
    • In industries like insurance, where regulations may restrict model choices, classical statistical approaches still play a significant role. However, there is potential for Bayesian methods and generative AI in certain areas.

    Exibir mais Exibir menos
    1 hora e 40 minutos
  • #114 From the Field to the Lab – A Journey in Baseball Science, with Jacob Buffa
    Sep 5 2024

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • My Intuitive Bayes Online Courses
    • 1:1 Mentorship with me

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Education and visual communication are key in helping athletes understand the impact of nutrition on performance.
    • Bayesian statistics are used to analyze player performance and injury risk.
    • Integrating diverse data sources is a challenge but can provide valuable insights.
    • Understanding the specific needs and characteristics of athletes is crucial in conditioning and injury prevention. The application of Bayesian statistics in baseball science requires experts in Bayesian methods.
    • Traditional statistical methods taught in sports science programs are limited.
    • Communicating complex statistical concepts, such as Bayesian analysis, to coaches and players is crucial.
    • Conveying uncertainties and limitations of the models is essential for effective utilization.
    • Emerging trends in baseball science include the use of biomechanical information and computer vision algorithms.
    • Improving player performance and injury prevention are key goals for the future of baseball science.

    Chapters:

    00:00 The Role of Nutrition and Conditioning

    05:46 Analyzing Player Performance and Managing Injury Risks

    12:13 Educating Athletes on Dietary Choices

    18:02 Emerging Trends in Baseball Science

    29:49 Hierarchical Models and Player Analysis

    36:03 Challenges of Working with Limited Data

    39:49 Effective Communication of Statistical Concepts

    47:59 Future Trends: Biomechanical Data Analysis and Computer Vision Algorithms

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde,...

    Exibir mais Exibir menos
    1 hora e 2 minutos
  • #113 A Deep Dive into Bayesian Stats, with Alex Andorra, ft. the Super Data Science Podcast
    Aug 22 2024

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • My Intuitive Bayes Online Courses
    • 1:1 Mentorship with me

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Bayesian statistics is a powerful framework for handling complex problems, making use of prior knowledge, and excelling with limited data.
    • Bayesian statistics provides a framework for updating beliefs and making predictions based on prior knowledge and observed data.
    • Bayesian methods allow for the explicit incorporation of prior assumptions, which can provide structure and improve the reliability of the analysis.
    • There are several Bayesian frameworks available, such as PyMC, Stan, and Bambi, each with its own strengths and features.
    • PyMC is a powerful library for Bayesian modeling that allows for flexible and efficient computation.
    • For beginners, it is recommended to start with introductory courses or resources that provide a step-by-step approach to learning Bayesian statistics.
    • PyTensor leverages GPU acceleration and complex graph optimizations to improve the performance and scalability of Bayesian models.
    • ArviZ is a library for post-modeling workflows in Bayesian statistics, providing tools for model diagnostics and result visualization.
    • Gaussian processes are versatile non-parametric models that can be used for spatial and temporal data analysis in Bayesian statistics.

    Chapters:

    00:00 Introduction to Bayesian Statistics

    07:32 Advantages of Bayesian Methods

    16:22 Incorporating Priors in Models

    23:26 Modeling Causal Relationships

    30:03 Introduction to PyMC, Stan, and Bambi

    34:30 Choosing the Right Bayesian Framework

    39:20 Getting Started with Bayesian Statistics

    44:39 Understanding Bayesian Statistics and PyMC

    49:01 Leveraging PyTensor for Improved Performance and Scalability

    01:02:37 Exploring Post-Modeling Workflows with ArviZ

    01:08:30 The Power of Gaussian Processes in Bayesian Modeling

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna,...

    Exibir mais Exibir menos
    1 hora e 31 minutos
activate_samplebutton_t1

O que os ouvintes dizem sobre Learning Bayesian Statistics

Nota média dos ouvintes. Apenas ouvintes que tiverem escutado o título podem escrever avaliações.

Avaliações - Selecione as abas abaixo para mudar a fonte das avaliações.