Best AI papers explained

A podcast by Enoch H. Kang

Categories:

183 Episodes

  1. Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective

    Published: 5/12/2025
  2. Leaked Claude Sonnet 3.7 System Instruction tuning

    Published: 5/12/2025
  3. Converging Predictions with Shared Information

    Published: 5/11/2025
  4. Test-Time Alignment Via Hypothesis Reweighting

    Published: 5/11/2025
  5. Rethinking Diverse Human Preference Learning through Principal Component Analysis

    Published: 5/11/2025
  6. Active Statistical Inference

    Published: 5/10/2025
  7. Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework

    Published: 5/10/2025
  8. AI-Powered Bayesian Inference

    Published: 5/10/2025
  9. Can Unconfident LLM Annotations Be Used for Confident Conclusions?

    Published: 5/9/2025
  10. Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI

    Published: 5/9/2025
  11. Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

    Published: 5/9/2025
  12. How to Evaluate Reward Models for RLHF

    Published: 5/9/2025
  13. LLMs as Judges: Survey of Evaluation Methods

    Published: 5/9/2025
  14. The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs

    Published: 5/9/2025
  15. Limits to scalable evaluation at the frontier: LLM as Judge won’t beat twice the data

    Published: 5/9/2025
  16. Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

    Published: 5/9/2025
  17. Accelerating Unbiased LLM Evaluation via Synthetic Feedback

    Published: 5/9/2025
  18. Prediction-Powered Statistical Inference Framework

    Published: 5/9/2025
  19. Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

    Published: 5/9/2025
  20. RM-R1: Reward Modeling as Reasoning

    Published: 5/9/2025

1 / 10

Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.