Proceedings Paper

Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy

Related references

Note: Only some of the references are listed.
Article Multidisciplinary Sciences

Magnetic control of tokamak plasmas through deep reinforcement learning

Jonas Degrave et al.

Summary: Nuclear fusion using magnetic confinement, specifically in the tokamak configuration, is a promising method for sustainable energy. In this study, researchers introduce a previously undescribed architecture for the design of tokamak magnetic controllers, which autonomously learns to command the full set of control coils. This approach demonstrates unprecedented flexibility and generality in problem specification, leading to a notable reduction in design effort and the ability to produce new plasma configurations.

NATURE (2022)

Article Remote Sensing

Enhancing environmental enforcement with near real-time monitoring: Likelihood-based detection of structural expansion of intensive livestock farms

Ben Chugg et al.

Summary: The article discusses a method for rapidly identifying significant structural expansion using high-resolution satellite imagery, focusing on Concentrated Animal Feeding Operations (CAFOs) as a test case. This approach shows promise for enhancing environmental compliance monitoring and providing near real-time monitoring in various settings.

INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION (2021)

Article Multidisciplinary Sciences

Efficient and targeted COVID-19 border testing via reinforcement learning

Hamsa Bastani et al.

Summary: This study presents the design and performance of Eva, a reinforcement learning system deployed at all Greek borders in the summer of 2020 to limit the influx of asymptomatic travelers infected with SARS-CoV-2 and inform border policies. Eva outperformed random surveillance testing and testing policies based solely on epidemiological metrics in identifying infected travelers, showcasing the potential of reinforcement learning in safeguarding public health.

NATURE (2021)

Article Engineering, Industrial

Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

C. P. Andriotis et al.

Summary: Determining inspection and maintenance policies to minimize long-term risks and costs in deteriorating engineering environments is a complex optimization problem. Major computational challenges include the curse of dimensionality, curse of history, presence of state uncertainties, and presence of constraints. These challenges are addressed through a joint framework of constrained Partially Observable Markov Decision Processes (POMDP) and multi-agent Deep Reinforcement Learning (DRL).

RELIABILITY ENGINEERING & SYSTEM SAFETY (2021)

Article Physics, Multidisciplinary

Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm

Emanuele Cavenaghi et al.

Summary: The Multi-Armed Bandit problem models sequential decision-making, but in practice the reward distributions may change over time. The f-Discounted-Sliding-Window Thompson Sampling algorithm is proposed to handle concept drift in non-stationary environments by combining a discount factor with a sliding-window mechanism.

ENTROPY (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Debiased Off-Policy Evaluation for Recommendation Systems

Yusuke Narita et al.

Summary: The paper proposes an alternative method for evaluating new algorithms by predicting their performance from historical data, validates the method in a simulation experiment and an advertisement design application, and reports smaller mean squared errors than state-of-the-art methods.

15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021) (2021)

Article Mathematics, Applied

On the Bias, Risk, and Consistency of Sample Means in Multi-armed Bandits

Jaehyeok Shin et al.

Summary: This paper extensively discusses the bias, risk, and consistency of sample means in multi-armed bandit experiments, identifying four distinct sources of selection bias. A new notion of effective sample size is introduced to bound the risk of the sample mean, with carefully designed examples provided for better understanding of the various sources of selection bias studied. The proofs in the paper combine variational representations of information-theoretic divergences with new martingale concentration inequalities.

SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE (2021)

Article Management

Online Decision Making with High-Dimensional Covariates

Hamsa Bastani et al.

OPERATIONS RESEARCH (2020)

Review Environmental Sciences

A review of machine learning applications in wildfire science and management

Piyush Jain et al.

ENVIRONMENTAL REVIEWS (2020)

Article Multidisciplinary Sciences

Autonomous navigation of stratospheric balloons using reinforcement learning

Marc G. Bellemare et al.

NATURE (2020)

Review Computer Science, Information Systems

Investigating Bias in Facial Analysis Systems: A Systematic Review

Ashraf Khalil et al.

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

Machine Learning for the Geosciences: Challenges and Opportunities

Anuj Karpatne et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2019)

Article Multidisciplinary Sciences

Dissecting racial bias in an algorithm used to manage the health of populations

Ziad Obermeyer et al.

SCIENCE (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Degenerate Feedback Loops in Recommender Systems

Ray Jiang et al.

AIES '19: PROCEEDINGS OF THE 2019 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (2019)

Article Computer Science, Artificial Intelligence

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Cynthia Rudin

NATURE MACHINE INTELLIGENCE (2019)

Editorial Material Medicine, General & Internal

Big Data and Machine Learning in Health Care

Andrew L. Beam et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2018)

Article Communication

Opening the government's black boxes: freedom of information and algorithmic accountability

Katherine Fink

INFORMATION COMMUNICATION & SOCIETY (2018)

Proceedings Paper Computer Science, Theory & Methods

Data Poisoning Attacks in Contextual Bandits

Yuzhe Ma et al.

DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2018 (2018)

Article Economics

Spatial interactions and optimal forest management on a fire-threatened landscape

Christopher J. Lauer et al.

FOREST POLICY AND ECONOMICS (2017)

Editorial Material Multidisciplinary Sciences

Beyond prediction: Using big data for policy problems

Susan Athey

SCIENCE (2017)

Article Statistics & Probability

Batched Bandit Problems

Vianney Perchet et al.

ANNALS OF STATISTICS (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Linear Upper Confidence Bound Algorithm for Contextual Bandit Problem with Piled Rewards

Kuan-Hao Huang et al.

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II (2016)

Article Economics

Crowdsourcing City Government: Using Tournaments to Improve Inspection Accuracy

Edward L. Glaeser et al.

AMERICAN ECONOMIC REVIEW (2016)

Article Law

The Food Safety Modernization Act: Implications for US Small Scale Farms

Kathryn A. Boys et al.

AMERICAN JOURNAL OF LAW & MEDICINE (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission

Rich Caruana et al.

KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (2015)

Article Automation & Control Systems

A Structured Multiarmed Bandit Problem and the Greedy Policy

Adam J. Mersereau et al.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2009)