The problem is framed as a POMDP (state, observation, action, reward): beliefs are updated from noisy behavioral signals, the policy is optimized under intervention costs, counterfactual evaluation is performed against historical logs, and safety constraints are enforced (e.g. no repeated discounting). The agent must reason causally and act under uncertainty.
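A minimal sketch of the first two pieces, the POMDP belief update from a noisy signal and a cost-aware intervention rule. The two-state model, all probabilities, and the cost/benefit numbers below are illustrative assumptions, not values from the project:

```python
import numpy as np

# Hypothetical two-state model: the user is "engaged" (0) or "churning" (1).
# Actions: 0 = do nothing, 1 = intervene (e.g. offer a discount).
# All numbers are assumed for illustration.

# Transition model T[a, s, s'] = P(s' | s, a): intervening improves retention.
T = np.array([
    [[0.90, 0.10],    # engaged, no-op
     [0.30, 0.70]],   # churning, no-op
    [[0.95, 0.05],    # engaged, intervene
     [0.60, 0.40]],   # churning, intervene
])

# Observation model O[s', o] = P(o | s'): noisy behavioral signal,
# o = 0 ("active"), o = 1 ("inactive").
O = np.array([
    [0.80, 0.20],     # engaged mostly looks active
    [0.25, 0.75],     # churning mostly looks inactive
])

def belief_update(belief, action, obs):
    """Bayes filter: b'(s') ∝ P(o | s') * sum_s P(s' | s, a) * b(s)."""
    predicted = T[action].T @ belief    # predict next-state distribution
    updated = O[:, obs] * predicted     # weight by the noisy observation
    return updated / updated.sum()      # normalize back to a distribution

def should_intervene(belief, benefit=10.0, cost=3.0):
    """Intervene only when expected benefit of acting on the belief
    that the user is churning exceeds the intervention cost."""
    return belief[1] * benefit > cost

b = np.array([0.5, 0.5])                 # uninformative prior
b = belief_update(b, action=0, obs=1)    # observed "inactive" after a no-op
# Belief mass shifts toward "churning"; the rule then decides whether
# the expected gain justifies paying the intervention cost.
```

The safety constraint from the description (no repeated discounting) would sit on top of this rule as a hard filter on the action history, and counterfactual evaluation would replay logged trajectories under the candidate policy rather than the logged one.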
Built With
- amazon-web-services
- cli
- llm
- python
- react
- vectordb
