Benchmarking Reliability in AI-generated Legal Advice
Imagine you just moved to a new city. Your landlord refuses to return your security deposit, but you don’t know the local tenant laws. So you ask ChatGPT. But how do you know it is accurate/unbiased?