The Large Language Models (LLMs) have impacted many real-life tasks. To examine the efficacy of LLMs in a high-stake domain like law, we have applied state-of-the-art LLMs for two popular tasks: Statute Prediction and Judgment Prediction, on Indian Supreme Court cases. We see that while LLMs exhibit excellent predictive performance in Statute Prediction, their performance dips in Judgment Prediction when compared with many standard models. The explanations generated by LLMs (along with prediction) are of moderate to decent quality. We also see evidence of gender and religious bias in the LLM-predicted results. In addition, we present a note from a senior legal expert on the ethical concerns of deploying LLMs in these critical legal tasks.
CITATION STYLE
Vats, S., Zope, A., De, S., Sharma, A., Bhattacharya, U., Nigam, S. K., … Ghosh, K. (2023). LLMs - the Good, the Bad or the Indispensable?: A Use Case on Legal Statute Prediction and Legal Judgment Prediction on Indian Court Cases. In Findings of the Association for Computational Linguistics: EMNLP 2023 (pp. 12451–12474). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-emnlp.831
Mendeley helps you to discover research relevant for your work.