ALDA: Automated Legal Document Analytics

June 1, 2014 - June 1, 2019

There has been an exponential growth in use of digitized legal documents in recent years. Majority of services on the Internet have associated legal documents such as Terms of Services, Privacy Policies and Service Level agreements. A large corpus of court cases, judgments and compliance/regulations are now digitally available for e-discovery. Moreover, businesses are maintaining large data sets of legal contracts that they have signed with their employees, customers and contractors. Furthermore, companies have to adhere to a variety of compliance and regulatory policies for many of these contracts, which are also increasingly digitally available. Managing and monitoring an ever increasing dataset of legal contracts, regulations and compliance is still a very manual and labour intensive job and can be a bottleneck in the smooth functioning of the enterprise.

Our research aims at building a Legal Question and Answer (LQnA) system that will be built upon large scale document analytics of legal documents using various techniques from deep learning, machine learning, natural language processing and text mining. We are working to transform legal databases from textual databases to graph-based datasets using Semantic Web technologies. Our long term goal is to develop a system that for any given action or question, can highlight all the statutes, laws and case law that might be applicable on it and offer preliminary guidance to a counsel. As a shorter term vision, we're looking to see if we can automatically extract elements from compliance and regulatory legal documents that govern Information Technology (IT) outsourcing/cloud computing and automatically monitor for compliance.

automatic sla monitoring, cloud computing, sla, text mining

OWL Tweet

Principal Faculty

  1. Karuna Pande Joshi

Affiliated Faculty

  1. Tim Finin
  2. Anupam Joshi


  1. Aditi Gupta

Refereed Publications


  1. K. P. Joshi and A. Banerjee, "Automating Privacy Compliance Using Policy Integrated Blockchain", Article, Cryptography, Special Issue Advances of Blockchain Technology and Its Applications, February 2019, 97 downloads.


  1. L. Elluri and K. P. Joshi, "A Knowledge Representation of Cloud Data controls for EU GDPR Compliance", InProceedings, 11th IEEE International Conference on Cloud Computing (CLOUD), July 2018, 430 downloads.
  2. A. Nagar and K. P. Joshi, "A Semantically Rich Knowledge Representation of PCI DSS for Cloud Services", InProceedings, 6th International IBM Cloud Academy Conference ICACON 2018, Japan, May 2018, 251 downloads.


  1. S. Saha, K. P. Joshi, R. Frank, M. Aebig, and J. Lin, "Automated Knowledge Extraction from the Federal Acquisition Regulations System (FARS)", InProceedings, 2nd International Workshop on Enterprise Big Data Semantic and Analytics Modeling at IEEE International Conference on Big Data 2017 , December 2017, 673 downloads.
  2. S. Saha and K. P. Joshi, "Cognitive Assistance for Automating the Analysis of the Federal Acquisition Regulations System", InProceedings, AAAI Fall Symposium 2017, November 2017, 471 downloads.
  3. S. Saha, K. P. Joshi, and A. Gupta, "A Deep Learning Approach to Understanding Cloud Service Level Agreements ", InProceedings, Fifth International IBM Cloud Academy Conference, May 2017, 683 downloads.


  1. K. P. Joshi, A. Gupta, S. Mittal, C. Pearce, A. Joshi, and T. Finin, "Semantic Approach to Automating Management of Big Data Privacy Policies", InProceedings, IEEE BigData 2016, December 2016, 1188 downloads.
  2. K. P. Joshi, A. Gupta, S. Mittal, C. Pearce, A. Joshi, and T. Finin, "ALDA : Cognitive Assistant for Legal Document Analytics", InProceedings, AAAI Fall Symposium 2016, September 2016, 704 downloads.
  3. A. Gupta, S. Mittal, K. P. Joshi, C. Pearce, and A. Joshi, "Streamlining Management of Multiple Cloud Services", InProceedings, IEEE International Conference on Cloud Computing, June 2016, 891 downloads.
  4. S. Mittal, K. P. Joshi, C. Pearce, and A. Joshi, "Automatic Extraction of Metrics from SLAs for Cloud Service Management", InProceedings, 2016 IEEE International Conference on Cloud Engineering (IC2E 2016), April 2016, 906 downloads.


  1. S. Mittal, K. P. Joshi, C. Pearce, and A. Joshi, "Parallelizing Natural Language Techniques for Knowledge Extraction from Cloud Service Level Agreements", InProceedings, 2015 IEEE International Conference on Big Data, October 2015, 873 downloads.
  2. K. P. Joshi and C. Pearce, "Automating Cloud Service Level Agreements using Semantic Technologies", InProceedings, CLaw Workshop, IEEE International Conference on Cloud Engineering (IC2E), March 2015, 965 downloads.


  1. K. P. Joshi, Y. Yesha, and T. Finin, "Automating Cloud Services Lifecycle through Semantic technologies", Article, IEEE Transactions on Service Computing, January 2014, 1689 downloads.

Non-Refereed Publications


  1. S. Saha and K. P. Joshi, "Cognitively Rich Framework to Automate Extraction and Representation of Legal Knowledge", TechReport, March 2018, 48 downloads.


  1. (Project) ALDA: Automated Legal Document Analytics has principal investigator (Person) Karuna Pande Joshi
  2. (Project) ALDA: Automated Legal Document Analytics has developer (Person) Ankur Nagar.