Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset.
Peter HendersonMark S. KrassLucia ZhengNeel GuhaChristopher D. ManningDan JurafskyDaniel E. HoPublished in: NeurIPS (2022)
Keyphrases
- open source
- data sets
- prior knowledge
- raw data
- learning systems
- data collection
- data processing
- legal reasoning
- data analysis
- data sources
- data points
- data quality
- image data
- database
- learning process
- training data
- case law
- background knowledge
- sensor data
- legal argument
- open source software
- training dataset
- original data
- artificial intelligence and law
- test data
- statistical analysis
- online learning
- data mining techniques
- high speed
- supervised learning
- data structure
- reinforcement learning
- learning algorithm
- data mining
- neural network