Simplify Your Law: Using Information Theory to Deduplicate Legal Documents.
Corinna CoupetteJyotsna SinghHolger SpamannPublished in: CoRR (2021)
Keyphrases
- information theory
- legal documents
- case law
- information theoretic
- jensen shannon divergence
- statistical mechanics
- statistical learning
- relative entropy
- statistical physics
- kullback leibler divergence
- conditional entropy
- image processing
- semantic content
- shannon entropy
- multimedia
- mdl principle
- image content
- mutual information
- co occurrence
- active learning