Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems.
Mohammad Kachuee, Sungjin Lee
Published in: ACL (industry) (2023)
Keyphrases
- ai systems
- intelligent systems
- concave convex procedure
- artificial intelligence
- optimization problems
- multi-modal
- optimization algorithm
- optimization method
- natural language generation
- constrained optimization
- saddle point
- user interface
- learning process
- knowledge representation
- computational intelligence
- neural network
- artificial neural networks
- e-learning
- information retrieval