Sequitor Based Reinforcment Learning Chatbot
A reinforcement learning based chat bot can be implemented by using the context free grammars created by the sequitur algorithm.
As phrases are learned by the chat bot, the sequitar algorithm creates grammar sub trees based on the existing corpus. These sub trees are cataloged and can be used as a state space. Simplistically, a state consists of a vector equal in length to the number of sub trees, with a bit flipped for whether that specific subtree was seen in the input.
Based on the state, various actions can be taken. New phrases could potentially be generated using the same grammar model, or other actions could be taken.
Reinforcement “simulations” or dreams could be executed between each user input.
Primary difficulties: Keep states consistent as new additions to the state space are added, Generating new phrases based on the captured grammar.