Machine Learning Invention – Air Conditioner Based On Reinforcement Learning? What would the reward be? Paradoxical cooling: a learned algorithm wastes a little cooling on the outer unit as a small reward. Could that use much less total energy overall?
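One way to make the reward question concrete is a rough sketch, not a worked-out design. It assumes the reward per control step is a weighted sum of three terms: energy used (penalty), distance from the comfort band around the setpoint (penalty), and a small bonus for cooling diverted to the outer unit. All names here (`RewardWeights`, `reward`, `band_c`) and all weight values are made up for illustration. Whether the diverted cooling actually lowers total energy would have to show up in measured `energy_kwh`; the bonus term only nudges the agent to try it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardWeights:
    """Hypothetical weights; the values are placeholders, not tuned numbers."""
    energy: float = 1.0         # penalty per kWh consumed during the step
    comfort: float = 2.0        # penalty per degree C outside the comfort band
    outdoor_bonus: float = 0.1  # small bonus per unit of cooling sent to the outer unit


def reward(
    energy_kwh: float,
    indoor_temp_c: float,
    setpoint_c: float,
    diverted_cooling: float,
    w: RewardWeights = RewardWeights(),
    band_c: float = 1.0,
) -> float:
    """Reward for one control step (assumed form, see the text above).

    Higher is better: the agent is pushed to use less energy and keep the room
    within `band_c` degrees of the setpoint, with a small bonus for the cooling
    it diverts to the outer unit.
    """
    # Degrees outside the tolerated band around the setpoint; zero inside it.
    discomfort = max(0.0, abs(indoor_temp_c - setpoint_c) - band_c)
    return (
        -w.energy * energy_kwh
        - w.comfort * discomfort
        + w.outdoor_bonus * diverted_cooling
    )
```

Keeping `outdoor_bonus` small relative to `energy` matters: if the bonus is too large, the agent can collect reward just by wasting cooling, which is the opposite of the goal. Setting it to zero gives a baseline to compare against.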