Multi-Agent Q-Learning With And Without Algorithmic Orchestration