Move HTML parsing into its own class