I actually built it for my girlfriend who was writing a systematic review paper. She had to compare 7.500 papers against inclusion and exclusion criterias. She obviously did this manually because she cares about scientific integrity, but it sparked the idea to make an AI tool to automate repetitive tasks for people like her who would rather avoid programming. Now I just find it useful myself for a lot of ad-hoc analysis tasks like prompt engineering, rag tuning, and comparing model outputs from anthropic, openai, and google.