I’m a university student interested in generative AI, and over the holidays I built a project in public: Who Rates the Rater?, a crowdsourced dataset for benchmarking AI-generated storytelling.
The Idea
Inspired by Chatbot Arena, the project lets users compare AI-generated stories side by side and give feedback on them. The goal is to refine LLMs for creative writing using real human preferences.
How It Works
- Live Demo: https://storycrowdsourcepreference.streamlit.app
- Tech Stack: Built with Streamlit + Supabase
- Open Source: https://github.com/clchinkc/story_crowdsource_preference
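For anyone curious how pairwise story comparisons can become a ranking, here is a minimal sketch of an Elo-style update, the kind of rating that Chatbot Arena popularized for head-to-head votes. It is an illustration only, not necessarily how this project scores stories: the function name `elo_ratings`, the model names, and the vote format (a list of winner/loser pairs) are all assumptions.

```python
from collections import defaultdict


def elo_ratings(votes, k=32.0, base=1000.0):
    """Turn (winner, loser) votes into Elo-style ratings.

    Hypothetical helper: the vote format and default parameters
    are assumptions, not taken from the project's code.
    """
    ratings = defaultdict(lambda: base)
    for winner, loser in votes:
        # Probability the winner was expected to win, given current ratings.
        expected_win = 1.0 / (1.0 + 10 ** ((ratings[loser] - ratings[winner]) / 400.0))
        # A surprising win moves ratings more than an expected one.
        delta = k * (1.0 - expected_win)
        ratings[winner] += delta
        ratings[loser] -= delta
    return dict(ratings)


# Hypothetical votes: each tuple is (preferred story's model, other model).
votes = [("model_a", "model_b"), ("model_a", "model_c"), ("model_b", "model_c")]
ratings = elo_ratings(votes)
for name, score in sorted(ratings.items(), key=lambda kv: kv[1], reverse=True):
    print(name, round(score, 1))
```

Elo is order-dependent, so with a small number of votes the result depends on the sequence in which they arrive; fitting a Bradley-Terry model over all votes at once is a common alternative.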
Get Involved
- Try it & star the repo if you find it interesting
- Bug reports & feature requests welcome on Twitter
- Follow me for future AI & storytelling projects
Would love to hear your thoughts!