AI Law Librarians Benchmarking Group

About Our Research Group

The AI Law Librarians Benchmarking Group is a collaborative team of law librarians dedicated to systematically evaluating generative AI tools for legal research. As legal tech vendors rapidly release AI-powered products—each claiming to transform research practices—we saw a need for a consistent, thoughtful, and ongoing approach to testing these tools in real-world legal research scenarios. Rather than relying on isolated or anecdotal use, our group is developing a framework grounded in legal research pedagogy and information science to assess how these tools perform across defined task types.

The group includes a few of your AI Law Librarian bloggers, but also welcomes several other librarians. The full group is:

  • Rebecca Fordon, The Ohio State University Moritz Law Library
  • Jonathan Franklin, University of Washington School of Law
  • Deborah Ginsberg, Harvard Law Library
  • Nick Hafen, BYU Law School
  • Sean Harrington, University of Oklahoma Law Library
  • Christine Park, Harvard Law Library

Together, we are building a typology of legal research tasks, designing evaluation prompts and rubrics, and documenting our findings to share with the broader legal and library communities. We welcome collaboration and feedback from others working at the intersection of law, AI, and information literacy. Our introduction post is here.

Legal Research AI Benchmarking Studies / Comparisons

Legal Reasoning and Task Benchmarking

Vendor Reports about Benchmarking