AI Law Librarians

All Things AI Law Librarian-ish, Generative AI, and Legal Research/Education/Technology

Benchmarking a Moving Target, or let’s run a hypo through 7 AIs and see what happens

Posted on September 5, 2025 by Guest Blogger

Appendix – the results

ChatGPT fonts Download

Claude 3.7 Download

Westlaw Precision – Westlaw AI-Assisted Research – 05-12-2025 Download

GPTo3Pro Download

Lexis AI Assistant Response Download

Deepseek fonts Download

Perplexity Copyright Protection for Fonts and Typefaces Under Download

GeminiResults Download

Pages: 1 2

3 thoughts on “Benchmarking a Moving Target, or let’s run a hypo through 7 AIs and see what happens”

Mary Matuszak on September 6, 2025 at 12:09 pm said:

Excellent post! So many questions and thoughts are going through my head! Have you tried posting the same question to the various LLMs to see how the answers differ over time? How do we get other librarians to do similar experiments? Government law libraries are only given two week trial periods to Lexis and Westlaw so evaluation is difficult. Right now I’m concentrating my efforts on case summaries in free models. I do have a personal subscription to ChatGPT and the visualizations of the summaries are impressive.

Reply ↓
Tyler Alexander on September 8, 2025 at 8:18 am said:

Thank you for running this evaluation! I’d be curious to know whether you tested Westlaw’s latest AI-enabled research product, “Deep Research”? Or was this the older product, AI-Assisted Research?

Reply ↓
Debbie on October 17, 2025 at 4:12 pm said:

I think this was the older one.

Reply ↓

Leave a Reply Cancel reply