ChatBench: From Static Benchmarks to Human-AI Evaluation Serina Chang, Ashton Anderson, Jake Hofman April 2025 Preprint
ChatBench: From Static Benchmarks to Human-AI Evaluation Serina Chang, Ashton Anderson, Jake Hofman April 2025 Preprint
ChatBench: From Static Benchmarks to Human-AI Evaluation Serina Chang, Ashton Anderson, Jake Hofman April 2025 Preprint
ChatBench: From Static Benchmarks to Human-AI Evaluation Serina Chang, Ashton Anderson, Jake Hofman April 2025 Preprint