Hellobench: Evaluating long text generation capabilities of large language modelsPublished in arxiv, 2024 Share on Bluesky Facebook LinkedIn X (formerly Twitter) Previous Next