how does deepseek r1's performance in math-heavy benchmarks compare to gpt-4o

clash订阅链接购买网站