English
全部
搜索
图片
视频
地图
资讯
Copilot
更多
购物
航班
旅游
笔记本
Top stories
Sports
U.S.
Local
World
Science
Technology
Entertainment
Business
More
Politics
时间不限
过去 1 小时
过去 24 小时
过去 7 天
过去 30 天
最佳匹配
最新
雷锋网
10 个月
纯蒸馏模型 SOTA 出现!直接 SFT 成本直降 50 倍,数据已全部开源
导语:纯蒸馏 SFT 的推理模型性能对标一众 SFT + RL 模型。 a-m-team 又发新论文了。 这个团队上周刚刚在 Hugging Face 低调开源了32B稠密模型,但在多项关键推理评测中击败了 DeepSeek-R1,并与超大规模的 MoE 模型Qwen3-235B-A22B、Seed1.5-Thinking 不相上下,因此赢得了海内外 ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果
今日热点
Judge halts ballroom project
Rejects CO therapy ban
Judge blocks Trump order
Xerox CEO steps down
Florida to rename airport
US consumer confidence rises
Japan deploys 1st LR missiles
Gas hits $4 a gallon
Rioux enters transfer portal
Signs 'millionaires tax'
Australia probes tech giants
US senators probe FCC chief
Set for US state visit
Hikes baggage fees
Agrees to $95M, 8-yr deal?
Sugar The Surfing Dog dies
Trump unveils library design
NYC man indicted
Marine detained at airport
EU diplomats arrive in Kyiv
Vance to publish new book
US job openings decline
Fox News lawsuit dismissed
Former Jets QB retires
Megachurch pastor released
Explosive device found in NY
Vegas to host Super Bowl 63
Haiti gang attacks
Faces federal bribery probe
Hold briefing on Iran war
Cream cheese recalled
反馈