There was an error while loading. Please reload this page.
Finally, we use our benchmarks to demonstrate that LLMs struggle to compositionally generalize when asked to do programming-by-example in a few-shot setting, but an ExeDec-style prompting approach can ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果