gpt_engineer.benchmark.benchmarks.apps.load.load_apps

gpt_engineer.benchmark.benchmarks.apps.load.load_apps(config: AppsConfig) Benchmark[source]

Loads the APPS benchmark, which consists of a series coding problems.

Returns:

A Benchmark object containing a list of Task objects for the APPS evaluation.

Return type:

Benchmark