
Chain-of-Problem-Solving: Multi-step agent to solve Meta Hackercup problems

This repository provides our code and approach for the final round of the Meta Hackercup 2024 competition.

Approach

The idea is to simulate how humans think when solving a competitive programming problem.

Solving a problem usually follows these steps:

  1. Read the problem and understand the input and output
  2. Figure out the expected time complexity
  3. Build a mental model of how to solve the problem through mathematical reasoning with pen and paper
  4. Think of ways to optimize the brute force to fit within the complexity from (2)
  5. Write the code to solve the problem based on (3)
  6. Submit to the competition website and receive a verdict (Accepted, Time Limit Exceeded, or Wrong Answer)
  7. Refine the solution if the verdict is incorrect: if TLE, optimize the solution; if WA, analyze what went wrong and think of different approaches to fix the mistake

Our pipeline mimics this step-by-step human process (a minimal code sketch of the loop follows the list below):

  1. Think of the time complexity
  2. Review the time complexity
  3. Think of the relevant concepts
  4. Review the relevant concepts
  5. Generate an initial solution
  6. Review the time complexity of the initial solution
  7. Review the correctness of the initial solution
  8. Generate a new solution based on 6 and 7
  9. Repeat steps 6-8
  10. Output the final code and run it on the test set
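A minimal sketch of what this loop could look like is below. It is illustrative only: the ask helper, the prompts, and the solve signature are assumptions for exposition, not the actual structure of run.py.

# Minimal illustrative sketch of the pipeline loop; prompts and helper
# names here are assumptions, not the actual run.py implementation.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(prompt: str, model: str = "o1-mini-2024-09-12") -> str:
    """One LLM call, returning the model's text reply."""
    resp = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": prompt}]
    )
    return resp.choices[0].message.content

def solve(statement: str, max_attempts: int = 3) -> str:
    # Steps 1-4: propose the time complexity and concepts, then review each
    complexity = ask(f"What is the expected time complexity?\n{statement}")
    complexity = ask(f"Review and correct this complexity analysis:\n{complexity}")
    concepts = ask(f"List the relevant algorithmic concepts for:\n{statement}")
    concepts = ask(f"Review and refine these concepts:\n{concepts}")
    # Step 5: generate an initial solution conditioned on both reviews
    code = ask(f"Write a Python solution.\nProblem:\n{statement}\n"
               f"Target complexity: {complexity}\nConcepts: {concepts}")
    # Steps 6-9: review the complexity and correctness, then regenerate
    for _ in range(max_attempts):
        tc_review = ask(f"Does this code meet the target complexity?\n{code}")
        wa_review = ask(f"Review this code for correctness:\n{code}")
        code = ask(f"Revise the solution using this feedback:\n"
                   f"{tc_review}\n{wa_review}\nCode:\n{code}")
    # Step 10: return the final code to be run on the test set
    return code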

To run the code

  1. Create an .env file with the following line:
OPENAI_API_KEY=<YOUR OPENAI_API_KEY>
  2. Install the dependencies:
pip install -r requirements.txt
  3. Run the following command (change the parameters if needed):
python run.py --model o1-mini-2024-09-12 --max_attempts 3 --num_time_complexity_tries 4 --num_concepts_tries 2 --problem_name '"Duplicate Order"'

Execution takes about 5-20 minutes depending on the problem.

NOTE: the problem has to be placed in the /problems/(Your Problem Name) folder with the following files:

  • full_in.txt (full input)
  • sample_in.txt (sample input)
  • sample_out.txt (sample output)
  • statement.txt (problem statement)
  • relevant images where applicable
  4. Check the output files in the directory associated with the model ((PROBLEM NAME)_output.txt and (PROBLEM NAME)_code.py)
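For example, with the sample command above, the expected layout would look like this (the image filename is only a placeholder):

problems/
  Duplicate Order/
    statement.txt      (problem statement)
    sample_in.txt      (sample input)
    sample_out.txt     (sample output)
    full_in.txt        (full input)
    diagram.png        (optional relevant image)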

Things we have not tried but that may be promising

  • Generating random stress tests to feed into the LLM for debugging (a sketch follows below)
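As an illustration, a stress tester could compare the generated solution against a slow brute force on random inputs and hand any mismatching case back to the LLM. The script below is a hypothetical sketch: candidate.py, brute_force.py, and the input format are placeholders, not files this repository produces.

# Hypothetical stress-testing sketch; candidate.py, brute_force.py, and
# the input format are placeholders, not part of this repository.
import random
import subprocess

def random_case() -> str:
    # Problem-specific generator; small bounds keep the brute force fast.
    n = random.randint(1, 8)
    values = [random.randint(1, 10) for _ in range(n)]
    return f"{n}\n{' '.join(map(str, values))}\n"

def run(script: str, case: str) -> str:
    # Run a solution script on one test case and capture its stdout.
    return subprocess.run(["python", script], input=case,
                          capture_output=True, text=True).stdout.strip()

for _ in range(1000):
    case = random_case()
    if run("candidate.py", case) != run("brute_force.py", case):
        print("Mismatch on:\n" + case)  # this case would be fed back to the LLM
        break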

Things we tried that failed

  • RAG
  • Matching answers against the sample test cases (some problems have more than one correct answer, so this is not feasible)
  • Publicly available agentic frameworks (Rethink-MCTS and MapCoder): these significantly increased the time required to solve a problem without much gain in accuracy. In the final round, no problem completed within 10 minutes when the pipeline ran with these additional frameworks.

Team members

  • Geremie Yeo (Leader)
  • Saidinesh Pola

Total score across all rounds: 25.5
