Mini-SWE-Agent Integration¶

Mini-SWE-Agent is a lightweight software engineering agent. The Contree integration enables it to execute code in isolated, reproducible containers. Every command in Mini-SWE-Agent is executed in a fresh shell session, which makes it perfectly suitable for Contree.

Integration repository: mini-swe-agent with Contree

Using ContreeEnvironment¶

from minisweagent.agents.default import DefaultAgent
from minisweagent.environments.extra.contree import ContreeEnvironment
from minisweagent.models.litellm_model import LitellmModel

from contree_sdk.config import ContreeConfig


def main():
    model = LitellmModel(model_name="gemini/gemini-flash-latest")

    contree_env = ContreeEnvironment(
        contree_config=ContreeConfig(
            token="your-contree-token",
            base_url="https://your-contree-instance.com",
        ),
        image="ubuntu:focal",
        cwd="/workspace",
    )

    agent = DefaultAgent(model, contree_env)
    agent.run("Develop small calculator script and check it")

    result = contree_env.session.run(shell="ls /workspace -lah").wait()
    print(result.stdout)


if __name__ == "__main__":
    main()

Running with SWE-bench¶

Update config/extra/swebench.yaml:

environment:
  environment_class: contree
  contree_config:
    token: "your-contree-token"

model:
  model_name: "openai/gpt-4"
  cost_tracking: "ignore_errors"
  model_kwargs:
    custom_llm_provider: "openai"
    drop_params: true
    temperature: 0.0
    api_base: "https://api.provider.com/v1/"
    api_key: "your-api-key"

Run the benchmark:

python src/minisweagent/run/extra/swebench.py \
  --subset lite \
  --output results \
  --workers 4 \
  --redo-existing