Concave

An Open-Source, Reliable Automated Programming Platform

Our target: 'Trusted', 'Open Source' Autonomous Software Engineering

We're 50% better at identify problems than the best open-source solution detail test report

Links

Homepage: https://concave-ai.vercel.app

Docs: https://concave-docs.vercel.app

Try our demo Playbook: https://concave-ai.vercel.app/playbook

help you understand how L·IDE and Fleet work.

Our Tech Report: https://github.com/concave-ai/fleet/blob/main/docs/fleet_report.md

The bug we found in TiDB Vector Client, and fixed with Fleet: https://github.com/pingcap/tidb-vector-python/issues/58

What Inspired Us

Many researchers and companies have done well on SWE-Bench. But we saw two main problems:

Current open-source tools can be much better. For example, Python analysis tools are slow and can't do detailed analysis.
Some startup companies have good results, but they don't share their code.

We want to fix these issues with Concave by:

open-source
Using new tech from industry, like better code analysis tools (SCIP, Tree-sitter) and serverless vector search ( TiDB Vector.)

Our Goal

We're building an open-source, trustworthy, and fully automatic software engineering platform. We made Concave to help current AI models do software engineering more easily.

What Concave Does

In this hackathon, we built two main parts from scratch:

Concave L·IDE: The First IDE Made for LLM (not humans :) )
- Gives LLM the best tools
- Can search for any code part, like function calls or variable settings
- Searches using both keywords (with TiDB Data Service) and meaning (with TiDB Vector Search)
- Combines different search types in one system (TiDB Cloud) (HTAVP, Hybrid transactional/analytical/vector processing)
Concave Fleet:
- We made this in one week to test if L·IDE works well
- It can find the main problem in code and fix it automatically
- It works really well: 50% better at finding problems than the best open-source tools tech report
- We even used Fleet to fix a bug we found in TiDB Vector Client while making L·IDE (link to issue)

How We Built It

In one week, we made Concave L·IDE Code Mixed Search using TiDB Serverless:
- TiDB Serverless Data Service was the best choice for building symbol search,
- TiDB Cloud Vector Search and LlamaIndex and JinaAI made it easy to develop semantic search from scratch.
In another week, we made Concave Fleet using Concave L·IDE:
- Find the main problem
- Make code to fix the problem
- By using Dify and JinaAI, we further create and optimized the behavior of each agent in Fleet, ensuring more accurate outputs.

We also made a user interface called Playbook. It helps people understand how L·IDE and Fleet work, making our solution more trustworthy.

For code understanding, we looked at tools like Tree-sitter and SCIP to make a better Python analysis tool.

Challenges We Faced

Main question: How can we make SWE-Bench work better?

Our ideas:

Make each step work better to improve the overall success.
Help AI understand code better to solve more problems.

L·IDE and Fleet proved these ideas work. We got 50% better at finding problems in SWE-Bench. Our report shows that finding problems better helps solve them better too.

We didn't have time to test everything in SWE-Bench, but we proved our method works. We plan to finish all tests soon.

What We're Proud Of

We're 50% better at finding problems than the best open-source tools
We know how to make SWE-Bench work even better
We shared all our tools so other teams can join in making software engineering automatic

What's Next for Concave

Finish the newest SWE-Bench tests and try to get the best open-source score
Make a pre-set up L·IDE for SWE-Bench to help more teams. They can focus on making AI edit code better without worrying about setting up code analysis
Add more features to Concave L·IDE, like searching git commits and history
Make Concave L·IDE useful for more than just SWE-Bench

Built With

python
tidb
tree-sitter

Updates

JustWPH chen posted an update — Aug 24, 2024 01:08 PM EDT

Someone reported an error in the quickstart tutorial. This is because the latest code is in the hybrid search branch, not in the main branch. A pull request has just been created to fix this. (The code was pushed before 8 PM on the 23rd, and now only a pull request operation was done on GitHub.)

Log in or sign up for Devpost to join the conversation.

JustWPH chen posted an update — Aug 23, 2024 03:02 AM EDT

add MIT license to all repo

Log in or sign up for Devpost to join the conversation.

JustWPH chen started this project — Aug 22, 2024 05:53 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.