Quesma
Re-inventing how applications use their databases
Pinned Loading
Repositories
Showing 10 of 19 repositories
- BinaryAudit Public
An open-source benchmark for evaluating AI agents' ability to find backdoors hidden in compiled binaries.
QuesmaOrg/BinaryAudit’s past year of commit activity - homebrew-tap Public
QuesmaOrg/homebrew-tap’s past year of commit activity - terminal-bench Public Forked from laude-institute/terminal-bench
A benchmark for LLMs on complicated tasks in the terminal
QuesmaOrg/terminal-bench’s past year of commit activity