Evaluation
This guide is a work in progress. Currently, it contains only the minimum information needed to get started writing evals against the MongoDB Responses API.
This guide explains how to write evaluations against the MongoDB Responses API.
When developing against the Responses API, write evaluations to measure the quality of your AI system and improve it iteratively.
Do Not Use Sensitive Customer Data in Evaluations
NEVER use sensitive customer data in your evaluations. Period.
Either use fake data or anonymize real customer data, in line with information security best practices. If you are not sure whether you can use some data, assume that you cannot. Reach out to the Information Security team with any questions about data usage.
Evaluation API
To run evaluations against the MongoDB Responses API, use the following base endpoint:
https://chat-server.docs.staging.corp.mongodb.com/api/v1/
This endpoint bypasses the Web Application Firewall (WAF) and CloudFront protections that we have on the standard staging API, http://knowledge.staging.corp.mongodb.com/, and the production API, http://knowledge.mongodb.com/. Do not run evaluations against these APIs. You will get 4XX errors from CloudFront!
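As a rough illustration of the endpoint guidance above, the following Python sketch builds request URLs from the evaluation base endpoint and refuses to target the two CloudFront-protected hosts. The helper name `eval_url` and the `responses` path used in the usage line are hypothetical and chosen only for illustration; this guide does not specify the route names, and authentication is not handled here.

```python
from urllib.parse import urljoin, urlparse

# Base endpoint for evaluations, as given in this guide.
EVAL_BASE_URL = "https://chat-server.docs.staging.corp.mongodb.com/api/v1/"

# Hosts behind the WAF and CloudFront; evaluations must not target these.
PROTECTED_HOSTS = {
    "knowledge.staging.corp.mongodb.com",
    "knowledge.mongodb.com",
}


def eval_url(path: str, base_url: str = EVAL_BASE_URL) -> str:
    """Join a relative API path onto the evaluation base endpoint.

    Hypothetical helper: raises ValueError if base_url points at a host
    that sits behind CloudFront.
    """
    host = urlparse(base_url).hostname
    if host in PROTECTED_HOSTS:
        raise ValueError(
            f"{host} is behind CloudFront; use the evaluation endpoint instead"
        )
    # Ensure a trailing slash on the base and no leading slash on the path,
    # otherwise urljoin would drop part of the "/api/v1/" prefix.
    normalized_base = base_url.rstrip("/") + "/"
    return urljoin(normalized_base, path.lstrip("/"))


# Usage: "responses" is an assumed path, not confirmed by this guide.
url = eval_url("responses")
```

Keeping the base URL in one constant and checking the host up front means a misconfigured eval run fails immediately in your own code instead of surfacing later as 4XX responses.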
To use the evaluation API, you must authenticate. Refer to the following section for authentication information.