We will soon release confidence scores and human-in-the-loop review in Box Preview, a powerful new capability that brings transparency, control, and trust to AI-powered data extraction in Box Extract. With this capability, users will be able to view how confident the Box Extract Agent is with every extracted metadata value, enabling them to navigate directly to the data source within the document, and validate whether the extracted value is correct or whether it should be modified all without leaving Box Preview.
This feature will apply specifically to metadata generated through the autofill capability within in Box Preview and Box Apps, as well as files processed using Custom Extract Agents in Box Extract. Every metadata value extracted using autofill or within Box Extract will include a low, medium or high confidence score, or a model-generated probability indicating how likely the extracted value is correct. Confidence scores will be displayed directly in the metadata panel next to the corresponding metadata value.
Fields that contain low confidence scores will be visually flagged in the metadata panel, making it easy to identify and review incorrect metadata without manually scanning every value. Users can click any extracted metadata value in the metadata panel to jump directly to the referenced location in the document. Box Preview scrolls to the relevant page and displays a bounding box, or citation, around the exact source text used to extract the value, simplifying verification.
For fields flagged for review, reviewers will have two simple inline actions directly in Box Preview:
- Accept: Confirms the extracted value is correct. The field is marked "Accepted", the confidence flag is cleared, and the value is immediately saved.
- Edit or Delete: Requires the user to enter the correct value. The field is marked "Edited," and both the confidence score and bounding box are removed.
With this release, Box Extract will provide customers with the confidence they need in their data. This is especially impactful for regulated industries, including financial services, healthcare, legal, and the public sector, where metadata accuracy is tied directly to business outcomes and compliance obligations.
Stay tuned to learn more about this release.