Setting Confidence Thresholds
Introduction to Confidence Thresholds
When creating a Detector or submitting an ImageQuery, you can set the necessary confidence level for your use case. We call this the confidence_threshold
. Tuning this value allows you to balance the trade-offs between accuracy and latency / cost.
Confidence scores represent the model's internal assessment of its prediction reliability. Groundlight models provide calibrated confidence scores, which means that, when a model makes a prediction with a confidence of 0.95, we expect that (under typical conditions) 95% of the time that prediction will be correct. In other words, a prediction with a confidence of 0.95 is expected to be correct 19 out of 20 times. Confidence calibration kicks in after a sufficient number of labeled images have been collected.
Confidence thresholds represent a minimum confidence that must be achieved for Groundlight to return an answer. If a confidence above the confidence threshold is not achieved, Groundlight will escalate your query up our heirarchy to stronger models and human reviewers. Confidence thresholds should be determined based on your application's acceptable error rate and the potential impact of those errors.
Higher confidence thresholds result in predictions that are more accurate but may take longer to process (because they are escalated to more complex/expensive models or human review). Lower confidence thresholds result in faster responses but may be less accurate. Over time, and as more human-provided labels are collected, the ML models will improve, and our fastest models will be able to provide higher confidence predictions more quickly.
Configuring Timeouts
In some cases, challenging queries that require human review can take a number of seconds, so we provide both client-side and server-side timeouts to ensure that your application can continue to function even if the query takes longer than expected.
Set a client-side timeout by configuring the wait
parameter in the submit_image_query
method. This simply stops the client from waiting for a response after a certain amount of time.
Set a server-side timeout by configuring the patience_time
parameter in the submit_image_query
method. This tells Groundlight to deprioritize the query after a certain amount of time, which can be useful if the result of a query becomes less relevant over time. For example, if you are monitoring a live video feed, you may want to deprioritize queries that are more than a few seconds old so that our human reviewers can focus on the most recent data.
from groundlight import Groundlight
from PIL import Image
import requests
gl = Groundlight()
image_url = "https://www.photos-public-domain.com/wp-content/uploads/2010/11/over_flowing_garbage_can.jpg"
image = Image.open(requests.get(image_url, stream=True).raw)
d = gl.get_or_create_detector(
name="trash",
query="Is the trash can full?",
confidence_threshold=0.95, # Set the confidence threshold to 0.95
)
# This will wait until either 60 seconds have passed or the confidence reaches 0.95
image_query = gl.submit_image_query(
detector=d,
image=image,
wait=10, # tell the client to stop waiting after 10 seconds
patience_time=20, # tell Groundlight to deprioritize the query after 20 seconds
)
print(f"The answer is {image_query.result.label}")
print(f"The confidence is {image_query.result.confidence}")
Tuning the confidence_threshold
allows you to balance accuracy with response time.
Higher confidence thresholds result in more accurate predictions but can increase latency. Achieving these higher confidence levels often requires more labels, which can increase labor costs.
As our models improve over time, they will become more confident, enabling you to receive higher-confidence answers more quickly and at a lower cost.