Obtain metrics that describe potential label noise issues in the dataset. To calculate these metrics, use the calculateDataQualityMetrics
endpoint.
Project ID
OK
Whether the operation succeeded
Optional error description (set if 'success' was false)
Describes the results of running the cosine similarity label noise detection method.
A list of samples that have windows that are similar to windows of other samples that have a different label.
The ID of this sample
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The label of this sample, in index form
A list of samples that have windows that are symptomatic of this issue.
The ID of this sample
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The label of this sample, in index form
The windows in this sample that are symptomatic of this issue.
The start time of this window in milliseconds
The end time of this window in milliseconds
The cosine similarity score between this window and a window from the sample in the parent object.
A list of samples that have windows that are dissimilar to windows of other samples that have the same label.
The ID of this sample
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The label of this sample, in index form
A list of samples that have windows that are symptomatic of this issue.
The ID of this sample
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The label of this sample, in index form
The windows in this sample that are symptomatic of this issue.
The start time of this window in milliseconds
The end time of this window in milliseconds
The cosine similarity score between this window and a window from the sample in the parent object.
Describes the results of running the nearest neighbors label noise detection method.
The label noise score and nearest neighbors for each window of data in the project that shows a potential label noise issue.
The ID of the sample this window belongs to
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The start time of this window in milliseconds
The end time of this window in milliseconds
The label noise score for this window, from 0 to the total number of windows.
Details of the nearest neighbors to this window
The ID of the sample this window belongs to
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The start time of this window in milliseconds
The end time of this window in milliseconds
The number of neighbors used in the nearest neighbors algorithm.
Describes the results of running the cross validation label noise detection method.
The ID of the sample this window belongs to
2
"idle01.d8Ae"
Whether signature validation passed
true
"HS256"
Either the shared key or the public key that was used to validate the sample
Timestamp when the sample was created on device, or if no accurate time was known on device, the time that the file was processed by the ingestion service.
Timestamp when the sample was last modified.
"training"
"healthy-machine"
Interval between two windows (1000 / frequency). If the data was resampled, then this lists the resampled interval.
16
Frequency of the sample. If the data was resampled, then this lists the resampled frequency.
62.5
Interval between two windows (1000 / frequency) in the source data (before resampling).
16
Frequency of the sample in the source data (before resampling).
62.5
Name of the axis
"accX"
Type of data on this axis. Needs to comply to SenML units (see https://www.iana.org/assignments/senml/senml.xhtml).
Number of readings in this file
Total length (in ms.) of this file
Timestamp when the sample was added to the current acquisition bucket.
True if the current sample is excluded from use
True if the current sample is still processing (e.g. for video)
Set when sample is processing and a job has picked up the request
Set when processing this sample failed
Error (only set when processing this sample failed)
Whether the sample is cropped from another sample (and has crop start / end info)
Sample free form associated metadata
Unique identifier of the project this sample belongs to
Name of the owner of the project this sample belongs to
Name of the project this sample belongs to
What labeling flow the project this sample belongs to uses
Data sample SHA 256 hash (including CBOR envelope if applicable)
Start index of the label (e.g. 0)
End index of the label (e.g. 3). This value is inclusive, so { startIndex: 0, endIndex: 3 } covers 0, 1, 2, 3.
The label for this section.
If this sample was created by a synthetic data job, it's referenced here.
The start time of this window in milliseconds
The end time of this window in milliseconds
The label of this window, in index form
The probability of this window being the label it was assigned, as estimated by a classifier trained on the whole dataset.
The z-score of the probability with respect to other class members, so that outliers (i.e. windows whose probability is low) can be easily spotted. This assumes that most correctly labelled class members will have a high probability.