Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
The following table lists the metrics available for the microsoft.kubernetesconfiguration/extensions resource type.
Table headings
Metric - The metric display name as it appears in the Azure portal.
Name in Rest API - Metric name as referred to in the REST API.
Unit - Unit of measure.
Aggregation - The default aggregation type. Valid values: Average, Minimum, Maximum, Total, Count.
Dimensions - Dimensions available for the metric.
Time Grains - Intervals at which the metric is sampled. For example, PT1M
indicates that the metric is sampled every minute, PT30M
every 30 minutes, PT1H
every hour, and so on.
DS Export- Whether the metric is exportable to Azure Monitor Logs via Diagnostic Settings.
For information on exporting metrics, see - Metrics export using data collection rules and Create diagnostic settings in Azure Monitor.
For information on metric retention, see Azure Monitor Metrics overview.
Category: Latency
Metric | Name in REST API | Unit | Aggregation | Dimensions | Time Grains | DS Export |
---|---|---|---|---|---|---|
Api Request Duration in Seconds Histogram of request durations |
ApiRequestDurationSeconds |
Seconds | Average | AppName , GpuEnabled , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Ingestion Time Total ingestion time in minutes |
IngestionTimeMinutes |
Seconds | Average | AppName , GpuEnabled |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Input Preprocessing Time (Milliseconds) Input preprocessing time in milliseconds |
InputPreprocessingTimeMilliseconds |
Milliseconds | Average | GpuEnabled |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Call LLM Total Time in Seconds Total call_llm time in seconds |
TotalCallLLMTimeSeconds |
Seconds | Average | AppName , GpuEnabled , LLMProvider , OutputLength |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Embedding Generation Total Time in Seconds Total time taken to generate embeddings from local model |
TotalGenerateEmbeddingsTimeSeconds |
Seconds | Average | AppName , GpuEnabled , InputLength , OutputLength |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Hybrid Search Embedding Generation Total Time in Seconds Total time taken to generate Hybrid Search embeddings from local model |
TotalGenerateHybridSearchEmbeddingsTimeSeconds |
Seconds | Average | AppName , GpuEnabled , InputLength , OutputLength |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Reranking Generation Total Time in Seconds Total time taken to generate Reranking |
TotalGenerateRerankingTimeSeconds |
Seconds | Average | AppName , GpuEnabled , InputLength , OutputLength |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Get Chat History Summary Total Time in Milliseconds Total get_chat_history_summary time in milliseconds |
TotalGetChatHistorySummaryTimeMilliseconds |
Milliseconds | Average | AppName , GpuEnabled , InputHistoryPairs , LLMProvider , MaxTokens , OutputLength , Temperature , TopP |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Get LLM Payload Total Time in Milliseconds Total get_llm_payload time in milliseconds |
TotalGetLLMPayloadTimeMilliseconds |
Milliseconds | Average | AppName , DiversityPenalty , GpuEnabled , LengthPenalty , LLMProvider , MaxTokens , RepetitionPenalty , Temperature , TopP |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Get Hybrid Search Total Time in Milliseconds Total hybrid search time in milliseconds |
TotalHybridSearchTimeMilliseconds |
Milliseconds | Average | AppName , ChunkMinScore , GpuEnabled , IndexType , InputLength , MetricType , TopK |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Inference Total Time in Seconds Total inference time in seconds |
TotalInferenceTimeSeconds |
Seconds | Average | AppName , DiversityPenalty , GpuEnabled , InputLength , LLMProvider , MaxTokens , OutputLength , RepetitionPenalty , Temperature , TopK |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Chunks Search Total Time in Milliseconds Total search chunks time in milliseconds |
TotalSearchChunksTimeMilliseconds |
Milliseconds | Average | AppName , EmbeddingIndexName , GpuEnabled , InputLength , OutputChunks , TopK |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Search Total Time in Milliseconds Total time taken to search |
TotalSearchTimeMilliseconds |
Milliseconds | Average | AppName , ChunkMinScore , GpuEnabled , InputLength , QueryType , TopK |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Similarity Search Total Time in Milliseconds Total time taken to search for similar documents |
TotalSimilaritySearchTimeMilliseconds |
Milliseconds | Average | AppName , GpuEnabled , InputLength , ChunkMinScore , IndexType , MetricType , TopK |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Category: Traffic
Metric | Name in REST API | Unit | Aggregation | Dimensions | Time Grains | DS Export |
---|---|---|---|---|---|---|
Active PDU Sessions Number of Active PDU Sessions |
ActiveSessionCount |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | No |
API Failure Count Count of failed API requests |
ApiFailureCount |
Count | Count | EndpointName , GpuEnabled , StatusCode |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
API Request Count Total number of API requests |
ApiRequestCount |
Count | Count | AppName , GpuEnabled , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
API Success Count Count of successful API requests |
ApiSuccessCount |
Count | Count | EndpointName , GpuEnabled , StatusCode |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Authentication Attempts Authentication attempts rate (per minute) |
AuthAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Authentication Failures Authentication failure rate (per minute) |
AuthFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Result |
PT1M | Yes |
Authentication Successes Authentication success rate (per minute) |
AuthSuccess |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Connected NodeBs Number of connected gNodeBs or eNodeBs |
ConnectedNodebs |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
DeRegistration Attempts UE deregistration attempts rate (per minute) |
DeRegistrationAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
DeRegistration Successes UE deregistration success rate (per minute) |
DeRegistrationSuccess |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Evaluation API Request Count Total number of Evaluation API requests |
EvaluationApiRequestCount |
Count | Count | AppName , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Failed Skipped Count Count of failed or skipped files |
FailedSkippedCount |
Count | Count | Category , GpuEnabled |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
File Ingestion Rate Total files ingested per Job |
FileIngestionRate |
Count | Total (Sum) | AppName , GpuEnabled , FileType , JobID |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Hybrid Search Model API Request Count Total number of Hybrid Search Model API requests |
HybridSearchModelApiRequestCount |
Count | Count | AppName , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Inference Answer Feedback Inference Answer Feedback |
InferenceAnswerFeedback |
Count | Count | AppName , ChunkMinScore , ChunkScores , GpuEnabled , LLMProvider , RunId , Thumb |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Inference API Request Count Number of Inference API requests |
InferenceApiRequestCount |
Count | Count | AppName , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Ingestion API Request Count Number of Ingestion API requests |
IngestionApiRequestCount |
Count | Count | AppName , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Number of Evaluations Number of Evaluations |
NumberOfEvaluations |
Count | Count | AppName , GpuEnabled , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Number of Jobs Number of jobs |
NumberOfJobs |
Count | Count | AppName , GpuEnabled , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Paging Attempts Paging attempts rate (per minute) |
PagingAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Paging Failures Paging failure rate (per minute) |
PagingFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Provisioned Subscribers Number of provisioned subscribers |
ProvisionedSubscribers |
Count | Total (Sum) | PccpId , SiteId |
PT1M | No |
RAN Setup Failures RAN setup failure rate (per minute) |
RanSetupFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Cause |
PT1M | Yes |
RAN Setup Requests RAN setup reuests rate (per minute) |
RanSetupRequest |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
RAN Setup Responses RAN setup response rate (per minute) |
RanSetupResponse |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Registered Subscribers Number of registered subscribers |
RegisteredSubscribers |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Registered Subscribers Connected Number of registered and connected subscribers |
RegisteredSubscribersConnected |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Registered Subscribers Idle Number of registered and idle subscribers |
RegisteredSubscribersIdle |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Registration Attempts Registration attempts rate (per minute) |
RegistrationAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Registration Failures Registration failure rate (per minute) |
RegistrationFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Result |
PT1M | Yes |
Registration Successes Registration success rate (per minute) |
RegistrationSuccess |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Service Request Attempts Service request attempts rate (per minute) |
ServiceRequestAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Service Request Failures Service request failure rate (per minute) |
ServiceRequestFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Result , Tai |
PT1M | Yes |
Service Request Successes Service request success rate (per minute) |
ServiceRequestSuccess |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Session Establishment Attempts PDU session establishment attempts rate (per minute) |
SessionEstablishmentAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Dnn |
PT1M | Yes |
Session Establishment Failures PDU session establishment failure rate (per minute) |
SessionEstablishmentFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Dnn |
PT1M | Yes |
Session Establishment Successes PDU session establishment success rate (per minute) |
SessionEstablishmentSuccess |
Count | Total (Sum) | 3gppGen , PccpId , SiteId , Dnn |
PT1M | Yes |
Session Releases Session release rate (per minute) |
SessionRelease |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
UE Context Release Commands UE context release command message rate (per minute) |
UeContextReleaseCommand |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
UE Context Release Completes UE context release complete message rate (per minute) |
UeContextReleaseComplete |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
UE Context Release Requests UE context release request message rate (per minute) |
UeContextReleaseRequest |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
User Plane Bandwidth User plane bandwidth in bits/second. |
UserPlaneBandwidth |
BitsPerSecond | Total (Sum) | PcdpId , SiteId , Direction , Interface |
PT1M | No |
User Plane Packet Drop Rate User plane packet drop rate (packets/sec) |
UserPlanePacketDropRate |
CountPerSecond | Total (Sum) | PcdpId , SiteId , Cause , Direction , Interface |
PT1M | No |
User Plane Packet Rate User plane packet rate (packets/sec) |
UserPlanePacketRate |
CountPerSecond | Total (Sum) | PcdpId , SiteId , Direction , Interface |
PT1M | No |
VectorDB API Request Count Total number of API requests to VectorDB |
VectorDbApiRequestCount |
Count | Count | AppName , Method , Route |
PT1M, PT5M, PT15M, PT30M, PT1H, PT6H, PT12H | No |
Xn Handover Attempts Handover attempts rate (per minute) |
XnHandoverAttempt |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Xn Handover Failures Handover failure rate (per minute) |
XnHandoverFailure |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |
Xn Handover Successes Handover success rate (per minute) |
XnHandoverSuccess |
Count | Total (Sum) | 3gppGen , PccpId , SiteId |
PT1M | Yes |