|
|
@ -73,7 +73,6 @@ For example, you can get the master metrics by `curl http://localhost:5679/actua |
|
|
|
- ds.task.dispatch.failure.count: (counter) the number of tasks failed to dispatch, retry failure included |
|
|
|
- ds.task.dispatch.failure.count: (counter) the number of tasks failed to dispatch, retry failure included |
|
|
|
- ds.task.dispatch.error.count: (counter) the number of task dispatch errors |
|
|
|
- ds.task.dispatch.error.count: (counter) the number of task dispatch errors |
|
|
|
- ds.task.execution.count.by.type: (counter) the number of task executions grouped by tag `task_type` |
|
|
|
- ds.task.execution.count.by.type: (counter) the number of task executions grouped by tag `task_type` |
|
|
|
- ds.task.running: (gauge) the number of running tasks |
|
|
|
|
|
|
|
- ds.task.prepared: (gauge) the number of tasks prepared for task queue |
|
|
|
- ds.task.prepared: (gauge) the number of tasks prepared for task queue |
|
|
|
- ds.task.execution.count: (counter) the number of executed tasks |
|
|
|
- ds.task.execution.count: (counter) the number of executed tasks |
|
|
|
- ds.task.execution.duration: (histogram) duration of task executions |
|
|
|
- ds.task.execution.duration: (histogram) duration of task executions |
|
|
@ -104,6 +103,12 @@ For example, you can get the master metrics by `curl http://localhost:5679/actua |
|
|
|
### Worker Server Metrics |
|
|
|
### Worker Server Metrics |
|
|
|
|
|
|
|
|
|
|
|
- ds.worker.overload.count: (counter) the number of times the worker overloaded |
|
|
|
- ds.worker.overload.count: (counter) the number of times the worker overloaded |
|
|
|
|
|
|
|
- ds.worker.task: (gauge) the number of tasks on the worker, including pending and running ones |
|
|
|
|
|
|
|
- ds.worker.execute.queue.size: (gauge) the number of pending tasks on the worker |
|
|
|
|
|
|
|
- ds.worker.active.execute.thread: (gauge) the number of running tasks on the worker |
|
|
|
|
|
|
|
- ds.worker.memory.available: (gauge) the available physical memory of the worker (GB) |
|
|
|
|
|
|
|
- ds.worker.cpu.usage: (gauge) the cpu usage percentage of the worker |
|
|
|
|
|
|
|
- ds.worker.memory.usage: (gauge) the memory usage percentage of the worker |
|
|
|
- ds.worker.full.submit.queue.count: (counter) the number of times the worker's submit queue being full |
|
|
|
- ds.worker.full.submit.queue.count: (counter) the number of times the worker's submit queue being full |
|
|
|
- ds.worker.resource.download.count: (counter) the number of downloaded resource files on workers, sliced by tag `status` |
|
|
|
- ds.worker.resource.download.count: (counter) the number of downloaded resource files on workers, sliced by tag `status` |
|
|
|
- ds.worker.resource.download.duration: (histogram) the time cost of resource download on workers |
|
|
|
- ds.worker.resource.download.duration: (histogram) the time cost of resource download on workers |
|
|
|