Browse Source
* Add mater metrics * fix UT * Add url to mysql profile * Add worker metrics * Update grafana config * Add system metrics doc * Add process failover counter * Add metrics image * Change jpg to png * Add command insert metrics * Fix UT * Revert UT3.1.0-release
Wenjun Ruan
2 years ago
committed by
GitHub
40 changed files with 4496 additions and 567 deletions
@ -0,0 +1,154 @@
|
||||
# Introduction |
||||
|
||||
Apache DolphinScheduler has export some metrics to monitor the system. We use micrometer for the exporter facade, and |
||||
the default exporter is prometheus, more exporter is coming soon. |
||||
|
||||
## Quick Start |
||||
|
||||
You can add the following config in master/worker/alert/api's yaml file to open the metrics exporter. |
||||
|
||||
```yaml |
||||
metrics: |
||||
enabled: true |
||||
``` |
||||
|
||||
Once you open the metrics exporter, you can access the metrics by the url: `http://ip:port/actuator/prometheus` |
||||
|
||||
The exporter port is the `server.port` defined in application.yaml, e.g: master: `server.port: 5679`, worker: `server.port: 1235`, alert: `server.port: 50053`, api: `server.port: 12345`. |
||||
|
||||
For example, you can get the master metrics by `curl http://localhost:5679/actuator/prometheus` |
||||
|
||||
We have prepared the out-of-the-box Grafana configuration for you, you can find the Grafana dashboard |
||||
at `dolphinscheduler-meter/resources/grafana`, you can directly import these dashboards to grafana. |
||||
|
||||
If you want to try at docker, you can use the following command to start the prometheus with grafana: |
||||
|
||||
```shell |
||||
cd dolphinscheduler-meter/src/main/resources/grafana-demo |
||||
docker compose up |
||||
``` |
||||
|
||||
Then you can access the grafana by the url: `http://localhost/3001` |
||||
|
||||
![image.png](../../../../img/metrics/metrics-master.png) |
||||
![image.png](../../../../img/metrics/metrics-worker.png) |
||||
![image.png](../../../../img/metrics/metrics-datasource.png) |
||||
|
||||
## Master Metrics |
||||
|
||||
Master metrics are exported by the DolphinScheduler master server. |
||||
|
||||
### System Metrics |
||||
|
||||
* dolphinscheduler_master_overload_count: Indicates the number of times the master has been overloaded. |
||||
* dolphinscheduler_master_consume_command_count: Indicates the number of commands has consumed. |
||||
|
||||
### Process Metrics |
||||
|
||||
* dolphinscheduler_create_command_count: Indicates the number of command has been inserted. |
||||
* dolphinscheduler_process_instance_submit_count: Indicates the number of process has been submitted. |
||||
* dolphinscheduler_process_instance_running_gauge: Indicates the number of process are running now. |
||||
* dolphinscheduler_process_instance_timeout_count: Indicates the number of process has been timeout. |
||||
* dolphinscheduler_process_instance_finish_count: Indicates the number of process has been finished, include success or |
||||
failure. |
||||
* dolphinscheduler_process_instance_success_count: Indicates the number of process has been successful. |
||||
* dolphinscheduler_process_instance_stop_count: Indicates the number of process has been stopped. |
||||
* dolphinscheduler_process_instance_failover_count: Indicates the number of process has been failed over. |
||||
|
||||
### Task Metrics |
||||
|
||||
* dolphinscheduler_task_timeout_count: Indicates the number of tasks has been timeout. |
||||
* dolphinscheduler_task_finish_count: Indicates the number of tasks has been finished, include success or failure. |
||||
* dolphinscheduler_task_success_count: Indicates the number of tasks has been successful. |
||||
* dolphinscheduler_task_timeout_count: Indicates the number of tasks has been timeout. |
||||
* dolphinscheduler_task_retry_count: Indicates the number of tasks has been retry. |
||||
* dolphinscheduler_task_failover_count: Indicates the number of tasks has been failover. |
||||
* dolphinscheduler_task_dispatch_count: Indicates the number of tasks has been dispatched to worker. |
||||
* dolphinscheduler_task_dispatch_failed_count: Indicates the number of tasks dispatched failed, if dispatched failed |
||||
will retry. |
||||
* dolphinscheduler_task_dispatch_error_count: Indicates the number of tasks dispatched error, if dispatched error, means |
||||
there are exception occur. |
||||
|
||||
## Worker Metrics |
||||
|
||||
Worker metrics are exported by the DolphinScheduler worker server. |
||||
|
||||
### System Metrics |
||||
|
||||
* dolphinscheduler_worker_overload_count: Indicates the number of times the worker has been overloaded. |
||||
* dolphinscheduler_worker_submit_queue_is_full_count: Indicates the number of times the worker's submit queue has been |
||||
full. |
||||
|
||||
### Task Metrics |
||||
|
||||
* dolphinscheduler_task_execute_count: Indicates the number of times a task has been executed, it contains a tag - |
||||
`task_type`. |
||||
* dolphinscheduler_task_execution_count: Indicates the total number of task has been executed. |
||||
* dolphinscheduler_task_execution_timer: Indicates the time spent executing tasks. |
||||
|
||||
## Default System Metrics |
||||
|
||||
In each server, there are some default metrics related to the system instance. |
||||
|
||||
### Database Metrics |
||||
|
||||
* hikaricp_connections_creation_seconds_max: Connection creation time max. |
||||
* hikaricp_connections_creation_seconds_count: Connection creation time count. |
||||
* hikaricp_connections_creation_seconds_sum: Connection creation time sum. |
||||
* hikaricp_connections_acquire_seconds_max: Connection acquire time max. |
||||
* hikaricp_connections_acquire_seconds_count: Connection acquire time count. |
||||
* hikaricp_connections_acquire_seconds_sum: Connection acquire time sum. |
||||
* hikaricp_connections_usage_seconds_max: Connection usage max. |
||||
* hikaricp_connections_usage_seconds_count: Connection usage time count. |
||||
* hikaricp_connections_usage_seconds_sum: Connection usage time sum. |
||||
* hikaricp_connections_max: Max connections. |
||||
* hikaricp_connections_min Min connections |
||||
* hikaricp_connections_active: Active connections. |
||||
* hikaricp_connections_idle: Idle connections. |
||||
* hikaricp_connections_pending: Pending connections. |
||||
* hikaricp_connections_timeout_total: Timeout connections. |
||||
* hikaricp_connections: Total connections |
||||
* jdbc_connections_max: Maximum number of active connections that can be allocated at the same time. |
||||
* jdbc_connections_min: Minimum number of idle connections in the pool. |
||||
* jdbc_connections_idle: Number of established but idle connections. |
||||
* jdbc_connections_active: Current number of active connections that have been allocated from the data source. |
||||
|
||||
### JVM Metrics |
||||
|
||||
* jvm_buffer_total_capacity_bytes: An estimate of the total capacity of the buffers in this pool. |
||||
* jvm_buffer_count_buffers: An estimate of the number of buffers in the pool. |
||||
* jvm_buffer_memory_used_bytes: An estimate of the memory that the Java virtual machine is using for this buffer pool. |
||||
* jvm_memory_committed_bytes: The amount of memory in bytes that is committed for the Java virtual machine to use. |
||||
* jvm_memory_max_bytes: The maximum amount of memory in bytes that can be used for memory management. |
||||
* jvm_memory_used_bytes: The amount of used memory. |
||||
* jvm_threads_peak_threads: The peak live thread count since the Java virtual machine started or peak was reset. |
||||
* jvm_threads_states_threads: The current number of threads having NEW state. |
||||
* jvm_gc_memory_allocated_bytes_total: Incremented for an increase in the size of the (young) heap memory pool after one GC to before the next. |
||||
* jvm_gc_max_data_size_bytes: Max size of long-lived heap memory pool. |
||||
* jvm_gc_pause_seconds_count: Time spent count in GC pause. |
||||
* jvm_gc_pause_seconds_sum: Time spent sum in GC pause. |
||||
* jvm_gc_pause_seconds_max: Time spent max in GC pause. |
||||
* jvm_gc_live_data_size_bytes: Size of long-lived heap memory pool after reclamation. |
||||
* jvm_gc_memory_promoted_bytes_total: Count of positive increases in the size of the old generation memory pool before GC to after GC. |
||||
* jvm_classes_loaded_classes: The number of classes that are currently loaded in the Java virtual machine. |
||||
* jvm_threads_live_threads: The current number of live threads including both daemon and non-daemon threads. |
||||
* jvm_threads_daemon_threads: The current number of live daemon threads. |
||||
* jvm_classes_unloaded_classes_total: The total number of classes unloaded since the Java virtual machine has started execution. |
||||
* process_cpu_usage: The "recent cpu usage" for the Java Virtual Machine process. |
||||
* process_start_time_seconds: Start time of the process since unix epoch. |
||||
* process_uptime_seconds: The uptime of the Java virtual machine. |
||||
|
||||
|
||||
## Other Metrics |
||||
* jetty_threads_config_max: The maximum number of threads in the pool. |
||||
* jetty_threads_config_min: The minimum number of threads in the pool. |
||||
* jetty_threads_current: The total number of threads in the pool. |
||||
* jetty_threads_idle: The number of idle threads in the pool. |
||||
* jetty_threads_busy: The number of busy threads in the pool. |
||||
* jetty_threads_jobs: Number of jobs queued waiting for a thread. |
||||
* process_files_max_files: The maximum file descriptor count. |
||||
* process_files_open_files: The open file descriptor count. |
||||
* system_cpu_usage: The "recent cpu usage" for the whole system. |
||||
* system_cpu_count: The number of processors available to the Java virtual machine. |
||||
* system_load_average_1m: The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time. |
||||
* logback_events_total: Number of level events that made it to the logs |
After Width: | Height: | Size: 329 KiB |
After Width: | Height: | Size: 457 KiB |
After Width: | Height: | Size: 394 KiB |
@ -0,0 +1,53 @@
|
||||
/* |
||||
* Licensed to the Apache Software Foundation (ASF) under one or more |
||||
* contributor license agreements. See the NOTICE file distributed with |
||||
* this work for additional information regarding copyright ownership. |
||||
* The ASF licenses this file to You under the Apache License, Version 2.0 |
||||
* (the "License"); you may not use this file except in compliance with |
||||
* the License. You may obtain a copy of the License at |
||||
* |
||||
* http://www.apache.org/licenses/LICENSE-2.0
|
||||
* |
||||
* Unless required by applicable law or agreed to in writing, software |
||||
* distributed under the License is distributed on an "AS IS" BASIS, |
||||
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
||||
* See the License for the specific language governing permissions and |
||||
* limitations under the License. |
||||
*/ |
||||
|
||||
package org.apache.dolphinscheduler.server.master.metrics; |
||||
|
||||
import io.micrometer.core.instrument.Counter; |
||||
import io.micrometer.core.instrument.Metrics; |
||||
|
||||
public final class MasterServerMetrics { |
||||
|
||||
private MasterServerMetrics() { |
||||
throw new UnsupportedOperationException("Utility class"); |
||||
} |
||||
|
||||
/** |
||||
* Used to measure the master server is overload. |
||||
*/ |
||||
private static final Counter MASTER_OVERLOAD_COUNTER = |
||||
Counter.builder("dolphinscheduler_master_overload_count") |
||||
.description("Master server overload count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
/** |
||||
* Used to measure the number of process command consumed by master. |
||||
*/ |
||||
private static final Counter MASTER_CONSUME_COMMAND_COUNTER = |
||||
Counter.builder("dolphinscheduler_master_consume_command_count") |
||||
.description("Master server consume command count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
public static void incMasterOverload() { |
||||
MASTER_OVERLOAD_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incMasterConsumeCommand(int commandCount) { |
||||
MASTER_CONSUME_COMMAND_COUNTER.increment(commandCount); |
||||
} |
||||
|
||||
} |
@ -0,0 +1,101 @@
|
||||
/* |
||||
* Licensed to the Apache Software Foundation (ASF) under one or more |
||||
* contributor license agreements. See the NOTICE file distributed with |
||||
* this work for additional information regarding copyright ownership. |
||||
* The ASF licenses this file to You under the Apache License, Version 2.0 |
||||
* (the "License"); you may not use this file except in compliance with |
||||
* the License. You may obtain a copy of the License at |
||||
* |
||||
* http://www.apache.org/licenses/LICENSE-2.0
|
||||
* |
||||
* Unless required by applicable law or agreed to in writing, software |
||||
* distributed under the License is distributed on an "AS IS" BASIS, |
||||
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
||||
* See the License for the specific language governing permissions and |
||||
* limitations under the License. |
||||
*/ |
||||
|
||||
package org.apache.dolphinscheduler.server.master.metrics; |
||||
|
||||
import java.util.function.Supplier; |
||||
|
||||
import io.micrometer.core.instrument.Counter; |
||||
import io.micrometer.core.instrument.Gauge; |
||||
import io.micrometer.core.instrument.Metrics; |
||||
|
||||
public final class ProcessInstanceMetrics { |
||||
|
||||
private ProcessInstanceMetrics() { |
||||
throw new UnsupportedOperationException("Utility class"); |
||||
} |
||||
|
||||
private static final Counter PROCESS_INSTANCE_SUBMIT_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_submit_count") |
||||
.description("Process instance submit total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter PROCESS_INSTANCE_TIMEOUT_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_timeout_count") |
||||
.description("Process instance timeout total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter PROCESS_INSTANCE_FINISH_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_finish_count") |
||||
.description("Process instance finish total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter PROCESS_INSTANCE_SUCCESS_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_success_count") |
||||
.description("Process instance success total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter PROCESS_INSTANCE_FAILURE_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_failure_count") |
||||
.description("Process instance failure total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter PROCESS_INSTANCE_STOP_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_stop_count") |
||||
.description("Process instance stop total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter PROCESS_INSTANCE_FAILOVER_COUNTER = |
||||
Counter.builder("dolphinscheduler_process_instance_failover_count") |
||||
.description("Process instance failover total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
public static synchronized void registerProcessInstanceRunningGauge(Supplier<Number> function) { |
||||
Gauge.builder("dolphinscheduler_process_instance_running_gauge", function) |
||||
.description("The current running process instance count") |
||||
.register(Metrics.globalRegistry); |
||||
} |
||||
|
||||
public static void incProcessInstanceSubmit() { |
||||
PROCESS_INSTANCE_SUBMIT_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incProcessInstanceTimeout() { |
||||
PROCESS_INSTANCE_TIMEOUT_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incProcessInstanceFinish() { |
||||
PROCESS_INSTANCE_FINISH_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incProcessInstanceSuccess() { |
||||
PROCESS_INSTANCE_SUCCESS_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incProcessInstanceFailure() { |
||||
PROCESS_INSTANCE_FAILURE_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incProcessInstanceStop() { |
||||
PROCESS_INSTANCE_STOP_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incProcessInstanceFailover() { |
||||
PROCESS_INSTANCE_FAILOVER_COUNTER.increment(); |
||||
} |
||||
|
||||
} |
@ -0,0 +1,137 @@
|
||||
/* |
||||
* Licensed to the Apache Software Foundation (ASF) under one or more |
||||
* contributor license agreements. See the NOTICE file distributed with |
||||
* this work for additional information regarding copyright ownership. |
||||
* The ASF licenses this file to You under the Apache License, Version 2.0 |
||||
* (the "License"); you may not use this file except in compliance with |
||||
* the License. You may obtain a copy of the License at |
||||
* |
||||
* http://www.apache.org/licenses/LICENSE-2.0
|
||||
* |
||||
* Unless required by applicable law or agreed to in writing, software |
||||
* distributed under the License is distributed on an "AS IS" BASIS, |
||||
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
||||
* See the License for the specific language governing permissions and |
||||
* limitations under the License. |
||||
*/ |
||||
|
||||
package org.apache.dolphinscheduler.server.master.metrics; |
||||
|
||||
import java.util.function.Supplier; |
||||
|
||||
import io.micrometer.core.instrument.Counter; |
||||
import io.micrometer.core.instrument.Gauge; |
||||
import io.micrometer.core.instrument.Metrics; |
||||
|
||||
|
||||
public final class TaskMetrics { |
||||
private TaskMetrics() { |
||||
throw new UnsupportedOperationException("Utility class"); |
||||
} |
||||
|
||||
private static final Counter TASK_SUBMIT_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_submit_count") |
||||
.description("Task submit total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_FINISH_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_finish_count") |
||||
.description("Task finish total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_SUCCESS_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_success_count") |
||||
.description("Task success total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_FAILURE_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_failure_count") |
||||
.description("Task failure total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_TIMEOUT_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_timeout_count") |
||||
.description("Task timeout total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_RETRY_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_retry_count") |
||||
.description("Task retry total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_STOP_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_stop_count") |
||||
.description("Task stop total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_FAILOVER_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_failover_count") |
||||
.description("Task failover total count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_DISPATCH_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_dispatch_count") |
||||
.description("Task dispatch count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_DISPATCHER_FAILED = |
||||
Counter.builder("dolphinscheduler_task_dispatch_failed_count") |
||||
.description("Task dispatch failed count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter TASK_DISPATCH_ERROR = |
||||
Counter.builder("dolphinscheduler_task_dispatch_error_count") |
||||
.description("Task dispatch error") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
public static void incTaskSubmit() { |
||||
TASK_SUBMIT_COUNTER.increment(); |
||||
} |
||||
|
||||
public synchronized static void registerTaskRunning(Supplier<Number> consumer) { |
||||
Gauge.builder("dolphinscheduler_task_running_gauge", consumer) |
||||
.description("Task running count") |
||||
.register(Metrics.globalRegistry); |
||||
} |
||||
|
||||
public static void incTaskFinish() { |
||||
TASK_FINISH_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskSuccess() { |
||||
TASK_SUCCESS_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskFailure() { |
||||
TASK_FAILURE_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskTimeout() { |
||||
TASK_TIMEOUT_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskRetry() { |
||||
TASK_RETRY_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskStop() { |
||||
TASK_STOP_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskFailover() { |
||||
TASK_FAILOVER_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incTaskDispatchFailed(int failedCount) { |
||||
TASK_DISPATCHER_FAILED.increment(failedCount); |
||||
} |
||||
|
||||
public static void incTaskDispatchError() { |
||||
TASK_DISPATCH_ERROR.increment(); |
||||
} |
||||
|
||||
public static void incTaskDispatch() { |
||||
TASK_DISPATCH_COUNTER.increment(); |
||||
} |
||||
|
||||
} |
@ -1,60 +0,0 @@
|
||||
/* |
||||
* Licensed to the Apache Software Foundation (ASF) under one or more |
||||
* contributor license agreements. See the NOTICE file distributed with |
||||
* this work for additional information regarding copyright ownership. |
||||
* The ASF licenses this file to You under the Apache License, Version 2.0 |
||||
* (the "License"); you may not use this file except in compliance with |
||||
* the License. You may obtain a copy of the License at |
||||
* |
||||
* http://www.apache.org/licenses/LICENSE-2.0
|
||||
* |
||||
* Unless required by applicable law or agreed to in writing, software |
||||
* distributed under the License is distributed on an "AS IS" BASIS, |
||||
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
||||
* See the License for the specific language governing permissions and |
||||
* limitations under the License. |
||||
*/ |
||||
|
||||
package org.apache.dolphinscheduler.server.master.registry; |
||||
|
||||
import org.apache.dolphinscheduler.dao.AlertDao; |
||||
import org.apache.dolphinscheduler.dao.mapper.WorkerGroupMapper; |
||||
import org.apache.dolphinscheduler.service.registry.RegistryClient; |
||||
|
||||
import org.junit.Before; |
||||
import org.junit.Test; |
||||
import org.junit.runner.RunWith; |
||||
import org.mockito.Mock; |
||||
import org.powermock.api.mockito.PowerMockito; |
||||
import org.powermock.core.classloader.annotations.PowerMockIgnore; |
||||
import org.powermock.core.classloader.annotations.PrepareForTest; |
||||
import org.powermock.modules.junit4.PowerMockRunner; |
||||
|
||||
/** |
||||
* server node manager test |
||||
*/ |
||||
@RunWith(PowerMockRunner.class) |
||||
@PrepareForTest({ RegistryClient.class }) |
||||
@PowerMockIgnore({"javax.management.*"}) |
||||
public class ServerNodeManagerTest { |
||||
|
||||
private ServerNodeManager serverNodeManager; |
||||
|
||||
@Mock |
||||
private WorkerGroupMapper workerGroupMapper; |
||||
|
||||
@Mock |
||||
private AlertDao alertDao; |
||||
|
||||
@Before |
||||
public void before() { |
||||
PowerMockito.suppress(PowerMockito.constructor(RegistryClient.class)); |
||||
serverNodeManager = PowerMockito.mock(ServerNodeManager.class); |
||||
} |
||||
|
||||
@Test |
||||
public void test(){ |
||||
//serverNodeManager.getWorkerGroupNodes()
|
||||
} |
||||
|
||||
} |
File diff suppressed because it is too large
Load Diff
File diff suppressed because it is too large
Load Diff
@ -0,0 +1,58 @@
|
||||
/* |
||||
* Licensed to the Apache Software Foundation (ASF) under one or more |
||||
* contributor license agreements. See the NOTICE file distributed with |
||||
* this work for additional information regarding copyright ownership. |
||||
* The ASF licenses this file to You under the Apache License, Version 2.0 |
||||
* (the "License"); you may not use this file except in compliance with |
||||
* the License. You may obtain a copy of the License at |
||||
* |
||||
* http://www.apache.org/licenses/LICENSE-2.0
|
||||
* |
||||
* Unless required by applicable law or agreed to in writing, software |
||||
* distributed under the License is distributed on an "AS IS" BASIS, |
||||
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
||||
* See the License for the specific language governing permissions and |
||||
* limitations under the License. |
||||
*/ |
||||
|
||||
package org.apache.dolphinscheduler.server.worker.metrics; |
||||
|
||||
import org.apache.dolphinscheduler.plugin.task.api.TaskChannelFactory; |
||||
|
||||
import java.util.HashMap; |
||||
import java.util.Map; |
||||
import java.util.ServiceLoader; |
||||
|
||||
import io.micrometer.core.instrument.Counter; |
||||
import io.micrometer.core.instrument.Metrics; |
||||
|
||||
public final class TaskMetrics { |
||||
|
||||
private TaskMetrics() { |
||||
throw new UnsupportedOperationException("Utility class"); |
||||
} |
||||
|
||||
private static Map<String, Counter> TASK_TYPE_EXECUTE_COUNTER = new HashMap<>(); |
||||
private static final Counter UNKNOWN_TASK_EXECUTE_COUNTER = |
||||
Counter.builder("dolphinscheduler_task_execute_count") |
||||
.tag("task_type", "unknown") |
||||
.description("task execute counter") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
static { |
||||
for (TaskChannelFactory taskChannelFactory : ServiceLoader.load(TaskChannelFactory.class)) { |
||||
TASK_TYPE_EXECUTE_COUNTER.put( |
||||
taskChannelFactory.getName(), |
||||
Counter.builder("dolphinscheduler_task_execute_count") |
||||
.tag("task_type", taskChannelFactory.getName()) |
||||
.description("task execute counter") |
||||
.register(Metrics.globalRegistry) |
||||
); |
||||
} |
||||
} |
||||
|
||||
public static void incrTaskTypeExecuteCount(String taskType) { |
||||
TASK_TYPE_EXECUTE_COUNTER.getOrDefault(taskType, UNKNOWN_TASK_EXECUTE_COUNTER).increment(); |
||||
} |
||||
|
||||
} |
@ -0,0 +1,56 @@
|
||||
/* |
||||
* Licensed to the Apache Software Foundation (ASF) under one or more |
||||
* contributor license agreements. See the NOTICE file distributed with |
||||
* this work for additional information regarding copyright ownership. |
||||
* The ASF licenses this file to You under the Apache License, Version 2.0 |
||||
* (the "License"); you may not use this file except in compliance with |
||||
* the License. You may obtain a copy of the License at |
||||
* |
||||
* http://www.apache.org/licenses/LICENSE-2.0
|
||||
* |
||||
* Unless required by applicable law or agreed to in writing, software |
||||
* distributed under the License is distributed on an "AS IS" BASIS, |
||||
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
||||
* See the License for the specific language governing permissions and |
||||
* limitations under the License. |
||||
*/ |
||||
|
||||
package org.apache.dolphinscheduler.server.worker.metrics; |
||||
|
||||
import java.util.function.Supplier; |
||||
|
||||
import io.micrometer.core.instrument.Counter; |
||||
import io.micrometer.core.instrument.Gauge; |
||||
import io.micrometer.core.instrument.Metrics; |
||||
|
||||
public final class WorkerServerMetrics { |
||||
|
||||
public WorkerServerMetrics() { |
||||
throw new UnsupportedOperationException("Utility class"); |
||||
} |
||||
|
||||
private static final Counter WORKER_OVERLOAD_COUNTER = |
||||
Counter.builder("dolphinscheduler_worker_overload_count") |
||||
.description("worker load count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
private static final Counter WORKER_SUBMIT_QUEUE_IS_FULL_COUNTER = |
||||
Counter.builder("dolphinscheduler_worker_submit_queue_is_full_count") |
||||
.description("worker task submit queue is full count") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
public static void incWorkerOverloadCount() { |
||||
WORKER_OVERLOAD_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void incWorkerSubmitQueueIsFullCount() { |
||||
WORKER_SUBMIT_QUEUE_IS_FULL_COUNTER.increment(); |
||||
} |
||||
|
||||
public static void registerWorkerRunningTaskGauge(Supplier<Number> supplier) { |
||||
Gauge.builder("dolphinscheduler_worker_running_task_gauge", supplier) |
||||
.description("worker running task gauge") |
||||
.register(Metrics.globalRegistry); |
||||
|
||||
} |
||||
} |
Loading…
Reference in new issue