Wenjun Ruan
4362fb8448
If the task processor is not found need to throw error rather than exception ( #11461 )
2 years ago
Wenjun Ruan
05589606a2
Set master's task running status in `runTask` to avoid the task group acquire failed, but the task status is in running ( #11451 )
2 years ago
Wenjun Ruan
7ff34c3947
[Feature-7024] Add waiting strategy to support master/worker can recover from registry lost ( #11368 )
...
* Add waiting strategy to support master/worker can recover from registry lost
* throw exception when zookeeper registry start failed due to interrupted
2 years ago
Wenjun Ruan
0ca3086296
Fix workflow instance may failover many times due to doesn't check the restart time ( #11445 )
2 years ago
Wenjun Ruan
a2b5219315
Fix find last process instance in dependent task is not correct ( #11424 )
2 years ago
Wenjun Ruan
47b50067ef
Clear thread name in BaseTaskProcessor ( #11422 )
2 years ago
Wenjun Ruan
496c2d4bfa
[Fix-11413] Cannot set task status to kill if the task is not in running( #11414 )
2 years ago
Wenjun Ruan
5fa3e7b1ed
Add pause to task instance status ( #11390 )
2 years ago
caishunfeng
0464123c2b
[Feature-11223] support stream task ( #11350 )
...
* add task execute type
* update task definition list paging
* update task instance list paging
* stream task start
* [Feature][UI] Some changes to execute task.
* Set the connection edge to dashed line.
* Add FLINK_STREAM task.
* add stream task
* flink savepoint and cancel
* fix query bug
* add stream task definition
* add task instance for stream task
* delete stream task definition state
* update api for stream task definition edit
* modify search for stream task instance
* add language
* delete task type search for stream task definition
* change task type search for stream task instance
* add jump button
* add savepoint
* add down log for stream task instance
* ui test
* stream task start
* run DAG
* [Fix][UI] Fix the stream task edgs not to be dashed when filling back.
* [Feature][UI] Remove some fields for FLINK_STREAM.
* add start modal
* add dryRun column for stream task instance
* fix duration
* fix pon
* fix build error
* Add success tip
* add auto sync for stream task instance
* remove forgien key for task instance
* license header
* UT fix
* modify locales
* recover common config
* fix UT
* add doc
Co-authored-by: Amy <amywang0104@163.com>
Co-authored-by: devosend <devosend@gmail.com>
2 years ago
Wenjun Ruan
8774415197
Split ExecutionStatus to WorkflowExecutionStatus and TaskExecutionStatus ( #11340 )
2 years ago
Wenjun Ruan
acd3d3fab1
Fix TaskGroup cannot work and will cause master dead loop ( #11254 )
...
* Fix TaskGroup cannot work and will cause master dead loop
* Remove acquireTaskGroupAgain in ProcessServiceImpl
2 years ago
Eric Gao
9ca1eb96c4
[Improvement][Metrics] Add metrics for alert server ( #11240 )
...
* [Improvement][Metrics] Add metrics for alert server (#11131 )
* Update related docs of metrics
* Add grafana demo dashboards for alert server metrics
* Refactor metric classes with UtilityClass annotation
* Refactor meter names in camelCase for checkstyle
2 years ago
Wenjun Ruan
04f3aa9713
Fix recovery from failed task will dead loop ( #11239 )
2 years ago
Wenjun Ruan
8e21c38c00
Write alert result into db ( #11221 )
2 years ago
caishunfeng
6c7e00c4aa
[Bug-11101] fix task failover NPE ( #11168 )
2 years ago
Eric Gao
0e15ce3389
[Improvement][Metrics] Switch to use tags to indicate task / workflow execution status for metrics ( #11128 )
...
* [Improvement][Metrics] Switch to use tags to indicate task / workflow execution status for metrics (#10867 )
* Update docs and grafana demo dashboards
2 years ago
zhuxt2015
3701a24d15
[Improvement][Task Log] Task status log print description instead of code ( #11009 )
...
* use execution status instead of status code
2 years ago
Wenjun Ruan
5e9c7dad23
Add dolphinscheduler-bom to manage the dependency version ( #11025 )
2 years ago
Wenjun Ruan
e1c78d1a49
Move utils, builder under master ( #11000 )
2 years ago
Wenjun Ruan
083ab2b5c9
Remove dao in worker ( #10994 )
2 years ago
Wenjun Ruan
2be1d4bf0a
Fix worker cannot shutdown due to resource close failed or heart beat check failed ( #10979 )
...
* Use try-with-resource to close resource, and add heart error threshold to avoid worker cannot close due to heart beat check failed
* Move heartbeat error threshold to applicaiton.yml
2 years ago
Wenjun Ruan
998e4d74dd
Kill task when we do master failover to avoid task lost ( #10997 )
2 years ago
Wenjun Ruan
b245e7c973
Fix kill yarn job error when failover caused by doesn't set ProcessDefinition ( #10948 )
2 years ago
Wenjun Ruan
df0416c193
Fix taskInstance's host is not worker nettyServer address ( #10926 )
...
* Fix taskInstance's host is not worker nettyServer address
* Remove unnecessary mock
2 years ago
WangJPLeo
9f34a837b8
[Fix-10909] Returns executionStatus for each process instance status and access token module support. ( #10922 )
...
* Returns executionStatus for each process instance status and access token module support
2 years ago
Wenjun Ruan
2a67866718
Catch exception when check state in StateWheelExecuteThread ( #10908 )
...
* Catch exception when check state
2 years ago
Wenjun Ruan
cade66a9b6
[Fix-10827] Fix network error cause worker cannot send message to master ( #10886 )
...
* Fix network error cause worker cannot send message to master
2 years ago
caishunfeng
aa8b88a875
[Feature-10871] add workflow executing data query ( #10875 )
...
* add workflow executing data query
* fix sonar check for interrupted
2 years ago
Wenjun Ruan
f639a2eed4
[Fix-10854] Fix database restart may lost task instance status ( #10866 )
...
* Fix database update error doesn't rollback the task instance status
* Fix database error may cause workflow dead with running status
2 years ago
WangJPLeo
b5653ea7f2
Complement numbers will run in a loop under the serial strategy fixed. ( #10862 )
...
* Complement numbers will run in a loop under the serial strategy fixed.
* e2e rerun
2 years ago
Wenjun Ruan
3f69ec8f28
[Fix-10842] Fix master/worker failover will cause status incorrect ( #10839 )
...
* Fix master failover will not update task instance status
* Add some failover log
* Fix worker failover will rerun task more than once
* Fix workflowInstance failover may rerun already success taskInstance
2 years ago
Wenjun Ruan
426567348e
Remove quartz in service ( #10748 )
...
* Remove quartz in service
2 years ago
Wenjun Ruan
67d14fb7b3
[Fix-10785] Fix state event handle error will not retry ( #10786 )
...
* Fix state event handle error will not retry
* Use state event handler to deal with the event
2 years ago
Wenjun Ruan
35a10d092f
[Fix-10666] Workflow submit failed will still in memory and never retry ( #10667 )
...
* Workflow submit failed will still in memory and never retry
2 years ago
WangJPLeo
8f621ff98b
[Optimization] Calculate global parameter and local parameter at master. ( #10704 )
...
* Global parameter and local parameter calculation external expansion.
* k8s task ut fix.
* TimePlaceholderUtils import DateUtils fix
* follow the review comments to fix.
* follow the review comments to fix.
* e2e rerun
2 years ago
Wenjun Ruan
247ca4ae8a
[Improvement-10617] Add comment in slot check ( #10618 )
2 years ago
Wenjun Ruan
35b25da863
Validate master/worker config ( #10649 )
2 years ago
Wenjun Ruan
66624c5c86
[Bug] [Master] Worker failover will cause task cannot be failover ( #10631 )
...
* fix worker failover may lose event
2 years ago
WangJPLeo
b5184138fa
[Feature] Time function analysis extension. ( #10624 )
...
* Time function analysis extension.
* param add.
* clear useless logs and update method notes
* permission omission fix.
* extending time functions to optimize static methods
* e2e rerun.
2 years ago
pinkhello
628c8ab2a1
[Improvement][Master] batch remove TaskInstaceId and workflowInstanceId ( #10620 )
2 years ago
xiangzihao
397e51fa63
dependent node log output optimization and fix checkstyle ( #10593 )
2 years ago
xiangzihao
1111371c9a
add datasource health check to the healthcheck endpoint ( #10588 )
2 years ago
xiangzihao
0f38217b12
fix_10514 ( #10568 )
2 years ago
Wenjun Ruan
db595b3eff
Optimize master log, use MDC to inject workflow instance id and task instance id in log ( #10516 )
...
* Optimize master log, add workflow instance id and task instance id in log
* Use MDC to set the workflow info in log4j
* Add workflowInstanceId and taskInstanceId in MDC
2 years ago
WangJPLeo
f46faa02c3
[Fix-10452] Serial wait for policy recovery ( #10453 )
...
* Serial wait for policy recovery
* processInstance state check null
* add sendresult(Host host, Command command) method and replace the original.
2 years ago
Eric Gao
cc06eaaf54
[Improvement][Metrics] Apply micrometer naming convention to metrics ( #10477 )
...
* Apply micrometer naming convention to worker metrics
* Apply micrometer naming convention all current metrics
* Fix remaining metrics names, update English docs and add Chinese docs
* Fix metrics names in grafana-demo dashboards
2 years ago
Wenjun Ruan
b100f6c489
Remove the schedule thread in LowerWeightHostManager ( #10310 )
2 years ago
Wenjun Ruan
117f78ec4b
[Fix-10413] Fix Master startup failure the server still hang ( #10500 )
...
* Fix Master startup failure the server still hang
2 years ago
Wenjun Ruan
0bdfa0cff9
Fix PeerTaskInstancePriorityQueue cannot contains method use taskInstanceId to check ( #10491 )
2 years ago
Wenjun Ruan
ad2646ff1f
Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe ( #10479 )
...
* Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe
2 years ago