Wenjun Ruan
44ddb6908e
Fix kill yarn job error on failover caused by ProcessDefinition not being set ( #10948 )
...
(cherry picked from commit b245e7c973)
2 years ago
Wenjun Ruan
812d7a8f26
Fix taskInstance's host is not worker nettyServer address ( #10926 )
...
* Fix taskInstance's host is not worker nettyServer address
* Remove unnecessary mock
(cherry picked from commit df0416c193)
2 years ago
Wenjun Ruan
527ee472fb
Catch exception when check state in StateWheelExecuteThread ( #10908 )
...
* Catch exception when check state
(cherry picked from commit 2a67866718)
2 years ago
Wenjun Ruan
9ee20cffef
[Fix-10827] Fix network error cause worker cannot send message to master ( #10886 )
...
* Fix network error cause worker cannot send message to master
(cherry picked from commit cade66a9b6)
2 years ago
Wenjun Ruan
b259deb196
[Fix-10854] Fix database restart may lose task instance status ( #10866 )
...
* Fix database update error doesn't rollback the task instance status
* Fix database error may cause workflow dead with running status
(cherry picked from commit f639a2eed4)
2 years ago
Wenjun Ruan
71edaf41a2
[Fix-10842] Fix master/worker failover will cause status incorrect ( #10839 )
...
* Fix master failover will not update task instance status
* Add some failover log
* Fix worker failover will rerun task more than once
* Fix workflowInstance failover may rerun already success taskInstance
(cherry picked from commit 3f69ec8f28)
2 years ago
Wenjun Ruan
4fc9bce444
[Fix-10785] Fix state event handle error will not retry ( #10786 )
...
* Fix state event handle error will not retry
* Use state event handler to deal with the event
(cherry picked from commit 67d14fb7b3)
2 years ago
Wenjun Ruan
04c47034d4
Add task prepare metrics
2 years ago
Wenjun Ruan
7500e99682
[Fix-10785] Fix state event handle error will not retry ( #10786 )
...
* Fix state event handle error will not retry
* Use state event handler to deal with the event
(cherry picked from commit 67d14fb7b3)
2 years ago
Wenjun Ruan
3b923e5933
[Fix-10666] Workflow submit failed will still in memory and never retry ( #10667 )
...
* Workflow submit failed will still in memory and never retry
(cherry picked from commit 35a10d092f)
2 years ago
Wenjun Ruan
6c83967ebe
[Improvement-10617] Add comment in slot check ( #10618 )
...
(cherry picked from commit 247ca4ae8a)
2 years ago
Wenjun Ruan
4b224ae2e5
Validate master/worker config ( #10649 )
...
(cherry picked from commit 35b25da863)
2 years ago
Wenjun Ruan
db31deb54f
[Bug] [Master] Worker failover will cause task cannot be failover ( #10631 )
...
* fix worker failover may lose event
(cherry picked from commit 66624c5c86)
2 years ago
Wenjun Ruan
fc1c1f6ad1
add CMDPARAM_COMPLEMENT_DATA_SCHEDULE_DATE_LIST
2 years ago
Wenjun Ruan
3ab9ee13fc
Optimize master log, use MDC to inject workflow instance id and task instance id in log ( #10516 )
...
* Optimize master log, add workflow instance id and task instance id in log
* Use MDC to set the workflow info in log4j
* Add workflowInstanceId and taskInstanceId in MDC
(cherry picked from commit db595b3eff)
2 years ago
Wenjun Ruan
9a28d32057
Remove the schedule thread in LowerWeightHostManager ( #10310 )
...
(cherry picked from commit b100f6c489)
2 years ago
Wenjun Ruan
90c87f0121
[Fix-10413] Fix Master startup failure the server still hang ( #10500 )
...
* Fix Master startup failure the server still hang
(cherry picked from commit 117f78ec4b)
2 years ago
Wenjun Ruan
9a59054655
Fix PeerTaskInstancePriorityQueue cannot contains method use taskInstanceId to check ( #10491 )
...
(cherry picked from commit 0bdfa0cff9)
2 years ago
Wenjun Ruan
9a4c7f876a
Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe ( #10479 )
...
* Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe
(cherry picked from commit ad2646ff1f)
2 years ago
Wenjun Ruan
52815975bc
Add some warning log in master ( #10383 )
...
* Add some warn log in master
* fix may skip sleep
(cherry picked from commit b0d9d3f9ab)
2 years ago
Wenjun Ruan
318a8e3ae0
[Feature][metrics] Add master, worker metrics ( #10326 )
...
* Add master metrics
* fix UT
* Add url to mysql profile
* Add worker metrics
* Update grafana config
* Add system metrics doc
* Add process failover counter
* Add metrics image
* Change jpg to png
* Add command insert metrics
* Fix UT
* Revert UT
(cherry picked from commit e21d7b1551)
2 years ago
Wenjun Ruan
81cadd15d2
Optimize MasterServer, add MasterRPCService ( #10371 )
...
* Optimize MasterServer, avoid NPE
(cherry picked from commit 3ecbee3885)
2 years ago
Wenjun Ruan
4ceb420873
Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe ( #10479 )
...
* Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe
(cherry picked from commit ad2646ff1f)
2 years ago
WangJPLeo
7b7ec0f20f
Complement numbers will run in a loop under the serial strategy fixed. ( #10862 )
...
* Complement numbers will run in a loop under the serial strategy fixed.
* e2e rerun
2 years ago
devosend
7ddaa2f47d
[maven-release-plugin] prepare for next development iteration
2 years ago
devosend
0a1b9bdd52
[maven-release-plugin] prepare release 3.0.0-beta-2
2 years ago
devosend
d68dcda2bb
[chore] pre-release change pom.xml
2 years ago
Jiajie Zhong
57ade38939
[maven-release-plugin] prepare release 3.0.0-beta-1
3 years ago
旺阳
e08b08efdd
[improve] Change Mysql Driver ( #10220 )
...
(cherry picked from commit aba5f8a40e)
3 years ago
BaoLiang
b016037a6f
[BUG][TaskGroup] Task group does not take effect ( #10093 )
...
* fix 10092: Task group does not take effect
(cherry picked from commit ee2b855ced)
3 years ago
xiangzihao
98576cb509
[Fix-10049] Conditions Task branch flow failed ( #10077 )
...
(cherry picked from commit 225cb332d1)
3 years ago
caishunfeng
aa51c66d91
[Bug][Master] fix master task failover ( #10065 )
...
* fix master task failover
* ui
(cherry picked from commit 0cc0ee77fa)
3 years ago
WangJPLeo
7b0e6fe5ec
[Fix-9975] The selected task instance was recreated when the Master service fail… ( #9976 )
...
* The selected task instance was recreated when the Master service failed over.
* Returns the expression result directly.
* Use Recovery to determine whether to use the old task instance.
(cherry picked from commit dbdbfeaeee)
3 years ago
Tq
104f67d306
[Bug] [MASTER-9811]fix cmd param to overwrite global param when executing complement ( #9952 )
...
* fix cmd param to overwrite global param when executing complement
(cherry picked from commit d4aeee16e5)
3 years ago
Jiajie Zhong
a9fa6b33a4
[chore] Change release version to 3.0.0-beta-1 ( #9957 )
3 years ago
Paul Zhang
8562f6a878
[Feature][Log]Add timezone information in log output ( #9913 )
3 years ago
WangJPLeo
a1b6b033ad
[Fix-9906] After the serial wait execution strategy stops the running workflow instance, the instance will be woken up and executed if there is a wait instance. ( #9907 )
...
* After the serial wait execution strategy stops the running workflow instance, the instance will be woken up and executed if there is a wait instance.
* clear logic
* Resource overloading
3 years ago
WangJPLeo
fb0f96ed94
[Fix-9868] A task flow definition isolates the runs of different execution strategies by version numbers. ( #9869 )
...
* The thread cache task flow definition should get the latest version.
* Coverage on New Code
* use an existing method.
* Increase unit test coverage.
* Task flow definitions enforce policy isolation.
3 years ago
WangJPLeo
31cd1b5e61
Serial wait for subsequent execution ( #9847 )
3 years ago
WangJPLeo
3cea039239
Task queue status update. ( #9832 )
3 years ago
WangJPLeo
5c0be8a3d7
A task instance that normally queries the serial wait state. ( #9777 )
...
Co-authored-by: WangJPLeo <wangjipeng@whaleops.com>
3 years ago
WangJPLeo
897d7cb555
Add the host address of the execution server to the sub task task instance. ( #9758 )
...
Co-authored-by: WangJPLeo <wangjipeng@whaleops.com>
3 years ago
Jiajie Zhong
de50f43de6
[common] Make dolphinscheduler_env.sh work when start server ( #9726 )
...
* [common] Make dolphinscheduler_env.sh work
* Change dist tarball `dolphinscheduler_env.sh` location
from `bin/` to `conf/`, so users can make all their
configuration changes in a single directory,
and we only need to add `$DOLPHINSCHEDULER_HOME/conf`
when we start our server instead of adding both
`$DOLPHINSCHEDULER_HOME/conf` and `$DOLPHINSCHEDULER_HOME/bin`
* Change the `start.sh`'s path of `dolphinscheduler_env.sh`
* Change the setting order of `dolphinscheduler_env.sh`
* `bin/env/dolphinscheduler_env.sh` will overwrite the `<server>/conf/dolphinscheduler_env.sh`
when starting the server using `bin/dolphinsceduler_daemon.sh` or `bin/install.sh`
* Change the related docs
3 years ago
WangJPLeo
7bcec7115a
[Fix-9717] The failure policy of the task flow takes effect ( #9718 )
...
* Failure policy takes effect.
* Coverage on New Code
* correct description logic
* Compatible with all scenarios
* clearer logic
Co-authored-by: WangJPLeo <wangjipeng@whaleops.com>
3 years ago
caishunfeng
5657cb9aec
[Bug-9719][Master] fix failover fail because task plugins has not been loaded ( #9720 )
3 years ago
gaojun2048
ebc4253d50
[fix][Service] BusinessTime should format with schedule timezone ( #9714 )
...
* BusinessTime should format with schedule timezone
* fix test error
3 years ago
caishunfeng
88d2803fe1
fix task dispatch error overload resource pool of task group ( #9667 )
3 years ago
caishunfeng
63638601b0
fix process pause and rerun ( #9568 )
3 years ago
sparklezzz
508ed9769a
[Fix][Master Server] handle warn+failed timeout strategy in workflow execute thread of master server ( #8077 ) ( #9485 )
...
Co-authored-by: xudong.zhang <xudong.zhang@nio.com>
3 years ago
Paul Zhang
3815a86a3b
[Improvement][Master] Fix typo for MasterTaskExecThreadTest ( #9513 )
3 years ago