hokie-chan
d27d27f7d7
[fix][worker][bug] master/worker crash when registry recovers from SUSPENDED to RECONNECTED ( #13328 )
2 years ago
Wenjun Ruan
9564bdeefb
cherry-pick Fix task group cannot release when kill task #13314
2 years ago
JieguangZhou
83e2c14be1
fix retry task failure ( #13077 )
2 years ago
Kerwin
d4c6ec4fa1
[Bug-12963] [Master] Fix dependent task node null pointer exception ( #12965 )
...
* Fix that there are both manual and scheduled workflow instances in dependent nodes, and one of them will report a null pointer exception during execution.
2 years ago
Kerwin
6ffac27dbc
fix NPE while retry task ( #12903 )
2 years ago
Kerwin
9126e1d1fc
[3.1.1-prepare][Test] test ut ( #12729 )
2 years ago
Wenjun Ruan
9957df1c41
Fix the waiting strategy cannot recover if the server state is already running ( #12651 )
2 years ago
Wenjun Ruan
0c3ab4335e
cherry-pick Add worker-group-refresh-interval in master config #12601
2 years ago
Kerwin
94ef02e9de
cherry-pick [Improvement] Merge spi.utils into common.utils ( #12607 )
...
#12502
Co-authored-by: caishunfeng <caishunfeng2021@gmail.com>
2 years ago
kezhenxu94
eb7b48596c
cherry-pick Reorganize some classes in common module, remove duplicate classes
...
#12321
2 years ago
sgw
d742994e62
[DS-12131][master] Optimize the log printing of the master module acc… ( #12152 )
...
* [DS-12131][master] Optimize the log printing of the master module according to the log specification.
2 years ago
Wenjun Ruan
a17d0cc5d2
cherry-pick [Improvement][WorkerGroup]Remove workerGroup in registry #12217
2 years ago
Kerwin
58ef3cccd7
[3.1.1-prepare]cherry-pick [Improvement] remove log-server and server module #12206 ( #12562 )
2 years ago
Kerwin
f3277277f0
cherry-pick Refactor LogServiceClient Singleton to avoid repeat creation of NettyClient #11777 ( #12542 )
...
Co-authored-by: Wenjun Ruan <wenjun@apache.org>
2 years ago
Kerwin
344293102d
[3.1.1-prepare][cherry-pick][Improvement] Add remote task model #11767 ( #12541 )
2 years ago
Yann Ann
35608becac
make sure all failed task will save in errorTaskMap ( #12424 )
2 years ago
Kerwin
2402b8a6ef
Use temurin Java docker images instead of deprecated ones ( #12334 ) ( #12421 )
...
Co-authored-by: kezhenxu94 <kezhenxu94@apache.org>
2 years ago
Yann Ann
3e899bee06
[fix#12000]Cannot remove the WorkerGroup from the master service ( #12050 )
...
* [Bug] [Master] Cannot remove the WorkerGroup from the master service. #12000
* remove unnecessary locks
* Update dolphinscheduler-master/src/main/java/org/apache/dolphinscheduler/server/master/registry/ServerNodeManager.java
Co-authored-by: caishunfeng <caishunfeng2021@gmail.com>
2 years ago
caishunfeng
780a509f67
fix workflow keep running when task fail ( #11930 )
2 years ago
Daniel Y
2a437607ae
[Improvement-11834] Upgrade docker base image to support python3.9 ( #11835 )
...
* use openjdk:8-jre-slim-bullseye instead of buster to support python3.9
* use openjdk:8-jre-slim-bullseye instead of buster across all components
2 years ago
LiuBodong
0f3b42925f
[Fix][Monitor] Monitor UI does not show DiskAvailable and MemoryUsage correctly ( #11870 )
2 years ago
Kengo Seki
216ceea641
[Improve] Upgrade Hadoop to 3.2.4 ( #11832 )
2 years ago
kezhenxu94
fcc75ef1c6
Bump up dependencies to fix cves ( #11765 )
2 years ago
Kengo Seki
52b79b017e
[Improvement] Replace commons-lang 2 function invocations with commons-lang3 ( #11810 )
...
* [Improvement] Replace commons-lang 2 function invocations with commons-lang3
2 years ago
caishunfeng
27b69e608a
[Bug] [Master] WorkflowExecuteRunnable will face an infinite loop #11838 ( #11864 ) ( #11950 )
...
* [Bug] [Master] WorkflowExecuteRunnable will face an infinite loop #11838
Co-authored-by: Yann Ann <2993643785@qq.com>
2 years ago
caishunfeng
f8d46a26c1
[Feature-11530] add state history for process instance ( #11757 )
...
* [Feature] add state history for process instance (#97 )
* add state history for process instance
* upsertProcessInstance
* remove unused method
* fix UT
Co-authored-by: caishunfeng <534328519@qq.com>
2 years ago
kezhenxu94
277f137358
Add Kubernetes configmap reload to all components ( #11730 )
2 years ago
caishunfeng
f95cdd350a
reduce task dependencies ( #11731 )
2 years ago
Wenjun Ruan
67e7f88d8b
Refactor heart beat task, use json to serialize/deserialize ( #11702 )
...
* Refactor heart beat task, use json to serialize/deserialize
2 years ago
Wenjun Ruan
b96d69701a
[Improvement-#11613] Add spi priority factory ( #11614 )
...
* Add spi priority factory
* Add doc
* Add override log
* Use lombok
* Add comment
2 years ago
JinYong Li
3f2ca7bca3
[Fix-9980] [Server] fix heartBeatTaskCount bug ( #11232 )
...
* fix heartBeat bug
* modify class name
* fix conflict
Co-authored-by: JinyLeeChina <jiny.li@foxmail.com>
2 years ago
insist777
abfef1a929
[improvement] Add two parameters in workergroup, and support the application of description display and other parameters ( #11542 )
...
* [Improvement] Improvement default worker group
* [improvement] The default worker grouping supports editing
* Update WorkerGroupServiceImpl.java
* The default worker grouping supports editing
* update sql file
* update
2 years ago
Wenjun Ruan
4d13a5104b
Fix kill task failed will cause the taskGroup cannot release and add taskGroup log ( #11469 )
2 years ago
Wenjun Ruan
a41c6824fc
Add dependent task instance log ( #11541 )
...
* Add dependent task instance log
* Optimize log
* Fix dependent task initialize failed will throw exception
2 years ago
Wenjun Ruan
8886d5a126
Fix quartz threadPriority config name error ( #11596 )
...
* Fix quartz threadPriority config name error
* Add batchTriggerAcquisitionMaxCount config
2 years ago
Wenjun Ruan
3516533017
Remove logger header in task log file ( #11555 )
2 years ago
kezhenxu94
f6ef628030
Support reading application properties from kubernetes config map ( #11402 )
2 years ago
Wenjun Ruan
4362fb8448
If the task processor is not found need to throw error rather than exception ( #11461 )
2 years ago
Wenjun Ruan
05589606a2
Set master's task running status in `runTask` to avoid the task group acquire failed, but the task status is in running ( #11451 )
2 years ago
Wenjun Ruan
7ff34c3947
[Feature-7024] Add waiting strategy to support master/worker can recover from registry lost ( #11368 )
...
* Add waiting strategy to support master/worker can recover from registry lost
* throw exception when zookeeper registry start failed due to interrupted
2 years ago
Wenjun Ruan
0ca3086296
Fix workflow instance may fail over many times because the restart time is not checked ( #11445 )
2 years ago
Wenjun Ruan
a2b5219315
Fix find last process instance in dependent task is not correct ( #11424 )
2 years ago
Wenjun Ruan
47b50067ef
Clear thread name in BaseTaskProcessor ( #11422 )
2 years ago
Wenjun Ruan
496c2d4bfa
[Fix-11413] Cannot set task status to kill if the task is not running ( #11414 )
2 years ago
Wenjun Ruan
5fa3e7b1ed
Add pause to task instance status ( #11390 )
2 years ago
caishunfeng
0464123c2b
[Feature-11223] support stream task ( #11350 )
...
* add task execute type
* update task definition list paging
* update task instance list paging
* stream task start
* [Feature][UI] Some changes to execute task.
* Set the connection edge to dashed line.
* Add FLINK_STREAM task.
* add stream task
* flink savepoint and cancel
* fix query bug
* add stream task definition
* add task instance for stream task
* delete stream task definition state
* update api for stream task definition edit
* modify search for stream task instance
* add language
* delete task type search for stream task definition
* change task type search for stream task instance
* add jump button
* add savepoint
* add down log for stream task instance
* ui test
* stream task start
* run DAG
* [Fix][UI] Fix the stream task edges not being dashed when filling back.
* [Feature][UI] Remove some fields for FLINK_STREAM.
* add start modal
* add dryRun column for stream task instance
* fix duration
* fix pon
* fix build error
* Add success tip
* add auto sync for stream task instance
* remove foreign key for task instance
* license header
* UT fix
* modify locales
* recover common config
* fix UT
* add doc
Co-authored-by: Amy <amywang0104@163.com>
Co-authored-by: devosend <devosend@gmail.com>
2 years ago
Wenjun Ruan
8774415197
Split ExecutionStatus to WorkflowExecutionStatus and TaskExecutionStatus ( #11340 )
2 years ago
Wenjun Ruan
acd3d3fab1
Fix TaskGroup cannot work and will cause master dead loop ( #11254 )
...
* Fix TaskGroup cannot work and will cause master dead loop
* Remove acquireTaskGroupAgain in ProcessServiceImpl
2 years ago
Eric Gao
9ca1eb96c4
[Improvement][Metrics] Add metrics for alert server ( #11240 )
...
* [Improvement][Metrics] Add metrics for alert server (#11131 )
* Update related docs of metrics
* Add grafana demo dashboards for alert server metrics
* Refactor metric classes with UtilityClass annotation
* Refactor meter names in camelCase for checkstyle
2 years ago
Wenjun Ruan
04f3aa9713
Fix recovery from failed task will dead loop ( #11239 )
2 years ago