https://github.com/kubeflow/katib

sort by:
Revision Author Date Message Commit Date
51f6aa2 User root user explicitely for DB readinessProbe (#962) 12 December 2019, 02:14:32 UTC
aefa758 Fix typo in getKabitJob function name (#965) 12 December 2019, 01:28:32 UTC
4057a58 Use port 8080 for Katib UI (#967) 12 December 2019, 00:30:32 UTC
67a4d84 Validate experiment (#957) * Validate experiment * Stop editing each field 11 December 2019, 04:56:04 UTC
c16821c UI: Support namespace selection in experiment monitor (#950) * Init changes * Support namespace selection in experiment monitor page * Remove console log * Run mock for katib client 10 December 2019, 04:27:29 UTC
983583e Delete v1alpha2 files (#953) 10 December 2019, 01:59:28 UTC
4a97e21 Resume experiment with extra trials from last checkpoint (#952) * Resuming experiment with extra trials * Resuming experiment with extra trials * Adding test script * relative path * Verify if experiment is running again * Adding case when maxtrials is not set 09 December 2019, 05:13:09 UTC
cb7d3e7 Add a gauge metric for current experiments (#954) * add a gauge metric for current experiments Signed-off-by: yeya24 <yb532204897@gmail.com> * fmt & fix test Signed-off-by: yeya24 <yb532204897@gmail.com> 09 December 2019, 01:25:10 UTC
5d46799 feat: Support running (#894) * feat: Support running Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Do not mark trial running when the job is created Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add nil pointer check Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Avoid nil pointer Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> 06 December 2019, 06:30:49 UTC
83eba02 Use kubeflowkatib repo as image repo of example (#949) 05 December 2019, 05:24:32 UTC
330c3bd Update API spec for early stopping (#951) 05 December 2019, 04:42:32 UTC
8a443c4 rename counter metrics (#942) Signed-off-by: yeya24 <yb532204897@gmail.com> 04 December 2019, 16:12:57 UTC
d31c7b6 update deployment api version (#937) Signed-off-by: yeya24 <yb532204897@gmail.com> 04 December 2019, 15:28:57 UTC
6e19a06 Fix fetch trial template (#938) 04 December 2019, 11:38:58 UTC
0cb72d1 Implement metrics custom filters (#947) 04 December 2019, 09:08:58 UTC
68de7a6 Remove katib webhook when undeploy (#935) 29 November 2019, 05:03:03 UTC
b52cd1a Change web failPolicy to fail instead of default ingore (#933) 25 November 2019, 16:29:25 UTC
29b3bc0 feat: Add limit for suggestion pod (#932) * feat: Add limit for suggestion pod Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Format Signed-off-by: Ce Gao <gaoce@caicloud.io> 25 November 2019, 15:41:09 UTC
40f55b4 Support multiple metric logs in one line (#925) * Support multiple metric logs in one line * Modify user image to cover change test 22 November 2019, 06:15:28 UTC
08234c3 Tfevent metriccollector fails when multiple files exist (#920) 15 November 2019, 01:31:37 UTC
6410b03 Handle metricscollector case worker container have no command (#914) * Handle metricscollector case worker container have no command * Change method name 08 November 2019, 01:38:19 UTC
eb6ff94 tfevent-metricscollector support ppc64le (#912) 06 November 2019, 04:38:59 UTC
cd8399d Fix grid suggestion ValidateAlgorithmSettings return (#913) 05 November 2019, 09:12:38 UTC
02ccbff Fix wrong suggestion service endpoint (#911) 05 November 2019, 08:22:38 UTC
c18bab6 Enable arm64 architecture support for katib images and fix grpc health probe multiarch error. (#897) Change-Id: I5ddee7e8fbe96b8e0a025e3f182b4a5192c45597 Signed-off-by: Henry Wang <henry.wang@arm.com> 05 November 2019, 01:52:39 UTC
d9bb39e feat: Support custom database (#910) Signed-off-by: Ce Gao <gaoce@caicloud.io> 04 November 2019, 12:27:40 UTC
a55cf2a Enhance validation for metrics collector (#909) 04 November 2019, 05:05:40 UTC
2df906e Support custom metrics collector kind (#908) * Support custom metrics collector kind * Fix python image version for v1alpha2 04 November 2019, 03:41:40 UTC
c95c144 support ppc64le (#893) * support ppc64le * support grpc_health_probe for available suggestions on ppc64le 04 November 2019, 02:35:40 UTC
975da72 fix: Add Suggestion into CI (#907) * fix: Add Suggestion into CI Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use 3.6 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix Hyperband Signed-off-by: Ce Gao <gaoce@caicloud.io> 02 November 2019, 04:01:40 UTC
574c657 Validate algorithm (#904) 01 November 2019, 01:37:23 UTC
1608d28 Support restarting training job (#901) 30 October 2019, 04:06:51 UTC
606736d Fix katib-manager crash in kubeflow cluster (#900) 28 October 2019, 02:45:24 UTC
0eadfce Revert env for katib-db (#899) 25 October 2019, 22:45:23 UTC
818b879 feat: Patch to fix running condition (#895) * feat: Patch to fix running condition Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Add job dir Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove CI for suggestion Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove tag Signed-off-by: Ce Gao <gaoce@caicloud.io> 25 October 2019, 07:15:39 UTC
7443f02 feat: Add quick start (#878) * feat: Add quick start Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add UI Signed-off-by: Ce Gao <gaoce@caicloud.io> * readme: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add trial detail Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Remove quick start example and use TFJob Signed-off-by: Ce Gao <gaoce@caicloud.io> 18 October 2019, 04:47:57 UTC
2cc7675 Pin operators to 0.7 branch (#885) 16 October 2019, 03:32:08 UTC
8c8094a fix: Use 64 instead of 32 since we are using float64 (#883) Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 October 2019, 15:45:52 UTC
bad9626 fix: Use as instead of , to support python 3 in tfevent metrics collector (#881) * fix: Use as instead of , to support python 3 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Set a longer timeout Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 October 2019, 03:37:53 UTC
8221592 feat: Add event when the reconcile is failed (#879) * feat: Add event when the reconcile is failed Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use format Signed-off-by: Ce Gao <gaoce@caicloud.io> 14 October 2019, 17:42:05 UTC
d93b602 feat: Add events in experiment (#880) * feat: Add events in experiment Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Reorder import Signed-off-by: Ce Gao <gaoce@caicloud.io> 14 October 2019, 16:56:36 UTC
9d7164a Remove unsed katib-manager-rest (#876) 14 October 2019, 00:12:35 UTC
198a63a feat: Refactor to make it easy to extend new kinds (#865) * feat: Refactor to make it easy to extend new kinds Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove hard coded name Signed-off-by: Ce Gao <gaoce@caicloud.io> 12 October 2019, 07:32:38 UTC
fb6739c feat: Support random state in random search (#873) * feat: Support random search Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Update docs Signed-off-by: Ce Gao <gaoce@caicloud.io> 11 October 2019, 08:37:39 UTC
030b691 Add prometheus metrics for experiment and trial (#870) 11 October 2019, 07:37:40 UTC
dd1fb5c fix: Use binary in test (#875) Signed-off-by: Ce Gao <gaoce@caicloud.io> 11 October 2019, 06:39:38 UTC
89ed82b feat: Support env in mysql (#868) * feat: Support env in mysql Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Reuse code Signed-off-by: Ce Gao <gaoce@caicloud.io> 11 October 2019, 05:17:39 UTC
cefb6fe feat: Add liveness probe for DB (#871) * feat: Add liveness probe Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Upgrade mysql Signed-off-by: Ce Gao <gaoce@caicloud.io> 11 October 2019, 04:09:47 UTC
4014465 Remove unused files (#869) 11 October 2019, 01:31:20 UTC
76fb8bf feat: Add doc about algorithm (#867) Signed-off-by: Ce Gao <gaoce@caicloud.io> 10 October 2019, 11:19:43 UTC
e0cf1f1 feat: Add doc about how to add a new kind in trial (#844) Signed-off-by: Ce Gao <gaoce@caicloud.io> 10 October 2019, 03:14:53 UTC
9aa8eae Adding metric unavailability to events (#864) * Adding metric unavailability to events * Adding status condition * Fix tests 09 October 2019, 13:24:14 UTC
d39ee12 Fix worker error silent (#863) * Fix worker error silent * Use 3rd golang tail to replace linux os command * Rename sidecar container name 09 October 2019, 10:05:52 UTC
5cdbde6 feat: Show experiment status in json (#853) Signed-off-by: Ce Gao <gaoce@caicloud.io> 09 October 2019, 06:11:49 UTC
2be5f93 Finish reconcile only after running trials are complete (#861) * Wait for trials to complete * Adding validate as false for operators * Fix tests 09 October 2019, 04:53:49 UTC
2e823b2 Update Readme (#860) * Update README.md * Add to Readme * Update README.md * Update README.md * Update README.md * Update README.md * Update Readme 08 October 2019, 06:23:11 UTC
2f63d44 fix: Fix docs about metrics collection and sugg (#858) Signed-off-by: Ce Gao <gaoce@caicloud.io> 08 October 2019, 06:09:11 UTC
4e96b94 Adding events to trials (#852) * Adding events to trials * Fix tests 04 October 2019, 02:43:55 UTC
0fadd5d chore: Add dockerignore, enhance liveness for manager (#851) * chore: Speedup local build.sh Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Update liveness probe Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Ignore frontend build Signed-off-by: Ce Gao <gaoce@caicloud.io> 03 October 2019, 18:31:55 UTC
31411d9 fix: Reorder (#847) Signed-off-by: Ce Gao <gaoce@caicloud.io> 03 October 2019, 13:14:08 UTC
142fdfc feat: Set default namespace and template for trial (#850) Signed-off-by: Ce Gao <gaoce@caicloud.io> 03 October 2019, 10:20:11 UTC
6039a2e fix: Use namespace to get trial list (#846) Signed-off-by: Ce Gao <gaoce@caicloud.io> 03 October 2019, 03:32:09 UTC
7ade03b [docs] Add suggestion proposal (#726) * feat: Add suggestion redesign doc Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update K8s API Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Update TOC Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix field Signed-off-by: Ce Gao <gaoce@caicloud.io> 30 September 2019, 09:45:37 UTC
fb865e7 feat: Add doc for implementing new algorithms (#769) * feat: Add doc Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> 30 September 2019, 08:35:38 UTC
e0659b4 feat: Support namespace in NAS UI (#839) * feat: Support namespace in monitor Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support namesapce in create tag Signed-off-by: Ce Gao <gaoce@caicloud.io> 30 September 2019, 08:15:38 UTC
cb6de0d feat: Show all experiments in monitor (#835) * feat: Show all experiments Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use namespace Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Move package Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Support multiple namespace Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support deletion Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support info Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support namesapce in submitJob Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove NAS Signed-off-by: Ce Gao <gaoce@caicloud.io> 30 September 2019, 03:07:37 UTC
8fc9db0 Delete jobs when completed (#838) 30 September 2019, 01:41:36 UTC
1267a90 Remove used manager message definition (#837) 29 September 2019, 09:37:37 UTC
d92ec01 Add tfjob and pytorch examples to e2e (#820) * Add tfjob and pytorch examples to e2e * Fix tests * Fix tests * Fix tests * Fix tests * Install crds before katib * Fix tests * Adding timeout to 30 min 29 September 2019, 08:47:38 UTC
18c4a8b fix: Update liveness probe to avoid problems (#833) Signed-off-by: Ce Gao <gaoce@caicloud.io> 29 September 2019, 08:01:38 UTC
d03a551 Remove used katib-manager code (#836) 29 September 2019, 06:37:37 UTC
fbf0726 File metrics collector end to end test (#832) 29 September 2019, 05:31:38 UTC
afaf252 feat: support namespace for trial template (#827) * WIP Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: support namespace for trial template Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use configmap function Signed-off-by: Ce Gao <gaoce@caicloud.io> 29 September 2019, 04:29:37 UTC
69904e9 Remove metrics in DB when delete trial (#830) 29 September 2019, 03:19:37 UTC
adaffc5 Update status conditions during reconcile error (#831) 29 September 2019, 01:43:37 UTC
92069c2 feat: Use env var for namespace (#829) Signed-off-by: Ce Gao <gaoce@caicloud.io> 27 September 2019, 18:41:36 UTC
cad5060 Make sure experiment namespace can inject metriccollector sidecar (#828) 27 September 2019, 08:09:36 UTC
1182029 Doc about katib workflow design (#824) 27 September 2019, 05:17:35 UTC
6a09c61 fix: Support multiple namespaces (#826) Signed-off-by: Ce Gao <gaoce@caicloud.io> 27 September 2019, 05:13:35 UTC
b3f005e feat: Support step when using grid in UI (#821) * fix: Use log Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove hard coded path Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add step for double Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Do not set if it is necessary Signed-off-by: Ce Gao <gaoce@caicloud.io> 26 September 2019, 10:23:06 UTC
3627933 fix: Build e2e-runner (#822) Signed-off-by: Ce Gao <gaoce@caicloud.io> 26 September 2019, 09:43:08 UTC
e26c442 Fix stdout of worker container show nothing (#819) 26 September 2019, 08:41:07 UTC
d39865b feat: Remove useless APIs (#818) * feat: Remove useless APIs Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Remove Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix Signed-off-by: Ce Gao <gaoce@caicloud.io> 26 September 2019, 07:31:09 UTC
e9c91ed feat: Add validation for grid (#812) * feat: Add validation for grid Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Import grpc Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove check in get suggestions Signed-off-by: Ce Gao <gaoce@caicloud.io> 26 September 2019, 06:29:06 UTC
8be1650 Adding additional printer columns for better debugging (#817) 26 September 2019, 05:29:06 UTC
39beda3 metrics-collector role is not usefule any more (#816) 26 September 2019, 04:13:06 UTC
67a9cea Rename algorithm deployment and service (#814) 26 September 2019, 02:55:06 UTC
9c768ca fix: Fix the type (#813) * fix: Fix the type Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: trigger CI Signed-off-by: Ce Gao <gaoce@caicloud.io> 26 September 2019, 02:09:07 UTC
95d7d12 feat: Add tpe e2e test case (#809) Signed-off-by: Ce Gao <gaoce@caicloud.io> 25 September 2019, 11:11:59 UTC
d571094 Remove unused field from Experiment Spec (#806) * Remove unused field from Spec * Remove references 25 September 2019, 10:36:00 UTC
cc76656 feat: Add HyperBand (#787) * feat: Add HyperBand Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Add test in CI Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix name Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix name Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix script Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix r_l Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add parallel trial count Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add output Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Append algorithm settings Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add output Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix useless variable Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use resource_name instead of ResourceName Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Avoid nil pointer exception Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Move algorithm to status Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add max Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use algorithm settings Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove updateSpec Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test Signed-off-by: Ce Gao <gaoce@caicloud.io> 25 September 2019, 09:43:59 UTC
e9e0768 Removing unecessary lines (#803) 25 September 2019, 01:03:59 UTC
5601587 feat: Add NAS RL based algorithm (#793) * feat: Add NAS RL based algorithm Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add tensorflow Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add health check Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add E2E in CI Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Install packages Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix image Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add NAS in suggestion client Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Do not set nasconfig for hp jobs Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix script Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add output Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add for debug Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove -u Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove version Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Comment test Signed-off-by: Ce Gao <gaoce@caicloud.io> 24 September 2019, 11:39:27 UTC
6a5bc1d fix: Remove copy (#802) Signed-off-by: Ce Gao <gaoce@caicloud.io> 24 September 2019, 10:59:26 UTC
bd4480c Adding example trial as the default (#801) 24 September 2019, 10:25:27 UTC
cdd8e32 Removing metric collector templates from UI (#800) 24 September 2019, 09:53:27 UTC
50e7f00 fix: Use commitid (#799) Signed-off-by: Ce Gao <gaoce@caicloud.io> 24 September 2019, 08:35:27 UTC
e7e8e57 Use common metricsCollector struct (#798) * Use common metricsCollector struct * Fix test error 24 September 2019, 02:31:25 UTC
81856da build: Support arguments (#795) Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 September 2019, 08:49:23 UTC
ebb48f8 feat: Rename algorithms (#794) * feat: Rename algorithms Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove prefix Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix algorithms Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 September 2019, 08:09:23 UTC
back to top