https://github.com/kubeflow/katib

sort by:
Revision Author Date Message Commit Date
47d4d8f Add npm build to the UI Dockerfile (#665) * Remove build from the Repo * npm build in dockerfile * Remove no-cache * Change size 28 June 2019, 01:33:21 UTC
c81818d MetricController: Run only a single job per task (#660) This changes the `spec.concurrencyPolicy` of the metric collector cron-job from "Allow" (default) to "Forbid". The cronjob used to create a new job even if the previous job had not succeeded. On high-load clusters this could lead to a high number of jobs which never finished. This fixed #659 27 June 2019, 05:39:19 UTC
702703b Build images for nasrl training container (#669) * Add NASRL training container build image * Add build for v1alpha1 21 June 2019, 03:30:36 UTC
a21c14f Add delete experiment (#654) 19 June 2019, 03:00:32 UTC
1344dc2 Change add template (#656) 19 June 2019, 01:30:32 UTC
855f75c Select objectiveType from the list (#653) 18 June 2019, 19:17:50 UTC
c81692c Add e2e test to presubmit (#652) * Adding grid e2e test to presubmit * Adding extra checks 18 June 2019, 06:44:16 UTC
ae10864 fix: Do not use webhook in UT (#657) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 June 2019, 08:20:08 UTC
0cb1597 Enhancing katib client apis (#650) * Enhacning katib client apis * Delete unnecessary file 14 June 2019, 10:46:21 UTC
8970cdf Wrong mock file name (#651) 14 June 2019, 10:10:21 UTC
69d097e UI: Show only succeeded Trials (#646) * Show only succeeded trials * Create build 13 June 2019, 05:58:13 UTC
14dad8b v1alpha2 hyperband suggestion service validation (#648) 12 June 2019, 18:20:17 UTC
0d456ae refactor: Remove requests check for most test cases (#626) * refactor: Remove requests check for most test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Set timeout for apiserver Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix experiment test Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> 12 June 2019, 16:02:23 UTC
ff27b55 feat(experiment): Delete dup trials (#647) * feat(experiment): Delete dup trials Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Sort before delete Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use the result in the memory Signed-off-by: Ce Gao <gaoce@caicloud.io> 12 June 2019, 09:07:29 UTC
2625d43 Add bayesianoptimization algorithm in selectlist (#645) 11 June 2019, 23:31:29 UTC
56192e1 Fix v1alpha1 hyperband algorithm mismatch (#634) * Fix v1alpha1 hyperband algorithm mismatch * Fix test error 08 June 2019, 05:31:07 UTC
da6dae1 hyperband suggestion service (#631) 08 June 2019, 05:07:06 UTC
2ef2bc8 Upgrade Job operators to v1 (#635) * Upgrade tfjob/pytorchjob apis to v1 * Remove unnecessary files 07 June 2019, 00:48:00 UTC
2d059a4 Fix sql syntax for UpdateAlgorithmExtraSettings (#633) 05 June 2019, 08:17:52 UTC
77ae12d Write entries to extra settings table during create (#630) 05 June 2019, 07:39:54 UTC
b0e0dd5 Adding cascading delete of pods when jobs are deleted (#632) 04 June 2019, 18:13:55 UTC
03fb85e Add tests for grid suggestion (#628) 04 June 2019, 08:51:09 UTC
e70d56b Fixing tag (#627) 04 June 2019, 05:08:59 UTC
32d3401 Training Container for NAS RL Suggestion in v1alpha2 (#614) * Add training container in v1alpha2 * Modify runTrial 04 June 2019, 03:22:59 UTC
cb25807 Implementing v1alpha2 grid search suggestion algorithm (#622) * Implementing v1alpha2 grid search algorithm * Fix indendation * Build grid image 03 June 2019, 17:43:33 UTC
0f6fdeb feat: Support bayesianoptimization in v1alpha2 (#595) * feat: Support bayesianoptimization Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Suport bo Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Resolve conflicts Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix runtime error Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix format Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix component name Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix errors Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix a runtime error Signed-off-by: Ce Gao <gaoce@caicloud.io> 03 June 2019, 11:06:14 UTC
0c50fb4 NAS RL Suggestion for v1alpha2 (#613) * Init commit * 50% Suggestion is done * Suggestion is done * Remove logs * Remove temp file * Fix errors * Fix ValidateAlgorithmReply * Move NASRL suggestion deployment * Build image for NASRL suggestion * Fix building image * Remove unused import 03 June 2019, 06:20:15 UTC
c2a1995 Fix problems in the UI for v1alpha2 (#623) * Fix AlgorithmName in Params Submit * Fix TrialPath in Submit by Params * Fix name in InputSizes and OutputSizes Change initial NumLayers 03 June 2019, 02:40:13 UTC
2e07fe1 Updated help message for golint. (#621) This is related to https://github.com/kubeflow/kfserving/issues/135 02 June 2019, 08:38:12 UTC
d9e727d Add experiment to Scheme (#620) 02 June 2019, 03:46:15 UTC
48889cb Merge pull request #616 from johnugeorge/metricfix Set trial completion status only after metric collection 31 May 2019, 23:21:50 UTC
8056907 go unit tests from presubmits (#618) * Remove go unit tests from presubmits * Minor fix 31 May 2019, 10:46:25 UTC
df9f22c Adding owner for cronjob watch 31 May 2019, 09:42:43 UTC
4e342cc Set trial completion status only after metric collection 31 May 2019, 09:42:43 UTC
7a2ffe1 Skip creating trials if add count is zero (#617) 31 May 2019, 09:24:25 UTC
1fdca87 Fix nasrl example in v1alpha2 (#609) 30 May 2019, 03:42:18 UTC
6a484f6 Enabled make check in travis. (#608) * Enabled make check in travis. * Upgrade to go 1.12.5 for travis. 29 May 2019, 05:54:18 UTC
8d77fa5 fix make check (#606) 29 May 2019, 03:16:21 UTC
b9b179a Fine-grained docker image build. (#605) 29 May 2019, 02:16:21 UTC
2bc89ed Moving folders (#602) 28 May 2019, 23:20:22 UTC
31104a2 Fixing latest tag (#603) 28 May 2019, 22:57:48 UTC
3d4712d Minor changes (#601) 28 May 2019, 11:43:52 UTC
99a4359 Mini fix for v1alpha1 metricsCollector (#600) 28 May 2019, 11:01:54 UTC
4f678e2 Check error in OpenSQLConnection (#588) * Check error in openSQLconn * Add logic in v1alpha1 * Change to Errorf 28 May 2019, 10:23:53 UTC
c2b6f9e Fix issue of hyperband suggestion service cannot move on (#596) 28 May 2019, 09:35:56 UTC
eafd7f7 doc: Update readme (#593) * doc: Update readme Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add title Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove vizier Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add a note in NASRL Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove vizier in Job Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Replace links Signed-off-by: Ce Gao <gaoce@caicloud.io> 28 May 2019, 05:57:51 UTC
86b0721 Reverse logic of Less in hyperband v1alpha1 (#592) 28 May 2019, 04:53:51 UTC
c3478af Mini fix for getExperimentConf (#594) 28 May 2019, 04:27:52 UTC
c3faf0c feat: Add UI in manifests v1alpha2 (#591) * feat: Add UI Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add name Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Rename Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix ui Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support UI Signed-off-by: Ce Gao <gaoce@caicloud.io> 28 May 2019, 03:51:51 UTC
daac957 feat: Support flags in UI (#590) Signed-off-by: Ce Gao <gaoce@caicloud.io> 27 May 2019, 10:39:51 UTC
c2b20e5 Default make target to v1alpha2. (#585) 27 May 2019, 03:11:49 UTC
1e663f9 Change undeploy script (#587) * Move delete for pv under db * Change script in v1alpha1 25 May 2019, 07:21:27 UTC
0d8f13d Added undeploy for katib. (#579) 24 May 2019, 06:50:20 UTC
5c67c0d feat(trial): Add more failure test cases (#570) * WIP Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 May 2019, 23:20:18 UTC
644fc99 Add categories for katib CRDs (#576) 23 May 2019, 08:14:26 UTC
d4d87ca Add Validate Algorithm Settings in v1alpha2 (#574) * Add Validate Algorithm Settings * Integrate ValidateAlgorithmSettings in ManagerClient * Run dep ensure 23 May 2019, 07:26:26 UTC
200c59d Updated makefile by adding more targets for developer. (#575) * Updated Makefile for go tools. * Run make depend. * Run make update. * Fixed go vet. * Updated development guide. 23 May 2019, 06:46:25 UTC
ef2ac5b feat(experiment): Add more test cases (#563) * feat: Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove debug Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix deletion Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 May 2019, 05:30:24 UTC
42dc4c8 refactor: Use manager client to get log for test (#569) * refactor: Use manager client to get log for test Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add a log Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 May 2019, 03:30:28 UTC
206108a Adding go tools scripts - part 1 (#573) * Added hack scripts for katib. * Run ./hack/update-gofmt.sh. 23 May 2019, 02:54:26 UTC
cc5f367 Retain for job and metricsCollector (#572) 23 May 2019, 02:32:25 UTC
9df08fd Fix finalizer cannot work (#571) 23 May 2019, 01:52:24 UTC
17dbca3 Implement GetExperimentInDB (#558) * Implement GetExperimentInDB * Parse ErrNoRows error * Fix pod ready condition in test script * Add PreCheckRegisterExperiment 22 May 2019, 03:36:04 UTC
73d940d refactor: Unify the interface (#568) Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 May 2019, 10:38:40 UTC
26231dd Implement trial observation metrics (#564) * Implement trial observation * Fix test * Remove unnecessary condition 21 May 2019, 07:56:44 UTC
87e3a53 fix: Check if the deletion time is nil or zero (#567) Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 May 2019, 03:02:39 UTC
07b1343 feat(experiment-validator): Inject client (#561) * feat: Inject client to generator in validator Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: mockgen Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 May 2019, 01:58:42 UTC
d2c64a7 Change path for yaml file and NAS training containers (#566) * Change path for yaml file and NAS training containers * Remove empty lines 21 May 2019, 01:24:39 UTC
4c378bc Added metric name to GetObservationLogRequest (#559) * Adding metric name to GetObservationLogRequest * regenerate mockdb 20 May 2019, 12:11:14 UTC
0a98288 Reducing the name length (#562) 20 May 2019, 09:41:13 UTC
65290ca chore: Add code coverage test (#560) * chore: Add travis and code coverage * README: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 09:37:14 UTC
22ac009 feat(experiment): Add test cases (#556) * fix: Remove useless code Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat(experiment): Abstract manager client Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add basic test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 07:23:13 UTC
f71f0e8 chore: Refactor nasenvelopenet (#492) * chore: Refactor NAS Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Move requirements Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use v1alpha1 dockerfile Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix dep problem Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Show info in runtests Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Show yaml Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix path bug Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove info in script Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 07:01:13 UTC
bc343ff feat(trial): Refactor trial controller and add basic test cases (#528) * feat(trial): Add test * vendor: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat(trial): Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Add mockgen and test crd Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update test Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix export path Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Introduce the dependency Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add status handler Signed-off-by: Ce Gao <gaoce@caicloud.io> * Fix: Use 1.0.7 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix v1alpha1 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update deploy scripts Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Rebase master Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:09:11 UTC
0712177 Add status for experiment and trial in additionalPrinterColumns (#555) 20 May 2019, 01:41:11 UTC
c8ae73e Fix default metricsController wrong args (#550) * Fix default metricsController wrong args * Add command for metricsConllector deployment 19 May 2019, 23:41:09 UTC
32c1bd2 Add NAS RL yaml example (#554) 18 May 2019, 03:32:07 UTC
f5babef Fix trial requestNumber error (#553) 17 May 2019, 21:40:13 UTC
bf3aeff Adding tests for random suggestion algorithm (#552) 17 May 2019, 15:14:12 UTC
215e3d7 Adding minor styles changes (#548) 17 May 2019, 12:10:04 UTC
ee37ba7 Fix nil pointer error when create trial (#547) 17 May 2019, 10:44:05 UTC
ad8fe2b Used klog for katib. (#526) 17 May 2019, 10:16:05 UTC
d22f8e6 Implement GetSuggestions for general suggestion (#546) 17 May 2019, 09:46:06 UTC
330c239 fix: Fix the conflicts (#545) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 May 2019, 09:12:04 UTC
c6f34ea Earlystopping changes trigger CI based on version (#544) * Ui changes trigger CI based on version * Make earlystop version in prow config 17 May 2019, 08:34:08 UTC
8a4b6e3 Adding manifests for manager rest (#542) * Adding manifests for manager rest * Fix to add new image * Exclude katib-manager-rest in grep 17 May 2019, 08:28:10 UTC
4ec45bb Katib UI for v1alpha2 (#486) * Init commit * Half of backend edit * Backend for v1alpha2 UI is done * Fix gopkg.toml * fix gopkg lock * Almost submit HP job by params done * Modify gitignore * Add by parameters for HP and NAS done * HP and NAS jobs functionality done * Template is done * Add new filters in job list Create a new build with frontend * Fix newlines * Review changes * Add tests for katib-ui * Delete pkg * Add pkg * Fix backend 17 May 2019, 07:12:04 UTC
bd39447 Enable suggestion-random image build and push in CI (#543) 17 May 2019, 06:50:06 UTC
41152dd fix(status): Define status handler (#518) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 May 2019, 06:00:07 UTC
b2d7bbf Include suggestion manager based on version in prow (#541) 17 May 2019, 05:30:07 UTC
a0aab3a Adding random suggestion algorithm implementation and manifests (#540) * Adding random algorithm implementation and manifests * Removing test file 17 May 2019, 05:22:06 UTC
04b0d91 fix: Add test cases for validator and manifest generator (#508) * dep: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * refactor: Simplify to test Signed-off-by: Ce Gao <gaoce@caicloud.io> * config: Add Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add more test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix the rebase bug Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * experiment: Fix Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Install kubebuilder Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix unit test Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 May 2019, 03:17:27 UTC
928c66a Update trial status DB operation (#537) 17 May 2019, 02:57:26 UTC
eb7235e Add labels for TFjob and PytorchJob (#539) 17 May 2019, 02:29:26 UTC
9b9f6dd v1alpha2 TFEvent metrics collector (#538) * TFEvent collector * TFjob example * TFjob example * Build metrics collector images on pre and postsubmits 17 May 2019, 00:56:27 UTC
a8086c0 Register Trial in DB (#530) * Register Trial in DB * Fix errors * Change Spec and Status for Trial and Experiment * Fix unit test * Fix condition in Register Experiment and Trial * Fix ut in manager * Fix Status in Experiment Config * Fix Experiment Spec in py test * Add trial status * Fix ut with trial status 16 May 2019, 23:14:28 UTC
287e503 Restructuring docker files to build images per version (#527) * Restructuring docker files to build image per version * binary renamed * Fixing prow include directory paths * binary rename * fixing names * Fix paths * Fixing python paths in v1alpha1 * Adding manifests change * test path fix 16 May 2019, 17:19:07 UTC
e4891e4 Dep ensure to sync up vendor. (#535) * Enable remove un-used package. * Run dep ensure. 16 May 2019, 11:05:46 UTC
1f90657 fix: Avoid side effect (#532) * fix: Avoid side effect Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove Signed-off-by: Ce Gao <gaoce@caicloud.io> 16 May 2019, 10:25:41 UTC
cb6113e Include vendor dir instead of Gopkg in prow config (#536) 16 May 2019, 06:21:41 UTC
back to top