Revision e1467a79dc6580ae009d827b5e6f274faff3b339 authored by liqunfu on 27 March 2020, 21:42:04 UTC, committed by GitHub on 27 March 2020, 21:42:04 UTC
2 parents c7bc93f + a2055f6
This directory contains various scripts that support CNTK.

## CNTK Binary Installers

The directory `install` contains scripts that are used in the CNTK binary download to install
CNTK on a user's system. They are NOT intended to be run from this location in the repository.

* `install/windows` - A script for installing a Windows CNTK *binary* drop, cf. [here](https://docs.microsoft.com/en-us/cognitive-toolkit/Setup-Windows-Binary-Script).
* `install/linux` - A script for installing a Linux CNTK *binary* drop, cf. [here](https://docs.microsoft.com/en-us/cognitive-toolkit/Setup-Linux-Binary-Script).

## CNTK Text Format Converters

Two Python scripts for converting data to CNTK Text format, for use as input to the [CNTK Text Format Reader](https://docs.microsoft.com/en-us/cognitive-toolkit/BrainScript-CNTKTextFormat-Reader).

### Convert Dictionary to Text

`txt2ctf.py` converts a set of dictionary files and a plain text file to CNTK Text format.

Run `python txt2ctf.py -h` to see usage instructions. See the comments at the beginning of the script file for a specific usage example.
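
As a rough illustration of the kind of mapping involved (not the script's actual implementation), the sketch below turns a token dictionary into indices and emits one sparse one-hot sample per token. The function names, the stream name `S0`, and the assumption that a dictionary holds one token per line with its zero-based line number as the index are all illustrative; the script's own comments remain the authority on the real input and output layout.

```python
def load_dictionary(lines):
    """Map each token to its zero-based line number (assumed dictionary layout)."""
    return {token.strip(): index for index, token in enumerate(lines)}


def sequence_to_ctf(sequence_id, text, dictionary, stream="S0"):
    """Return one CTF line per token, using sparse 'index:1' notation.

    `stream` is a hypothetical input name chosen for this sketch.
    """
    return [
        f"{sequence_id} |{stream} {dictionary[token]}:1"
        for token in text.split()
    ]


vocab = load_dictionary(["the", "cat", "sat"])
for row in sequence_to_ctf(0, "the cat sat", vocab):
    print(row)
```

Each printed line carries the sequence id followed by a single sparse sample, so all tokens of one input line share the same id.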

### Convert UCI Format to Text

`uci2ctf.py` converts data stored in a text file in UCI format to CNTK Text format.

Run `python uci2ctf.py -h` to see usage instructions and an example.

For example:

```
python Scripts/uci2ctf.py --input_file Examples/Image/MNIST/Data/Train-28x28.txt --features_start 1 --features_dim 784 --labels_start 0 --labels_dim 1 --num_labels 10 --output_file Examples/Image/MNIST/Data/Train-28x28_cntk_text.txt
```

* `input_file` - original dataset in the (columnar) UCI format
* `features_start` - index of the first feature column (start parameter in the UCIFastReader config, see [here](https://docs.microsoft.com/en-us/cognitive-toolkit/BrainScript-UCI-Fast-Reader))
* `features_dim` - number of feature columns (dim parameter in the UCIFastReader config)
* `labels_start` - index of the first label column
* `labels_dim` - number of label columns
* `num_labels` - number of possible label values (labelDim parameter in the UCIFastReader config)
* `output_file` - path and filename of the resulting dataset
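
To make the roles of these parameters concrete, here is a minimal sketch of converting a single columnar row. It is not the code in `uci2ctf.py`: the function name `uci_row_to_ctf` is hypothetical, the stream names `labels` and `features` are assumptions, and the sketch assumes a single integer label column (`labels_dim` of 1) that is expanded to a one-hot vector of length `num_labels`.

```python
def uci_row_to_ctf(row, features_start, features_dim, labels_start, num_labels):
    """Convert one whitespace-separated UCI row to a CNTK Text format line.

    Assumes exactly one label column holding an integer class index.
    """
    columns = row.split()
    # Slice out the feature columns, keeping their textual form unchanged.
    features = columns[features_start:features_start + features_dim]
    # Expand the integer class into a one-hot vector of length num_labels.
    label = int(columns[labels_start])
    one_hot = ["1" if i == label else "0" for i in range(num_labels)]
    return "|labels " + " ".join(one_hot) + " |features " + " ".join(features)


print(uci_row_to_ctf("2 0.5 0.25 0.75",
                     features_start=1, features_dim=3,
                     labels_start=0, num_labels=4))
# → |labels 0 0 1 0 |features 0.5 0.25 0.75
```

With the MNIST command above, the same idea would apply with column 0 as the label, the next 784 columns as features, and 10 possible label values.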
