This example demonstrates how to use the LPOT built-in dataloader and metric to enable quantization without any coding effort.
- Download the FP32 model:

  ```shell
  wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_6/mobilenet_v1_1.0_224_frozen.pb
  ```
- Update the root of dataset in conf.yaml.

  The configuration below creates an ImageNet dataloader that resizes each image to 224x224 with bilinear resampling, and a TopK (k=1) metric for evaluation; a short illustration of what that metric computes follows the configuration.
  ```yaml
  quantization:                            # optional. tuning constraints on model-wise for advanced users to reduce tuning space.
    calibration:
      sampling_size: 20                    # optional. default value is 100. used to set how many samples should be used in calibration.
      dataloader:
        dataset:
          ImageRecord:
            root: <DATASET>/TF_imagenet/val/   # NOTE: modify to calibration dataset location if needed
        transform:
          BilinearImagenet:
            height: 224
            width: 224
    model_wise:                            # optional. tuning constraints on model-wise for advanced users to reduce tuning space.
      activation:
        algorithm: minmax
      weight:
        granularity: per_channel

  evaluation:                              # optional. required if user doesn't provide eval_func in lpot.Quantization.
    accuracy:                              # optional. required if user doesn't provide eval_func in lpot.Quantization.
      metric:
        topk: 1                            # built-in metrics are topk, map, f1; users can also register new metrics.
      dataloader:
        batch_size: 32
        dataset:
          ImageRecord:
            root: <DATASET>/TF_imagenet/val/   # NOTE: modify to evaluation dataset location if needed
        transform:
          BilinearImagenet:
            height: 224
            width: 224
  ```
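  For intuition, the built-in topk metric with `topk: 1` just reports the fraction of validation samples whose highest-scoring class matches the label. The sketch below is plain NumPy for illustration only; it is not LPOT's implementation.

  ```python
  import numpy as np

  def topk_accuracy(logits, labels, k=1):
      """Fraction of samples whose true label is among the k highest scores."""
      topk_preds = np.argsort(logits, axis=1)[:, -k:]   # indices of the k largest scores per sample
      hits = [label in row for label, row in zip(labels, topk_preds)]
      return float(np.mean(hits))

  # Three samples over four classes; the first two are predicted correctly.
  logits = np.array([[0.1, 0.7, 0.1, 0.1],
                     [0.6, 0.2, 0.1, 0.1],
                     [0.3, 0.1, 0.4, 0.2]])
  labels = [1, 0, 3]
  print(topk_accuracy(logits, labels, k=1))   # -> 0.666...
  ```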
- Run quantization.

  We only need to add the following lines to create an int8 model:
  ```python
  from lpot.experimental import Quantization, common

  quantizer = Quantization('./conf.yaml')
  quantizer.model = common.Model("./mobilenet_v1_1.0_224_frozen.pb")
  quantized_model = quantizer()
  ```
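  As the conf.yaml comments note, the `evaluation` section is only required when no `eval_func` is supplied in code. Below is a minimal sketch of that code-driven alternative; it assumes your LPOT version exposes an `eval_func` attribute on `Quantization` and a `save()` method on the returned model, so verify both against your installation, and note that `my_eval_func` and the output path are placeholders.

  ```python
  from lpot.experimental import Quantization, common

  def my_eval_func(model):
      # Run your own validation loop on `model` and return one accuracy value.
      # A constant is used here only to keep the sketch self-contained.
      return 0.71

  quantizer = Quantization('./conf.yaml')
  quantizer.model = common.Model("./mobilenet_v1_1.0_224_frozen.pb")
  quantizer.eval_func = my_eval_func               # used instead of the yaml evaluation section
  quantized_model = quantizer()
  quantized_model.save('./int8_mobilenet_v1.pb')   # assumed API and output path
  ```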
- Run quantization and evaluation:
  ```shell
  python test.py
  ```
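  After the run finishes, you can sanity-check that the produced graph actually contains quantized kernels. The snippet below is an illustrative check with plain TensorFlow; the `./int8_mobilenet_v1.pb` path is an assumption, so point it at wherever you saved the int8 graph.

  ```python
  import tensorflow as tf

  # Load the frozen int8 graph (TensorFlow 1.x-style GraphDef).
  graph_def = tf.compat.v1.GraphDef()
  with open('./int8_mobilenet_v1.pb', 'rb') as f:   # assumed output path
      graph_def.ParseFromString(f.read())

  # Ops such as QuantizeV2, QuantizedConv2D..., and Requantize indicate int8 execution.
  quantized_ops = [node.op for node in graph_def.node if 'Quantize' in node.op]
  print('total ops:', len(graph_def.node))
  print('quantized ops:', len(quantized_ops))
  ```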