batch_normalization | Batch Normalization Layer for Caffe | Machine Learning library
kandi X-RAY | batch_normalization Summary
Batch Normalization Layer for Caffe
Community Discussions
Trending Discussions on batch_normalization
QUESTION
In the following tutorial, Transfer learning and fine-tuning by TensorFlow, it is explained that when unfreezing a model that contains BatchNormalization (BN) layers, these should be kept in inference mode by passing training=False when calling the base model.
[…]
Important notes about the BatchNormalization layer

Many image models contain BatchNormalization layers. That layer is a special case on every imaginable count. Here are a few things to keep in mind.

- BatchNormalization contains 2 non-trainable weights that get updated during training. These are the variables tracking the mean and variance of the inputs.
- When you set bn_layer.trainable = False, the BatchNormalization layer will run in inference mode, and will not update its mean & variance statistics. This is not the case for other layers in general, as weight trainability & inference/training modes are two orthogonal concepts. But the two are tied in the case of the BatchNormalization layer.
- When you unfreeze a model that contains BatchNormalization layers in order to do fine-tuning, you should keep the BatchNormalization layers in inference mode by passing training=False when calling the base model. Otherwise the updates applied to the non-trainable weights will suddenly destroy what the model has learned.

[…]
In the examples they pass training=False when calling the base model, but later they set base_model.trainable=True, which to my understanding is the opposite of inference mode, because the BN layers will be set to trainable as well.
To my understanding there would have to be 0 trainable_weights and 4 non_trainable_weights for inference mode, which would be identical to setting bn_layer.trainable=False, which they stated would be the case for running the bn_layer in inference mode. However, I checked the numbers of trainable_weights and non_trainable_weights, and they are both 2.
I am confused by the tutorial, how can I really be sure BN layer are in inference mode when doing fine tuning on a model?
Does setting training=False on the model override the behavior of bn_layer.trainable=True? So that even if 2 trainable_weights get listed, these would not get updated during training (fine-tuning)?
Update:
Here I found some further information: BatchNormalization layer, on keras.io.
[...]
About setting layer.trainable = False on a BatchNormalization layer:

The meaning of setting layer.trainable = False is to freeze the layer, i.e. its internal state will not change during training: its trainable weights will not be updated during fit() or train_on_batch(), and its state updates will not be run.

Usually, this does not necessarily mean that the layer is run in inference mode (which is normally controlled by the training argument that can be passed when calling a layer). "Frozen state" and "inference mode" are two separate concepts.

However, in the case of the BatchNormalization layer, setting trainable = False on the layer means that the layer will be subsequently run in inference mode (meaning that it will use the moving mean and the moving variance to normalize the current batch, rather than using the mean and variance of the current batch). This behavior has been introduced in TensorFlow 2.0, in order to enable layer.trainable = False to produce the most commonly expected behavior in the convnet fine-tuning use case.
Note that:
- Setting trainable on a model containing other layers will recursively set the trainable value of all inner layers.
- If the value of the trainable attribute is changed after calling compile() on a model, the new value doesn't take effect for this model until compile() is called again.
Questions:
- In case I want to fine-tune the whole model, and therefore unfreeze it with base_model.trainable = True, would I have to manually set the BN layers to bn_layer.trainable = False in order to keep them in inference mode?
- What happens when calling the base_model with training=False while additionally setting base_model.trainable=True? Do layers like BatchNormalization and Dropout stay in inference mode?
ANSWER
Answered 2022-Feb-06 at 08:59
After reading the documentation and having a look at the source code of TensorFlow's implementations of tf.keras.layers.Layer, tf.keras.layers.Dense, and tf.keras.layers.BatchNormalization, I came to the following understanding.
If training = False is passed when calling the layer or the model/base model, it will run in inference mode. This has nothing to do with the attribute trainable, which means something different. It would probably cause less misunderstanding if the parameter had been called training_mode instead of training. I would have preferred defining it the other way round and calling it inference_mode.
When doing transfer learning or fine-tuning, training = False should be passed when calling the base model itself. As far as I have seen so far, this only affects layers like tf.keras.layers.Dropout and tf.keras.layers.BatchNormalization and has no effect on the other layers.
Running in inference mode via training = False will result in:
- tf.keras.layers.Dropout not applying the dropout rate at all. As tf.keras.layers.Dropout has no trainable weights, setting the attribute trainable = False has no effect on this layer.
- tf.keras.layers.BatchNormalization normalizing its inputs using the mean and variance of its moving statistics learned during training.
The attribute trainable will only activate or deactivate updating the trainable weights of a layer.
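The distinction can be sketched in plain Python. This is a simplified, hypothetical single-channel batch norm without the learned gamma/beta scale and shift, not TensorFlow's implementation: the training flag only selects which statistics are used for normalization.

```python
# Hypothetical pure-Python sketch (not TensorFlow's code) of the point above:
# "training mode" picks which statistics normalize the batch, while the
# separate trainable attribute would only govern gradient updates.

def batch_norm(batch, moving_mean, moving_var, training, eps=1e-5):
    """Normalize a 1-D batch for a single channel (gamma/beta omitted)."""
    if training:
        # Training mode: use the statistics of the current batch.
        mean = sum(batch) / len(batch)
        var = sum((x - mean) ** 2 for x in batch) / len(batch)
    else:
        # Inference mode: use the moving statistics accumulated earlier.
        mean, var = moving_mean, moving_var
    return [(x - mean) / (var + eps) ** 0.5 for x in batch]

batch = [1.0, 2.0, 3.0]

# training=True -> normalized with the batch's own mean/variance
print(batch_norm(batch, moving_mean=0.0, moving_var=1.0, training=True))

# training=False -> normalized with the (here: freshly initialized) moving stats
print(batch_norm(batch, moving_mean=0.0, moving_var=1.0, training=False))
```

With training=True the output is centered around zero regardless of the moving statistics; with training=False the moving statistics fully determine the output, which is exactly why frozen statistics protect a pretrained model during fine-tuning.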
QUESTION
I am writing my own code for False alarm metrics in Keras for Neural networks. The Neural network implemented gives an output of (100,1) dimension, where each output value is between 0 and 1. The batch size is 1000. In the false_alarm1 function, I want to select the top k probabilities and make them 1 and the rest equal to 0 and then calculate the confusion matrix. I received a message that the dimensions do not match. I tried all the solutions available on the internet for a similar problem but nothing seems to work.
Here is the function I implemented:
...ANSWER
Answered 2022-Feb-15 at 14:19
Try using tf.tensor_scatter_nd_update to make the top-k probabilities 1 and the rest 0:
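The idea behind that fix can be illustrated in plain Python with a hypothetical helper (this is the logic only, not the TensorFlow call itself):

```python
# Illustrative sketch of the top-k thresholding the answer describes:
# keep 1 at the indices of the k largest probabilities, 0 everywhere else.

def top_k_to_binary(probs, k):
    """Return a 0/1 list with 1 at the indices of the k largest values."""
    top_idx = set(sorted(range(len(probs)),
                         key=lambda i: probs[i], reverse=True)[:k])
    return [1 if i in top_idx else 0 for i in range(len(probs))]

print(top_k_to_binary([0.1, 0.9, 0.4, 0.8], k=2))  # -> [0, 1, 0, 1]
```

The resulting binary vector can then be compared against the labels to build the confusion matrix.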
QUESTION
I am using CIFAR-10 Dataset to train some MLP models. I want to try data augmentation as the code block below.
...ANSWER
Answered 2022-Jan-10 at 15:15
The input shape of CIFAR-10 is (32, 32, 3), but your model's input isn't taking that shape. You can try the following for your model input.
QUESTION
I'm fairly new to machine learning and I'm having quite a bit of difficulty with this issue. I'm using a Kaggle notebook with TensorFlow version 2.3.1. I need to train a model with face images and predict multiple attributes (wrinkles, freckles, hair colour, hair thickness, glasses), hence a multi-output model. When I try model.fit, on the first run I get the error "No gradients are provided". Running the same code again without any change gives me the error "NoneType object is not callable". I've been stuck here for over a week now and so far no solution on the internet has been able to resolve this issue, so I'm including as much detail as possible here. Some side info about the problem: wrinkles and freckles have values 0 or 1, while the other outputs have values ranging from 0 to 3, 0 to 5, or 0 to 9. Here is the code.
Setting up CNN:
...ANSWER
Answered 2021-Aug-16 at 15:38
Apparently the tutorial I was following was inaccurate. Since my NN has 5 branches, as it needs to make 5 predictions, my model.fit call needed to be changed, and each NN branch needed to be mapped to its corresponding labels.
code with error:
QUESTION
I am following the Tensorflow tutorial https://www.tensorflow.org/guide/migrate. Here is an example:
...ANSWER
Answered 2021-Jun-06 at 07:54
Why does batch_normalization produce all-zero output when training = True?

It's because your batch size is 1 here. The batch normalization layer normalizes its input using the batch mean and batch standard deviation for each channel. When the batch size is 1 and the input has been flattened, there is only a single value in each channel, so the batch mean (for that channel) is that value itself, and the layer therefore outputs a zero tensor.

But why does it produce non-zero output when training = False?

During inference, the batch normalization layer normalizes inputs using the moving averages of the batch mean and SD instead of the current batch's mean and SD. The moving mean and SD are initialized to zero and one respectively and updated gradually, so at the beginning the moving mean does not equal the single value in each channel, and the layer does not output a zero tensor.

In conclusion: use a batch size > 1 and an input tensor with random or realistic data values, rather than tf.ones(), in which all elements are the same.
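The batch-size-1 effect is easy to reproduce in plain Python with a bare-bones normalization (a sketch of the math only, not TensorFlow's layer):

```python
# Demonstrates the point above: with batch size 1, the batch mean equals
# the single value itself, so normalization yields exactly zero.

def normalize(batch, eps=1e-3):
    """Normalize one channel by its batch mean and standard deviation."""
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)
    return [(x - mean) / (var + eps) ** 0.5 for x in batch]

print(normalize([5.0]))        # batch of one -> [0.0]
print(normalize([1.0, 3.0]))   # batch of two -> non-zero, symmetric outputs
```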
QUESTION
I am trying to write some sample code for a GAN; here comes the generator. I want to see the visualized model, but this is not the model. Is Model.summary() not a function of TensorFlow but of Keras? If so, how can I see the visualized model?
ANSWER
Answered 2021-Jun-03 at 10:47
One possible solution (or idea) is to wrap your TensorFlow operation in a Lambda layer and use it to build the Keras model. Something like:
QUESTION
The following links contain the necessary files; add them to your "My Drive". https://drive.google.com/drive/folders/1epROVNfvO10Ksy8CwJQdamSK96JZnWW9?usp=sharing Google Colab link with a minimal example: https://colab.research.google.com/drive/18sMqNn8IpTQLZBlInWSbX0ITd2GWDDkz?usp=sharing
This basic block 'module', if you will, is part of a larger network. It all boils down to this block, however, as this is where the convolutions are performed (in this case depthwise separable convolution). The network is seemingly able to train; however, during training (and throughout all epochs) a WARNING is thrown:
...ANSWER
Answered 2021-May-15 at 17:12
Solved by reworking the network and putting all the layers one after the other, rather than having multiple instances of a model, so that everything from beginning to end is in one single subclassed model.
QUESTION
I am following this tutorial to create a Keras-based autoencoder, but using my own data. That dataset includes about 20k training and about 4k validation images. All of them are very similar, all show the very same object. I haven't modified the Keras model layout from the tutorial, only changed the input size, since I used 300x300 images. So my model looks like this:
...ANSWER
Answered 2021-Apr-05 at 15:32
It could be that the decay_rate argument in tf.keras.optimizers.schedules.ExponentialDecay is decaying your learning rate more quickly than you think, effectively making your learning rate zero.
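Assuming the standard decay formula lr = initial_lr * decay_rate ** (step / decay_steps), a quick plain-Python check (with illustrative numbers, not the asker's settings) shows how fast an aggressive decay_rate collapses the learning rate:

```python
# Illustrative schedule: with decay_rate=0.1 and decay_steps=100, the
# learning rate loses an order of magnitude every 100 steps.

def exponential_decay(initial_lr, decay_rate, decay_steps, step):
    return initial_lr * decay_rate ** (step / decay_steps)

for step in (0, 100, 1000, 10000):
    lr = exponential_decay(1e-3, decay_rate=0.1, decay_steps=100, step=step)
    print(step, lr)  # by step 1000 the rate is already ~1e-13, effectively zero
```

Printing the optimizer's current learning rate during training is a cheap way to confirm whether this is the problem.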
QUESTION
I am reading this tutorial in order to create my own autoencoder based on Keras. I followed the tutorial step by step, the only difference is that I want to train the model using my own images data set. So I changed/added the following code:
...ANSWER
Answered 2021-Mar-30 at 15:25
Use class_mode="input" in flow_from_directory so that the returned Y will be the same as X.

class_mode: One of "categorical", "binary", "sparse", "input", or None. Default: "categorical". Determines the type of label arrays that are returned:
- "categorical" will be 2D one-hot encoded labels,
- "binary" will be 1D binary labels,
- "sparse" will be 1D integer labels,
- "input" will be images identical to input images (mainly used to work with autoencoders),
- if None, no labels are returned (the generator will only yield batches of image data, which is useful with model.predict()). Please note that in case of class_mode None, the data still needs to reside in a subdirectory of directory for it to work correctly.
Code should end up like:
QUESTION
The validation accuracy of my 1D CNN is stuck on 0.5 and that's because I'm always getting the same prediction out of a balanced data set. At the same time my training accuracy keeps increasing and the loss decreasing as intended.
Strangely, if I run model.evaluate() on my training set (which had accuracy close to 1 in the last epoch), the accuracy is also 0.5. How can the accuracy here differ so much from the training accuracy of the last epoch? I've also tried with a batch size of 1 for both training and evaluating, and the problem persists.
Well, I've been searching for different solutions for quite some time but still no luck. Possible problems I've already looked into:
- My data set is properly balanced and shuffled;
- My labels are correct;
- Tried adding fully connected layers;
- Tried adding/removing dropout from the fully connected layers;
- Tried the same architecture, but with the last layer with 1 neuron and sigmoid activation;
- Tried changing the learning rates (went down to 0.0001 but still the same problem).
Here's my code:
...ANSWER
Answered 2021-Jan-19 at 09:40
... also tried with sigmoid but the issue persists ...
You don't want to be "trying" out activation functions or loss functions for a well-defined problem statement. It seems you are mixing up a single-label multi-class and a multi-label multi-class architecture.
Your output is a 2-class multi-class output with softmax activation, which is great, but you use binary_crossentropy, which only makes sense in a multi-class setting for multi-label problems. You want to use categorical_crossentropy instead. Furthermore, I would have suggested focal loss if there were class imbalance, but it seems you have a 50/50 class proportion, so that's not necessary.
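For intuition, categorical cross-entropy for a single-label, two-class problem can be computed by hand (illustrative numbers only):

```python
import math

# Categorical cross-entropy for a one-hot label against softmax outputs:
# only the probability assigned to the true class contributes to the loss.

def categorical_crossentropy(y_true, y_pred):
    """y_true: one-hot label list, y_pred: softmax probability list."""
    return -sum(t * math.log(p) for t, p in zip(y_true, y_pred) if t > 0)

print(categorical_crossentropy([0, 1], [0.2, 0.8]))  # confident and correct: low loss
print(categorical_crossentropy([0, 1], [0.8, 0.2]))  # confident and wrong: high loss
```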
Remember, accuracy is decided based on which loss is being used! Check the different classes here. When you use binary_crossentropy, the accuracy metric used is BinaryAccuracy, while with categorical_crossentropy it uses CategoricalAccuracy. Check this chart for details on what to use in what type of problem statement.
Other than that, there is a bottleneck in your network between flatten() and Dense(): the number of trainable parameters there is quite high relative to the other layers. I would advise adding another CNN layer to bring the number of filters down to, say, 128 and make the sequence even shorter, and reducing the number of neurons in that Dense layer as well.
98.9% (3,277,000/3,311,354) of all of your trainable parameters reside between the Flatten and Dense layer! Not a great architectural choice!
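As a back-of-the-envelope check (the shapes below are hypothetical, not taken from the asker's model), a Dense layer after Flatten holds flatten_size * units + units trainable parameters, which dominates a network quickly:

```python
# Parameter count of a fully connected layer: one weight per
# (input, unit) pair plus one bias per unit.

def dense_params(flatten_size, units):
    return flatten_size * units + units  # weights + biases

total = dense_params(12800, 256)
print(total)  # 3277056 parameters for a 12800-wide flatten into 256 units
```

Shrinking either the flattened size (via more convolution/pooling) or the unit count cuts this product directly, which is exactly the advice above.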
Outside the above points, the model results are totally dependent on your data itself. I wouldn't be able to help more without knowledge of the data.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network