Keras 2.0 release notes
This document details changes, in particular API changes, occurring from Keras 1 to Keras 2.
- The `nb_epoch` argument has been renamed `epochs` everywhere.
- The methods `fit_generator`, `evaluate_generator` and `predict_generator` now work by drawing a number of batches from a generator (a number of training steps), rather than a number of samples.
    - `samples_per_epoch` was changed to `steps_per_epoch` in `fit_generator`. It now refers to the number of batches after which an epoch is considered done.
    - `nb_val_samples` was renamed `validation_steps` in `fit_generator`.
    - `val_samples` was renamed `steps` in `evaluate_generator` and `predict_generator`.
- It is now possible to manually add a loss to a model by calling `model.add_loss(loss_tensor)`. The loss is added to the model's other losses and minimized during training.
- It is also possible to not apply any loss to a specific model output. If you pass `None` as the `loss` argument for an output (e.g. in `compile`, `loss={'output_1': None, 'output_2': 'mse'}`), the model will expect no Numpy arrays to be fed for this output when using `fit`, `train_on_batch`, or `fit_generator`. The output values are still returned as usual when using `predict`. See the sketch after this list.
- In TensorFlow, models can now be trained using `fit` if some of their inputs (or even all) are TensorFlow queues or variables, rather than placeholders. See this test for specific examples.
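
A minimal sketch of a few of the training-related changes above (`add_loss`, a `None` loss for one output, and the `epochs` argument); the two-output model below is hypothetical and used only for illustration:

```python
import numpy as np
import keras.backend as K
from keras.layers import Input, Dense
from keras.models import Model

# Hypothetical two-output model, used only to illustrate the new API.
x = Input(shape=(8,))
h = Dense(4, activation='relu')(x)
out1 = Dense(1, name='output_1')(h)
out2 = Dense(1, name='output_2')(h)
model = Model(inputs=x, outputs=[out1, out2])

# Keras 2: manually add an arbitrary loss tensor to the model.
model.add_loss(0.01 * K.sum(K.square(h)))

# Keras 2: no loss for output_1, so no target array is expected for it.
model.compile(optimizer='rmsprop', loss={'output_1': None, 'output_2': 'mse'})

data = np.random.random((32, 8))
targets = np.random.random((32, 1))
model.fit(data, {'output_2': targets}, epochs=2, batch_size=16)  # nb_epoch -> epochs
```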
- The `objectives` module has been renamed `losses`.
- Several legacy metric functions have been removed, namely `matthews_correlation`, `precision`, `recall`, `fbeta_score`, `fmeasure`.
- Custom metric functions can no longer return a dict; they must return a single tensor.
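
A sketch of a custom metric under the new contract (a single tensor, not a dict); the metric name and formula are illustrative only:

```python
import keras.backend as K

# Custom metrics must return a single tensor in Keras 2.
def mean_abs_error(y_true, y_pred):
    return K.mean(K.abs(y_pred - y_true))

# model.compile(optimizer='rmsprop', loss='mse', metrics=[mean_abs_error])
```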
 
- Constructor arguments for `Model` have been renamed:
    - `input` -> `inputs`
    - `output` -> `outputs`
- The `Sequential` model no longer supports the `set_input` method.
- For any model saved with Keras 2.0 or higher, weights trained with backend X will be converted to work with backend Y without any manual conversion step.
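
A minimal sketch of the `Model` constructor rename:

```python
from keras.layers import Input, Dense
from keras.models import Model

inp = Input(shape=(16,))
out = Dense(1)(inp)

# Keras 1: Model(input=inp, output=out)
model = Model(inputs=inp, outputs=out)
```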
 
Deprecated layers `MaxoutDense`, `Highway` and `TimeDistributedDense` have been removed.
- All layers that use the learning phase now support a `training` argument in `call` (a Python boolean or a symbolic tensor), allowing you to specify the learning phase on a layer-by-layer basis. E.g. by calling a `Dropout` instance as `dropout(inputs, training=True)` you obtain a layer that will always apply dropout, regardless of the current global learning phase. The `training` argument defaults to the global Keras learning phase everywhere.
- The `call` method of layers can now take arbitrary keyword arguments. E.g. you can define a custom layer with a call signature like `call(inputs, alpha=0.5)`, and then pass an `alpha` keyword argument when calling the layer (only with the functional API, naturally). See the sketch below.
- `__call__` now makes use of TensorFlow `name_scope`, so that your TensorFlow graphs will look pretty and well-structured in TensorBoard.
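
A minimal sketch of the two call-related changes above; the `Scale` layer and its `alpha` argument are hypothetical, used only for illustration:

```python
from keras.engine.topology import Layer
from keras.layers import Input, Dropout

class Scale(Layer):
    """Hypothetical layer whose call() accepts an extra keyword argument."""
    def call(self, inputs, alpha=0.5):
        return alpha * inputs

x = Input(shape=(10,))
y = Scale()(x, alpha=0.2)            # extra kwargs only work with the functional API
z = Dropout(0.5)(y, training=True)   # always apply dropout, regardless of learning phase
```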
In all layers that previously took a `dim_ordering` argument, `dim_ordering` has been renamed `data_format`. It now takes two values: `"channels_first"` (formerly `"th"`) and `"channels_last"` (formerly `"tf"`).
Changed interface for `Dense` layers:

- `output_dim` -> `units`
- `init` -> `kernel_initializer`
- added `bias_initializer` argument
- `W_regularizer` -> `kernel_regularizer`
- `b_regularizer` -> `bias_regularizer`
- `b_constraint` -> `bias_constraint`
- `bias` -> `use_bias`
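
A minimal sketch of a `Dense` layer written against the new argument names (the specific initializer and regularizer choices are illustrative):

```python
from keras import regularizers
from keras.layers import Dense

# Keras 1 (for comparison): Dense(64, init='glorot_uniform', W_regularizer=l2(0.01), bias=True)
layer = Dense(64,
              kernel_initializer='glorot_uniform',
              bias_initializer='zeros',
              kernel_regularizer=regularizers.l2(0.01),
              use_bias=True)
```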
Changed interface for `Dropout`, `SpatialDropout*D` and `GaussianDropout`:

- `p` -> `rate`
- The `AtrousConvolution1D` and `AtrousConvolution2D` layers have been deprecated. Their functionality is instead supported via the `dilation_rate` argument in the `Convolution1D` and `Convolution2D` layers.
- `Convolution*` layers are renamed `Conv*`.
- The `Deconvolution2D` layer is renamed `Conv2DTranspose`.
- The `Conv2DTranspose` layer no longer requires an `output_shape` argument, making it much easier to use.
Interface changes common to all convolutional layers:
- `nb_filter` -> `filters`
- the separate kernel dimension arguments become a single tuple argument, `kernel_size`. E.g. a legacy call `Conv2D(10, 3, 3)` becomes `Conv2D(10, (3, 3))`
- `kernel_size` can be set to an integer instead of a tuple, e.g. `Conv2D(10, 3)` is equivalent to `Conv2D(10, (3, 3))`.
- `subsample` -> `strides`. Can also be set to an integer.
- `border_mode` -> `padding`
- `init` -> `kernel_initializer`
- added `bias_initializer` argument
- `W_regularizer` -> `kernel_regularizer`
- `b_regularizer` -> `bias_regularizer`
- `b_constraint` -> `bias_constraint`
- `bias` -> `use_bias`
- `dim_ordering` -> `data_format`
- In the `SeparableConv2D` layers, `init` is split into `depthwise_initializer` and `pointwise_initializer`.
- Added `dilation_rate` argument in `Conv2D` and `Conv1D`.
- 1D convolution kernels are now saved as a 3D tensor (instead of 4D as before).
- 2D and 3D convolution kernels are now saved in format `spatial_dims + (input_depth, depth)`, even with `data_format="channels_first"`.
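
A minimal sketch of a legacy convolution call rewritten with the new argument names (values chosen only for illustration):

```python
from keras.layers import Conv2D

# Keras 1: Convolution2D(10, 3, 3, subsample=(2, 2), border_mode='same', dim_ordering='tf')
layer = Conv2D(10, (3, 3),
               strides=(2, 2),
               padding='same',
               data_format='channels_last',
               kernel_initializer='glorot_uniform',
               use_bias=True)
```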
Changed interface for 1D pooling layers (the `->` indicates that the term has been renamed):

- `pool_length` -> `pool_size`
- `stride` -> `strides`
- `border_mode` -> `padding`
Changed interface for 2D and 3D pooling layers:

- `border_mode` -> `padding`
- `dim_ordering` -> `data_format`
The `padding` argument of the `ZeroPadding2D` and `ZeroPadding3D` layers must be a tuple of length 2 and 3 respectively. Each entry `i` specifies by how much to pad the spatial dimension `i`. If an entry is an integer, symmetric padding is applied to that dimension; if it is a tuple of two integers, asymmetric padding is applied. See the sketch below.
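
A minimal sketch of both forms for `ZeroPadding2D` (the padding amounts are arbitrary):

```python
from keras.layers import ZeroPadding2D

# Symmetric: pad each of the two spatial dimensions by 1 on both sides.
sym = ZeroPadding2D(padding=(1, 1))

# Asymmetric: pad dimension 0 by (1 before, 2 after) and dimension 1 by (3, 4).
asym = ZeroPadding2D(padding=((1, 2), (3, 4)))
```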
Changed interface for `UpSampling1D`:

- `length` -> `size`
The `mode` argument of `BatchNormalization` has been removed; BatchNorm now only supports mode 0 (use batch statistics for feature-wise normalization during training, and use moving statistics for feature-wise normalization during testing).
- `beta_init` -> `beta_initializer`
- `gamma_init` -> `gamma_initializer`
- added arguments `center`, `scale` (booleans, whether to use a `beta` and a `gamma` respectively)
- added arguments `moving_mean_initializer`, `moving_variance_initializer`
- added arguments `beta_regularizer`, `gamma_regularizer`
- added arguments `beta_constraint`, `gamma_constraint`
- attribute `running_mean` is renamed `moving_mean`
- attribute `running_std` is renamed `moving_variance` (it is in fact a variance with the current implementation).
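
A minimal sketch of a `BatchNormalization` layer using the new arguments (the values shown are simply the defaults):

```python
from keras.layers import BatchNormalization

# Keras 1: BatchNormalization(mode=0, beta_init='zero', gamma_init='one')
bn = BatchNormalization(center=True,
                        scale=True,
                        beta_initializer='zeros',
                        gamma_initializer='ones',
                        moving_mean_initializer='zeros',
                        moving_variance_initializer='ones')
```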
In `ConvLSTM2D`, the same changes as for convolutional layers and recurrent layers apply.
Changed interface for `PReLU`:

- `init` -> `alpha_initializer`
Changed interface for `GaussianNoise`:

- `sigma` -> `stddev`
Changed interface for recurrent layers (`SimpleRNN`, `GRU`, `LSTM`):

- `output_dim` -> `units`
- `init` -> `kernel_initializer`
- `inner_init` -> `recurrent_initializer`
- added argument `bias_initializer`
- `W_regularizer` -> `kernel_regularizer`
- `b_regularizer` -> `bias_regularizer`
- added arguments `kernel_constraint`, `recurrent_constraint`, `bias_constraint`
- `dropout_W` -> `dropout`
- `dropout_U` -> `recurrent_dropout`
- `consume_less` -> `implementation`. String values have been replaced with integers: implementation 0 (default), 1 or 2.
- LSTM only: the argument `forget_bias_init` has been removed. Instead there is a boolean argument `unit_forget_bias`, defaulting to `True`.
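
A minimal sketch of an `LSTM` layer written with the new argument names (values chosen for illustration):

```python
from keras.layers import LSTM

# Keras 1: LSTM(32, init='glorot_uniform', inner_init='orthogonal',
#               dropout_W=0.2, dropout_U=0.2, consume_less='gpu')
layer = LSTM(32,
             kernel_initializer='glorot_uniform',
             recurrent_initializer='orthogonal',
             dropout=0.2,
             recurrent_dropout=0.2,
             implementation=2,
             unit_forget_bias=True)
```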
The `Lambda` layer now supports a `mask` argument.
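
A sketch of a `Lambda` layer that supplies a `mask`; the layer below is hypothetical, and simply forwards the incoming mask, as an assumption of how the argument can be used:

```python
from keras.layers import Lambda

# Hypothetical Lambda layer that doubles its input and forwards the incoming mask.
layer = Lambda(lambda x: x * 2.0,
               mask=lambda inputs, mask: mask)
```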
Utilities should now be imported from `keras.utils` rather than from specific submodules (e.g. no more `keras.utils.np_utils`...).
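
A minimal sketch of the new import path, using `to_categorical` as an example:

```python
# Keras 1: from keras.utils.np_utils import to_categorical
from keras.utils import to_categorical

labels = to_categorical([0, 2, 1], num_classes=3)
```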
- `std` -> `stddev`
- In the backend, `set_image_dim_ordering` and `image_dim_ordering` are now `set_image_data_format` and `image_data_format`.
- Any arguments (other than `nb_epoch`) prefixed with `nb_` have been renamed to be prefixed with `num_` instead. This affects two datasets and one preprocessing utility.
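
A minimal sketch of the renamed backend helpers (assuming the Keras 2 names `set_image_data_format` and `image_data_format`):

```python
import keras.backend as K

# Keras 1: K.set_image_dim_ordering('tf'); K.image_dim_ordering()
K.set_image_data_format('channels_last')
print(K.image_data_format())  # 'channels_last'
```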