Quick start#

1. Installation#

Install tfts, follow the installation instructions first

Python 3.7+
TensorFlow 2.x installation instructions

Now you are ready, proceed with

$ pip install tfts

You can run it in docker, download the Dockerfile to host server

$ docker build -f ./Dockerfile -t "custom image name" .
$ docker run --rm -it --init --ipc=host --network=host --volume=$PWD:/app -e NVIDIA_VISIBLE_DEVICES=0 "custom image name" /bin/bash

2. Basic Usage#

The general setup for training and testing a model is

Build time series 3D training dataset and valid dataset. The shape of input and label are (examples, train_sequence_length, features) and (examples, predict_sequence_length, 1)
Instantiate a model using the AutoModel method
Create a Trainer() or KerasTrainer() object. Define the optimizer and loss function in trainer
Train the model on the training dataset and check if it has converged with acceptable accuracy
Tune the hyper-parameters of the model and training, manually or refer to tuning example
Load the model from the model checkpoint and apply it to new data

import tensorflow as tf
import tfts
from tfts import AutoConfig, AutoModel, KerasTrainer

train_length = 36
predict_sequence_length = 12
train, valid = tfts.get_data('sine', train_length, predict_sequence_length)

# build model: 'seq2seq', 'wavenet', 'transformer', 'rnn', 'tcn', 'bert'
model_name_or_path = 'seq2seq'
config = AutoConfig.for_model(model_name_or_path)
model = AutoModel.from_config(config, predict_sequence_length=predict_sequence_length)

# train
opt = tf.keras.optimizers.Adam(0.001)
loss_fn = tf.keras.losses.MeanSquaredError()
trainer = KerasTrainer(model, loss_fn=loss_fn, optimizer=opt)
trainer.train(train, valid, epochs=30, batch_size=32)

# test
trainer.predict(valid[0])

3. Train your first model#

3.1 Prepare the data#

The tfts could accept any time series data of 3D data format as model input: (num_examples, train_sequence_length, num_features), and the model supported by tfts outputs 3D data as model output: (num_examples, predict_sequence_length, num_outputs)

Before training, ensure your raw data is preprocessed into a 3D format with the shape (batch_size, train_steps, features). Perform any necessary data cleaning, normalization, or transformation steps to ensure the data is ready for training.

3.2 Train the Model#

When training the model, use appropriate loss functions, optimizers, and hyperparameters to achieve the best results.

Run with strategy to support multi-gpu or tpu training

from tfts import KerasTrainer

config = AutoConfig.for_model(model_name_or_path)
model = AutoModel.from_config(config, predict_sequence_length=predict_sequence_length)
optimizer = {
    'class_name': 'adam',
    'config': {'learning_rate': 0.0005}
}

strategy = tf.distribute.MirroredStrategy()
trainer = KerasTrainer(model, strategy=strategy)
trainer.train(train_gen, valid_gen, optimizer=optimizer, epochs=30)

Run with Learning rate scheduler

opt = tf.keras.optimizers.Adam(0.001)
loss_fn = tf.keras.losses.MeanSquaredError()
lr_scheduler = tf.keras.optimizers.schedules.CosineDecay(
    initial_learning_rate=0.001,
    decay_steps=1000,
)
trainer = KerasTrainer(model)
trainer.train(train_dataset, valid_dataset, optimizer=opt, loss_fn=loss_fn, lr_scheduler=lr_scheduler)

Run with pretrained weights

model = AutoModel.from_config(config, predict_sequence_length=predict_sequence_length)
model.save_pretrained("tfts-model")

model = AutoModel.from_pretrained("tfts-model")

3.3 Save and load the model#

3.4 Serve the model#

Once the model is trained and evaluated, deploy it for inference. Ensure the model is saved in a format compatible with your serving environment (e.g., TensorFlow SavedModel, ONNX, etc.). Set up an API or service to handle incoming requests, preprocess input data, and return predictions in real-time.

Save the model into protobuf file