欢迎加入我们~
Transformer implemented in Keras
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
RAdam implemented in Keras & TensorFlow
Attention mechanism for processing sequential data that considers the context for each timestamp.