# rudall-e **Repository Path**: mirrors/rudall-e ## Basic Information - **Project Name**: rudall-e - **Description**: ruDALL-E 是一个可以根据文本自动生成图像的工具 - **Primary Language**: Python - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: https://www.oschina.net/p/rudall-e - **GVP Project**: No ## Statistics - **Stars**: 1 - **Forks**: 1 - **Created**: 2021-11-12 - **Last Updated**: 2026-02-21 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # ruDALL-E ### Generate images from texts [![Apache license](https://img.shields.io/badge/License-Apache-blue.svg)](https://www.apache.org/licenses/LICENSE-2.0) [![Downloads](https://pepy.tech/badge/rudalle)](https://pepy.tech/project/rudalle) [![Coverage Status](https://codecov.io/gh/sberbank-ai/ru-dalle/branch/master/graphs/badge.svg)](https://codecov.io/gh/sberbank-ai/ru-dalle) [![pipeline](https://gitlab.com/shonenkov/ru-dalle/badges/master/pipeline.svg)](https://gitlab.com/shonenkov/ru-dalle/-/pipelines) [![pre-commit.ci status](https://results.pre-commit.ci/badge/github/sberbank-ai/ru-dalle/master.svg)](https://results.pre-commit.ci/latest/github/sberbank-ai/ru-dalle/master) ``` pip install rudalle==1.1.3 ``` ### 🤗 HF Models: [ruDALL-E Malevich (XL)](https://huggingface.co/sberbank-ai/rudalle-Malevich) \ [ruDALL-E Emojich (XL)](https://huggingface.co/sberbank-ai/rudalle-Emojich) (readme [here](https://github.com/sberbank-ai/ru-dalle/blob/master/Emojich.md)) \ [ruDALL-E Surrealist (XL)](https://huggingface.co/shonenkov-AI/rudalle-xl-surrealist) \ ruDALL-E Kandinsky (XXL) (soon) ### Minimal Example: [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1RztuaPetwz-QGHHpwlE5WAp3J3Ss70_4?usp=sharing) [![Kaggle](https://kaggle.com/static/images/open-in-kaggle.svg)](https://www.kaggle.com/shonenkov/rudalle-example-generation) [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/anton-l/rudall-e) **Example usage ruDALL-E Malevich (XL) with 3.5GB vRAM!** [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1AoolDYePUpPkRCKIu0cP9zV7lX5QGD3Z?usp=sharing) **Finetuning example** [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1Tb7J4PvvegWOybPfUubl5O7m5I24CBg5?usp=sharing) ### generation by ruDALLE: ```python import ruclip from rudalle.pipelines import generate_images, show, super_resolution, cherry_pick_by_ruclip from rudalle import get_rudalle_model, get_tokenizer, get_vae, get_realesrgan from rudalle.utils import seed_everything # prepare models: device = 'cuda' dalle = get_rudalle_model('Malevich', pretrained=True, fp16=True, device=device) tokenizer = get_tokenizer() vae = get_vae(dwt=True).to(device) # pipeline utils: realesrgan = get_realesrgan('x2', device=device) clip, processor = ruclip.load('ruclip-vit-base-patch32-384', device=device) clip_predictor = ruclip.Predictor(clip, processor, device, bs=8) text = 'радуга на фоне ночного города' seed_everything(42) pil_images = [] scores = [] for top_k, top_p, images_num in [ (2048, 0.995, 24), ]: _pil_images, _scores = generate_images(text, tokenizer, dalle, vae, top_k=top_k, images_num=images_num, bs=8, top_p=top_p) pil_images += _pil_images scores += _scores show(pil_images, 6) ``` ![](pics/malevich/rainbow-full.png) ### auto cherry-pick by ruCLIP: ```python top_images, clip_scores = cherry_pick_by_ruclip(pil_images, text, clip_predictor, count=6) show(top_images, 3) ``` ![](pics/malevich/rainbow-cherry-pick.png) ### super resolution: ```python sr_images = super_resolution(top_images, realesrgan) show(sr_images, 3) ``` ![](pics/malevich/rainbow-super-resolution.png) ```python text, seed = 'красивая тян из аниме', 6955 ``` ![](pics/malevich/anime-girl-super-resolution.png) ### Image Prompt see `jupyters/ruDALLE-image-prompts-A100.ipynb` ```python text, seed = 'Храм Василия Блаженного', 42 skyes = [red_sky, sunny_sky, cloudy_sky, night_sky] ``` ![](pics/malevich/russian-temple-image-prompt.png) ### VideoDALL-E | ru[CogVideo](https://github.com/THUDM/CogVideo) by [@cene555](https://github.com/cene555) **Video generation example** [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1A_3Oe9r9DP3Ayd6DPvqKHIKlwNfLhVP5?usp=sharing) **Finetuning example** [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1R_joYWlvToA24tsa9BFYa2D6ffiMtyVy?usp=sharing) ### Aspect ratio images [**-->NEW<--**](https://github.com/shonenkov-AI/rudalle-aspect-ratio) ![](https://raw.githubusercontent.com/shonenkov-AI/rudalle-aspect-ratio/main/pics/h_example.jpg) ### [Kandinsky 12B](https://github.com/ai-forever/ru-dalle/blob/master/jupyters/Kandinsky-12b-A100.ipynb) Request access: [Here](https://docs.google.com/forms/d/e/1FAIpQLSdYCT6LKDWgWGkd0Lq_sMLe2wZDZSkMUuCQx4qdXUnd6SrhvA/viewform) `роботы акварелью в стиле ван гога` ![](./pics/kandinsky/example-robots.png) [![](./pics/habr_eng.svg)](https://habr.com/ru/company/sberbank/blog/671210/) ![](./pics/kandinsky/loss.jpg) `FID = 15.4 (COCO Valid)` ### 🚀 Contributors 🚀 - [@bes](https://github.com/bes-dev) shared [great idea and realization with IDWT](https://github.com/bes-dev/vqvae_dwt_distiller.pytorch) for decoding images with higher quality 512x512! 😈💪 thanks a lot for your constructive advices, appreciate it - [@neverix](https://www.kaggle.com/neverix) thanks a lot for contributing for speed up of inference - [@Igor Pavlov](https://github.com/boomb0om) trained model and prepared code with [super-resolution](https://github.com/boomb0om/Real-ESRGAN-colab) - [@oriBetelgeuse](https://github.com/oriBetelgeuse) thanks a lot for easy API of generation using image prompt - [@Alex Wortega](https://github.com/AlexWortega) created first FREE version colab notebook with fine-tuning [ruDALL-E Malevich (XL)](https://huggingface.co/sberbank-ai/rudalle-Malevich) on sneakers domain 💪 - [@Anton Lozhkov](https://github.com/anton-l) Integrated to [Huggingface Spaces](https://huggingface.co/spaces) with [Gradio](https://github.com/gradio-app/gradio), see [here](https://huggingface.co/spaces/anton-l/rudall-e) ### Supported by [](https://airi.net) ### Social Media [![](./pics/habr_eng.svg)](https://habr.com/ru/company/sberbank/blog/589673/) [![](./pics/habr.svg)](https://habr.com/ru/company/sberdevices/blog/586926/)