The collection of pre-trained, state-of-the-art AI models.
About ailia SDK
ailia SDK is a cross-platform, high-speed inference SDK for AI. It supports Windows, Mac, Linux, iOS, Android, Jetson, and Raspberry Pi with GPU acceleration via Vulkan and Metal. Bindings are available for C++, Python, Unity (C#), Kotlin, Rust, and Flutter.
Why ailia SDK
| ailia SDK | ONNX Runtime | |
|---|---|---|
| GPU inference via Vulkan and Metal | ✓ | − |
| ailia Speech / Voice / LLM / Tokenizer / Tracker | ✓ | − |
| 400+ verified model library with sample code | ✓ | − |
| Non-OS / RTOS inference support | ✓ | − |
| Unity bindings and model collection | ✓ | △ |
| Model‑specific optimization | ✓ | △ |
△ = Supported but limited due to general-purpose implementation.
How to use
Try now on Google Colaboratory
If you would like to try on your computer:
Documentation
Supported models
403 models as of March 30, 2026
Latest update
-
2026.03.30 Add auto mode for segment-anything-v2
-
2026.03.27 Add gpt-sovits-v2pro-int4, whisper-int4, sensevoice-int4 (currently required ORT)
-
2026.03.23 Add qwen2vl-int4 (currently required ORT)
-
2026.03.12 Add depth_anything_v3, depth_pro
-
2026.03.06 Add depth_anything_v2
-
2026.03.04 Add gpt-sovits-v2-pro, bevformer, uniad
-
2026.03.02 Add g2pw, gpt-sovits-v1, v2, v3 (chinese)
-
2026.01.16 Add embeddinggemma
-
2025.12.30 Add demucs, latentsync
-
2025.12.26 Add sadtalker
-
2025.12.25 Add samurai, cotracker3 (ailia SDK 1.6.1)
-
2025.12.21 Add silerovad v5, v6, v6_2
-
2025.12.17 Add sensevoice, cosyvoice2
-
2025.12.01 Add glass, mobilevlm, donut
-
More information in our Wiki
Action recognition
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | va-cnn | View Adaptive Neural Networks (VA) for Skeleton-based Human Action Recognition | Pytorch | 1.2.7 and later | Mar 2017 | |
![]() | st-gcn | ST-GCN | Pytorch | 1.2.5 and later | Jan 2018 | EN JP |
![]() | mars | MARS: Motion-Augmented RGB Stream for Action Recognition | Pytorch | 1.2.4 and later | Nov 2018 | EN JP |
![]() | ax_action_recognition | Realtime-Action-Recognition | Pytorch | 1.2.7 and later | Mar 2019 | |
![]() | driver-action-recognition-adas | driver-action-recognition-adas-0002 | OpenVINO | 1.2.5 and later | Mar 2019 | |
![]() | action_clip | ActionCLIP | Pytorch | 1.2.7 and later | Sep 2021 |
Anomaly detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | mahalanobisad | MahalanobisAD-pytorch | Pytorch | 1.2.9 and later | May 2020 | |
![]() | spade-pytorch | Sub-Image Anomaly Detection with Deep Pyramid Correspondences | Pytorch | 1.2.6 and later | May 2020 | |
![]() | padim | PaDiM-Anomaly-Detection-Localization-master | Pytorch | 1.2.6 and later | Nov 2020 | EN JP |
![]() | patchcore | PatchCore_anomaly_detection | Pytorch | 1.2.6 and later | Jun 2021 | |
![]() | glass | A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization | Pytorch | 1.2.14 and later | Jul 2024 |
Audio Language Model
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| qwen_audio | Qwen-Audio | Pytorch | 1.5.0 and later | Nov 2023 | JP |
Audio processing
Audio classification
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| crnn_audio_classification | crnn-audio-classification | Pytorch | 1.2.5 and later | Mar 2019 | EN JP |
| audioset_tagging_cnn | PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition | Pytorch | 1.2.9 and later | Dec 2019 | |
| transformer-cnn-emotion-recognition | Combining Spatial and Temporal Feature Representions of Speech Emotion by Parallelizing CNNs and Transformer-Encoders | Pytorch | 1.2.5 and later | Oct 2020 | |
| microsoft clap | CLAP | Pytorch | 1.2.11 and later | Jun 2022 | |
| clap | CLAP | Pytorch | 1.2.6 and later | Nov 2022 | JP |
Music enhancement
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| hifigan | HiFi-GAN | Pytorch | 1.2.9 and later | Oct 2020 | |
| deep music enhancer | On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks | Pytorch | 1.2.6 and later | Nov 2020 |
Music generation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| pytorch_wavenet | pytorch_wavenet | Pytorch | 1.2.14 and later | Sep 2016 |
Noise reduction
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| rnnoise | rnnoise | Keras | 1.2.15 and later | Sep 2017 | |
| voicefilter | VoiceFilter | Pytorch | 1.2.7 and later | Oct 2018 | EN JP |
| unet_source_separation | source_separation | Pytorch | 1.2.6 and later | Jul 2019 | EN JP |
| demucs | Demucs | Pytorch | 1.4.0 and later | Sep 2019 | |
| dtln | Dual-signal Transformation LSTM Network | Tensorflow | 1.3.0 and later | May 2020 | |
| audiosep | AudioSep | Pytorch | 1.3.0 and later | Aug 2023 |
Phoneme alignment
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| narabas | narabas: Japanese phoneme forced alignment tool | Pytorch | 1.2.11 and later | Mar 2023 |
Pitch detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| crepe | torchcrepe | Pytorch | 1.2.10 and later | Feb 2018 | JP |
Speaker diarization
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| pyannote-audio | Pyannote-audio | Pytorch | 1.2.15 and later | Nov 2019 | JP |
| auto_speech | AutoSpeech: Neural Architecture Search for Speaker Recognition | Pytorch | 1.2.5 and later | May 2020 | EN JP |
| wespeaker | WeSpeaker | Onnxruntime | 1.2.9 and later | Oct 2022 |
Speech to text
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| deepspeech2 | deepspeech.pytorch | Pytorch | 1.2.2 and later | Oct 2017 | EN JP |
| whisper | Whisper | Pytorch | 1.2.10 and later | Dec 2022 | JP |
| reazon_speech | ReazonSpeech | Pytorch | 1.4.0 and later | Jan 2023 | |
| distil-whisper | Hugging Face - Distil-Whisper | Pytorch | 1.2.16 and later | Nov 2023 | |
| sensevoice | SenseVoice | Pytorch | 1.2.13 and later | July 2024 | JP |
| reazon_speech2 | ReazonSpeech2 | Pytorch | 1.4.0 and later | Feb 2024 | |
| kotoba-whisper | kotoba-whisper | Pytorch | 1.2.16 and later | Apr 2024 |
Text to speech
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| pytorch-dc-tts | Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention | Pytorch | 1.2.6 and later | Oct 2017 | EN JP |
| tacotron2 | Tacotron2 | Pytorch | 1.2.15 and later | Feb 2018 | JP |
| vall-e-x | VALL-E-X | Pytorch | 1.2.15 and later | Mar 2023 | JP |
| Bert-VITS2 | Bert-VITS2 | Pytorch | 1.2.16 and later | Aug 2023 | |
| gpt-sovits | GPT-SoVITS | Pytorch | 1.4.0 and later | Feb 2024 | JP |
| gpt-sovits-v2 | GPT-SoVITS | Pytorch | 1.4.0 and later | Aug 2024 | |
| cosyvoice2 | CosyVoice2 | Pytorch | 1.4.0 and later | Dec 2024 | |
| gpt-sovits-v3 | GPT-SoVITS | Pytorch | 1.4.0 and later | Feb 2025 | |
| gpt-sovits-v2-pro | GPT-SoVITS | Pytorch | 1.4.0 and later | Jun 2025 | JP |
Voice activity detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| silero-vad | Silero VAD | Pytorch | 1.2.15 and later | Dec 2020 | JP |
Voice conversion
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| rvc | Retrieval-based-Voice-Conversion-WebUI | Pytorch | 1.2.12 and later | Mar 2023 | JP |
Autonomous driving
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bevformer | BEVFormer | Pytorch | 1.6.1 and later | Mar 2022 | JP |
| uniad | UniAD: Unified Driving | Pytorch | 1.6.1 and later | Dec 2022 | JP |
Background removal
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | deep-image-matting | Deep Image Matting | Keras | 1.2.3 and later | Mar 2017 | EN JP |
![]() | indexnet | Indices Matter: Learning to Index for Deep Image Matting | Pytorch | 1.2.7 and later | Aug 2019 | |
![]() | U-2-Net | U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | Pytorch | 1.2.2 and later | May 2020 | EN JP |
![]() | u2net-portrait-matting | U^2-Net - Portrait matting | Pytorch | 1.2.7 and later | May 2020 | |
![]() | u2net-human-seg | U^2-Net - human segmentation | Pytorch | 1.2.4 and later | May 2020 | |
![]() | cascade_psp | CascadePSP | Pytorch | 1.2.9 and later | May 2020 | |
![]() | rembg | Rembg | Pytorch | 1.2.4 and later | Aug 2020 | |
![]() | gfm | Bridging Composite and Real: Towards End-to-end Deep Image Matting | Pytorch | 1.2.10 and later | Oct 2020 | |
![]() | modnet | MODNet: Trimap-Free Portrait Matting in Real Time | Pytorch | 1.2.7 and later | Nov 2020 | |
![]() | background_matting_v2 | Real-Time High-Resolution Background Matting | Pytorch | 1.2.9 and later | Dec 2020 | |
![]() | dis_seg | Highly Accurate Dichotomous Image Segmentation | Pytorch | 1.2.10 and later | Mar 2022 |
Crowd counting
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | crowdcount-cascaded-mtl | CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting (Single Image Crowd Counting) | Pytorch | 1.2.1 and later | Jul 2017 | EN JP |
![]() | c-3-framework | Crowd Counting Code Framework(C^3-Framework) | Pytorch | 1.2.5 and later | Jul 2019 |
Deep fashion
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | fashionai-key-points-detection | A Pytorch Implementation of Cascaded Pyramid Network for FashionAI Key Points Detection | Pytorch | 1.2.5 and later | Jun 2018 | |
![]() | person-attributes-recognition-crossroad | person-attributes-recognition-crossroad-0230 | Pytorch | 1.2.10 and later | Oct 2018 | |
![]() | clothing-detection | Clothing-Detection | Pytorch | 1.2.1 and later | Jun 2019 | EN JP |
![]() | mmfashion | MMFashion | Pytorch | 1.2.5 and later | Nov 2019 | EN JP |
![]() | mmfashion_tryon | MMFashion virtual try-on | Pytorch | 1.2.8 and later | Nov 2019 | |
![]() | mmfashion_retrieval | MMFashion In-Shop Clothes Retrieval | Pytorch | 1.2.5 and later | Nov 2019 |
Depth estimation
Diffusion
Text to image
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | latent-diffusion-txt2img | Latent Diffusion - txt2img | Pytorch | 1.2.10 and later | Dec 2021 | |
![]() | stable-diffusion-txt2img | Stable Diffusion | Pytorch | 1.2.14 and later | Aug 2022 | JP |
![]() | anything_v3 | Linaqruf/anything-v3.0 | Pytorch | 1.5.0 and later | Nov 2022 | |
![]() | control_net | ControlNet | Pytorch | 1.2.15 and later | Feb 2023 | |
![]() | latent-consistency-models | latent-consistency-models | Pytorch | 1.2.16 and later | Oct 2023 | |
![]() | sd-turbo | Hugging Face - SD-Turbo | Pytorch | 1.2.16 and later | Nov 2023 | |
![]() | sdxl-turbo | Hugging Face - SDXL-Turbo | Pytorch | 1.2.16 and later | Nov 2023 | |
![]() | depth_anything_controlnet | DepthAnything | Pytorch | 1.2.16 and later | Jan 2024 | |
![]() | latentsync | LatentSync | Pytorch | 1.4.0 and later | Dec 2024 |
Text to audio
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | riffusion | Riffusion | Pytorch | 1.2.16 and later | Dec 2022 |
Others
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | latent-diffusion-inpainting | Latent Diffusion - inpainting | Pytorch | 1.2.10 and later | Dec 2021 | |
![]() | latent-diffusion-superresolution | Latent Diffusion - Super-resolution | Pytorch | 1.2.10 and later | Dec 2021 | |
![]() | DA-CLIP | DA-CLIP | Pytorch | 1.2.16 and later | Oct 2023 | |
![]() | marigold | Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Pytorch | 1.2.16 and later | Dec 2023 |
Face detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | mtcnn | mtcnn | Keras | 1.2.10 and later | Apr 2016 | |
![]() | yolov1-face | YOLO-Face-detection | Darknet | 1.1.0 and later | Mar 2017 | |
![]() | face-detection-adas | face-detection-adas-0001 | OpenVINO | 1.2.5 and later | Oct 2018 | |
![]() | retinaface | RetinaFace: Single-stage Dense Face Localisation in the Wild. | Pytorch | 1.2.5 and later | May 2019 | JP |
![]() | blazeface | BlazeFace-PyTorch | Pytorch | 1.2.1 and later | Jul 2019 | EN JP |
![]() | yolov3-face | Face detection using keras-yolov3 | Keras | 1.2.1 and later | Dec 2019 | |
![]() | face-mask-detection | Face detection using keras-yolov3 | Keras | 1.2.1 and later | Dec 2019 | EN JP |
![]() | dbface | DBFace : real-time, single-stage detector for face detection, with faster speed and higher accuracy | Pytorch | 1.2.2 and later | Mar 2020 | |
![]() | anime-face-detector | Anime Face Detector | Pytorch | 1.2.6 and later | Oct 2021 |
Face identification
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | facenet_pytorch | Face Recognition Using Pytorch | Pytorch | 1.2.6 and later | Mar 2015 | |
![]() | insightface | InsightFace: 2D and 3D Face Analysis Project | Pytorch | 1.2.5 and later | Sep 2017 | |
![]() | vggface2 | VGGFace2 Dataset for Face Recognition | Caffe | 1.1.0 and later | Oct 2017 | |
![]() | arcface | pytorch implement of arcface | Pytorch | 1.2.1 and later | Jan 2018 | EN JP |
![]() | cosface | Pytorch implementation of CosFace | Pytorch | 1.2.10 and later | Jan 2018 |
Face recognition
Age gender estimation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | face_classification | Real-time face detection and emotion/gender classification | Keras | 1.1.0 and later | Oct 2017 | |
![]() | age-gender-recognition-retail | age-gender-recognition-retail-0013 | OpenVINO | 1.2.5 and later | May 2018 | EN JP |
![]() | mivolo | MiVOLO: Multi-input Transformer for Age and Gender Estimation | Pytorch | 1.2.13 and later | Jul 2023 | JP |
Emotion recognition
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | ferplus | FER+ | CNTK | 1.2.2 and later | Aug 2016 | |
![]() | hsemotion | HSEmotion (High-Speed face Emotion recognition) library | Pytorch | 1.2.5 and later | Mar 2021 |
Gaze estimation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | gazeml | A deep learning framework based on Tensorflow for the training of high performance gaze estimation | TensorFlow | 1.2.0 and later | May 2018 | |
![]() | mediapipe_iris | irislandmarks.pytorch | Pytorch | 1.2.2 and later | Jun 2020 | EN JP |
![]() | gazelle | gazelle | Pytorch | 1.2.16 and later | Dec 2024 | JP |
![]() | ax_gaze_estimation | ax Gaze Estimation | Pytorch | 1.2.2 and later | EN JP |
Head pose estimation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | hopenet | deep-head-pose | Pytorch | 1.2.2 and later | Oct 2017 | EN JP |
![]() | 6d_repnet | 6D Rotation Representation for Unconstrained Head Pose Estimation (Pytorch) | Pytorch | 1.2.6 and later | Feb 2022 | |
![]() | L2CS_Net | L2CS_Net | Pytorch | 1.2.9 and later | Mar 2022 | |
![]() | 6d_repnet_360 | Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation | Pytorch | 1.2.9 and later | Sep 2023 |
Keypoint detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | face_alignment | 2D and 3D Face alignment library build using pytorch | Pytorch | 1.2.1 and later | Mar 2017 | EN JP |
![]() | prnet | Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | TensorFlow | 1.2.2 and later | Mar 2018 | |
![]() | facemesh | facemesh.pytorch | Pytorch | 1.2.2 and later | Jul 2019 | EN JP |
![]() | facial_feature | kaggle-facial-keypoints | Pytorch | 1.2.0 and later | Oct 2019 | |
![]() | 3ddfa | Towards Fast, Accurate and Stable 3D Dense Face Alignment | Pytorch | 1.2.10 and later | Sep 2020 | |
![]() | facemesh_v2 | MediaPipe Face landmark detection | Pytorch | 1.2.9 and later | May 2023 | JP |
Others
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | face-anti-spoofing | Lightweight Face Anti Spoofing | Pytorch | 1.2.5 and later | Jul 2020 | EN JP |
![]() | ax_facial_features | ax Facial Features | Pytorch | 1.2.5 and later | EN |
Face restoration
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | gfpgan | GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior | Pytorch | 1.2.10 and later | Jan 2021 | JP |
![]() | codeformer | CodeFormer: Towards Robust Blind Face Restoration with Codebook Lookup Transformer | Pytorch | 1.2.9 and later | Jun 2022 |
Face swapping
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | deepfacelive | DeepFaceLive | ONNX Runtime | 1.2.10 and later | Dec 2020 | |
![]() | sber-swap | SberSwap | Pytorch | 1.2.12 and later | Feb 2022 | JP |
![]() | facefusion | FaceFusion | ONNX Runtime | 1.2.10 and later | Aug 2023 |
Frame Interpolation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | cain | Channel Attention Is All You Need for Video Frame Interpolation | Pytorch | 1.2.5 and later | Nov 2019 | |
![]() | rife | Real-Time Intermediate Flow Estimation for Video Frame Interpolation | Pytorch | 1.2.13 and later | Nov 2020 | |
![]() | flavr | FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation | Pytorch | 1.2.7 and later | Dec 2020 | EN JP |
![]() | film | FILM: Frame Interpolation for Large Motion | Tensorflow | 1.2.10 and later | Feb 2022 |
Generative adversarial networks
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | pytorch-gan | Code repo for the Pytorch GAN Zoo project (used to train this model) | Pytorch | 1.2.4 and later | Oct 2017 | |
![]() | lipgan | LipGAN | Keras | 1.2.15 and later | Oct 2019 | JP |
![]() | council-gan | Council-GAN | Pytorch | 1.2.4 and later | Nov 2019 | |
![]() | sam | Age Transformation Using a Style-Based Regression Model | Pytorch | 1.2.9 and later | Feb 2021 | |
![]() | encoder4editing | Designing an Encoder for StyleGAN Image Manipulation | Pytorch | 1.2.10 and later | Feb 2021 | |
![]() | restyle-encoder | ReStyle | Pytorch | 1.2.9 and later | Apr 2021 | |
![]() | SadTalker | SadTalker | Pytorch | 1.5.0 and later | Nov 2022 | |
![]() | live_portrait | LivePortrait | Pytorch | 1.5.0 and later | Jul 2024 | JP |
Hand detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | hand_detection_pytorch | hand-detection.PyTorch | Pytorch | 1.2.2 and later | Mar 2019 | |
![]() | yolov3-hand | Hand detection branch of Face detection using keras-yolov3 | Keras | 1.2.1 and later | Dec 2019 | |
![]() | blazepalm | MediaPipePyTorch | Pytorch | 1.2.5 and later | Jun 2020 |
Hand recognition
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | hand3d | ColorHandPose3D network | TensorFlow | 1.2.5 and later | May 2017 | |
![]() | v2v-posenet | V2V-PoseNet | Pytorch | 1.2.6 and later | Nov 2017 | |
![]() | minimal-hand | Minimal Hand | TensorFlow | 1.2.8 and later | Mar 2020 | |
![]() | blazehand | MediaPipePyTorch | Pytorch | 1.2.5 and later | Jun 2020 | EN JP |
![]() | hands_segmentation_pytorch | hands-segmentation-pytorch | Pytorch | 1.2.10 and later | Apr 2021 |
Image captioning
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | illustration2vec | Illustration2Vec | Caffe | 1.2.2 and later | Nov 2015 | |
![]() | image_captioning_pytorch | Image Captioning pytorch | Pytorch | 1.2.5 and later | Dec 2016 | EN JP |
![]() | blip2 | Hugging Face - BLIP-2 | Pytorch | 1.2.16 and later | Jan 2023 |
Image classification
CNN
Transformer
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | vit | Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) | Pytorch | 1.2.7 and later | Oct 2020 | EN JP |
![]() | clip | CLIP | Pytorch | 1.2.9 and later | Feb 2021 | EN JP |
![]() | swin-transformer | Swin Transformer | Pytorch | 1.2.6 and later | Mar 2021 | |
![]() | japanese-clip | Japanese-CLIP | Pytorch | 1.2.15 and later | May 2022 | |
![]() | japanese-stable-clip-vit-l-16 | japanese-stable-clip-vit-l-16 | Pytorch | 1.2.11 and later | Nov 2023 | |
![]() | clip-japanese-base | line-corporation/clip-japanese-base | Pytorch | 1.2.16 and later | Apr 2024 | |
![]() | siglip2 | Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | Pytorch | 1.2.16 and later | Feb 2025 | JP |
Specific task
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | weather-prediction-from-image | Weather Prediction From Image - (Warmth Of Image) | Keras | 1.2.5 and later | Oct 2017 | |
| partialconv | Partial Convolution Layer for Padding and Image Inpainting | Pytorch | 1.2.0 and later | Nov 2018 |
Image inpainting
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | inpainting-with-partial-conv | pytorch-inpainting-with-partial-conv | PyTorch | 1.2.6 and later | Apr 2018 | EN JP |
![]() | deepfillv2 | Free-Form Image Inpainting with Gated Convolution | Pytorch | 1.2.9 and later | Jun 2018 | |
![]() | inpainting_gmcnn | Image Inpainting via Generative Multi-column Convolutional Neural Networks | TensorFlow | 1.2.6 and later | Oct 2018 | |
![]() | 3d-photo-inpainting | 3D Photography using Context-aware Layered Depth Inpainting | Pytorch | 1.2.7 and later | Apr 2020 | |
![]() | lama | LaMa: Resolution-robust Large Mask Inpainting with Fourier Convolutions | Pytorch | 1.2.13 and later | Sep 2021 |
Image manipulation
Image restoration
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | nafnet | NAFNet: Nonlinear Activation Free Network for Image Restoration | Pytorch | 1.2.10 and later | Mar 2022 | JP |
Image segmentation
Landmark classification
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | places365 | Release of Places365-CNNs | Pytorch | 1.2.5 and later | Oct 2016 | |
![]() | landmarks_classifier_asia | Landmarks classifier_asia_V1.1 | TensorFlow Hub | 1.2.4 and later | Apr 2020 | EN JP |
Line segment detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | dexined | DexiNed: Dense Extreme Inception Network for Edge Detection | Pytorch | 1.2.5 and later | Sep 2019 | |
![]() | mlsd | M-LSD: Towards Light-weight and Real-time Line Segment Detection | TensorFlow | 1.2.8 and later | Jun 2021 | EN JP |
Low Light Image Enhancement
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | agllnet | AGLLNet: Attention Guided Low-light Image Enhancement (IJCV 2021) | Pytorch | 1.2.9 and later | Aug 2019 | EN JP |
![]() | drbn_skf | DRBN SKF | Pytorch | 1.2.14 and later | Apr 2023 |
Natural language processing
Bert
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert | pytorch-pretrained-bert | Pytorch | 1.2.2 and later | Oct 2018 | EN JP |
| bert_maskedlm | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| bert_question_answering | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 |
Embedding
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| sentence_transformers_japanese | sentence transformers | Pytorch | 1.2.7 and later | Aug 2019 | JP |
| multilingual-e5 | multilingual-e5-base | Pytorch | 1.2.15 and later | Dec 2022 | JP |
| glucose | GLuCoSE (General Luke-based Contrastive Sentence Embedding)-base-Japanese | Pytorch | 1.2.15 and later | Jul 2023 | |
| ruri-v3 | ruri-v3-310m | Pytorch | 1.2.13 and later | Apr 2025 | |
| embeddinggemma | EmbeddingGemma | Pytorch | 1.2.14 and later | Sep 2025 | JP |
Error corrector
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert_insert_punctuation | bert-japanese | Pytorch | 1.2.15 and later | Nov 2019 | |
| bertjsc | bertjsc | Pytorch | 1.2.15 and later | Mar 2023 | |
| t5_whisper_medical | error correction of medical terms using t5 | Pytorch | 1.2.13 and later |
Grapheme to phoneme
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| g2p_en | g2p_en | Pytorch | 1.2.14 and later | Jan 2019 | JP |
| g2pw | g2pW | Pytorch | 1.2.9 and later | Mar 2022 | |
| soundchoice-g2p | Hugging Face - speechbrain/soundchoice-g2p | Pytorch | 1.2.16 and later | Jul 2022 |
Named entity recognition
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert_ner | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| t5_base_japanese_ner | t5-japanese | Pytorch | 1.2.13 and later | Mar 2021 | |
| bert_ner_japanese | jurabi/bert-ner-japanese | Pytorch | 1.2.10 and later | Mar 2023 |
Reranker
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| cross_encoder_mmarco | jeffwan/mmarco-mMiniLMv2-L12-H384-v | Pytorch | 1.2.10 and later | Sep 2022 | JP |
| japanese-reranker-cross-encoder | hotchpotch/japanese-reranker-cross-encoder-large-v1 | Pytorch | 1.2.16 and later | Apr 2024 |
Sentence generation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| gpt2 | GPT-2 | Pytorch | 1.2.7 and later | Feb 2019 | |
| rinna_gpt2 | japanese-pretrained-models | Pytorch | 1.2.7 and later | Apr 2021 |
Sentiment analysis
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert_sentiment_analysis | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| bert_tweets_sentiment | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 |
Summarize
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert_sum_ext | BERTSUMEXT | Pytorch | 1.2.7 and later | May 2019 | |
| presumm | PreSumm | Pytorch | 1.2.8 and later | Aug 2019 | |
| t5_base_japanese_title_generation | t5-japanese | Pytorch | 1.2.13 and later | Mar 2021 | JP |
| t5_base_summarization | t5-japanese | Pytorch | 1.2.13 and later | Mar 2021 |
Translation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| fugumt-en-ja | Fugu-Machine Translator | Pytorch | 1.2.9 and later | Nov 2020 | JP |
| fugumt-ja-en | Fugu-Machine Translator | Pytorch | 1.2.10 abd later | Nov 2020 |
Zero shot classification
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert_zero_shot_classification | huggingface/transformers | Pytorch | 1.2.5 and later | Oct 2018 | |
| multilingual-minilmv2 | MoritzLaurer/multilingual-MiniLMv2-L12-mnli-xnli | Pytorch | 1.2.10 and later | Jun 2022 |
Network intrusion detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| bert-network-packet-flow-header-payload | bert-network-packet-flow-header-payload | Pytorch | 1.2.10 and later | Sep 2023 | |
| falcon-adapter-network-packet | falcon-adapter-network-packet | Pytorch | 1.2.10 and later | Sep 2023 |
Neural Rendering
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | nerf | NeRF: Neural Radiance Fields | Tensorflow | 1.2.10 and later | Mar 2020 | EN JP |
![]() | TripoSR | TripoSR | Pytorch | 1.2.6 and later | Mar 2024 |
NSFW detector
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| clip-based-nsfw-detector | CLIP-based-NSFW-Detector | Keras | 1.2.10 and later | Mar 2022 | JP |
Object detection
CNN
Transformer
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | glip | GLIP | Pytorch | 1.2.13 and later | Dec 2021 | |
![]() | dab-detr | DAB-DETR | Pytorch | 1.2.12 and later | Jan 2022 | |
![]() | detic | Detecting Twenty-thousand Classes using Image-level Supervision | Pytorch | 1.2.10 and later | Jan 2022 | EN JP |
![]() | groundingdino | Grounding DINO | Pytorch | 1.2.16 and later | Mar 2023 | JP |
![]() | rt-detr-v2 | RT-DETR | Pytorch | 1.2.13 and later | Jul 2024 | JP |
Specific target
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | traffic-sign-detection | Traffic Sign Detection | Tensorflow | 1.2.10 and later | Aug 2018 | EN JP |
![]() | sku110k-densedet | SKU110K-DenseDet | Pytorch | 1.2.9 and later | Apr 2019 | EN JP |
![]() | footandball | FootAndBall: Integrated player and ball detector | Pytorch | 1.2.0 and later | Dec 2019 | |
![]() | qrcode_wechatqrcode | qrcode_wechatqrcode | Caffe | 1.2.15 and later | Mar 2021 | |
![]() | mobile_object_localizer | mobile_object_localizer_v1 | TensorFlow Hub | 1.2.6 and later | Jun 2021 | EN JP |
![]() | layout_parsing | unstructured-inference | Pytorch | 1.2.9 and later | Dec 2022 |
Object detection 3d
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | 3d_bbox | 3D Bounding Box Estimation Using Deep Learning and Geometry | Pytorch | 1.2.6 and later | Dec 2016 | |
![]() | d4lcn | D4LCN | Pytorch | 1.2.9 and later | Dec 2019 | |
![]() | egonet | EgoNet | Pytorch | 1.2.9 and later | Nov 2020 | |
![]() | mediapipe_objectron | MediaPipe Objectron | TensorFlow Lite | 1.2.5 and later | Dec 2020 | |
![]() | 3d-object-detection.pytorch | 3d-object-detection.pytorch | Pytorch | 1.2.8 and later | Feb 2021 | EN JP |
![]() | did_m3d | DID M3D | Pytorch | 1.2.11 and later | Jul 2022 |
Object tracking
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
| deepsort | Deep Sort with PyTorch | Pytorch | 1.2.3 and later | Mar 2017 | EN JP | |
| person_reid_baseline_pytorch | UTS-Person-reID-Practical | Pytorch | 1.2.6 and later | Mar 2019 | ||
| abd_net | Attentive but Diverse Person Re-Identification | Pytorch | 1.2.7 and later | Aug 2019 | ||
| deepsort_vehicle | Multi-Camera Live Object Tracking | Pytorch | 1.2.9 and later | May 2020 | ||
| qd-3dt | Monocular Quasi-Dense 3D Object Tracking | Pytorch | 1.2.11 and later | Mar 2021 | ||
| centroids-reid | On the Unreasonable Effectiveness of Centroids in Image Retrieval | Pytorch | 1.2.9 and later | Apr 2021 | ||
| siam-mot | SiamMOT | Pytorch | 1.2.9 and later | May 2021 | ||
| bytetrack | ByteTrack | Pytorch | 1.2.5 and later | Oct 2021 | EN JP | |
| strong_sort | StrongSORT | Pytorch | 1.2.15 and later | Feb 2022 | ||
| samurai | SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory | Pytorch | 1.6.1 and later | Nov 2024 |
Optical Flow Estimation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | raft | RAFT: Recurrent All Pairs Field Transforms for Optical Flow | Pytorch | 1.2.6 and later | Mar 2020 | EN JP |
| cotracker3 | CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | Pytorch | 1.6.1 and later | Oct 2024 |
Point segmentation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | pointnet_pytorch | PointNet.pytorch | Pytorch | 1.2.6 and later | Dec 2016 |
Pose estimation
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | openpose | Code repo for realtime multi-person pose estimation in CVPR'17 (Oral) | Caffe | 1.2.1 and later | Nov 2016 | |
![]() | posenet | PoseNet Pytorch | Pytorch | 1.2.10 and later | Jan 2017 | |
![]() | pose_resnet | Simple Baselines for Human Pose Estimation and Tracking | Pytorch | 1.2.1 and later | Apr 2018 | EN JP |
![]() | lightweight-human-pose-estimation | Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper. | Pytorch | 1.2.1 and later | Nov 2018 | EN JP |
![]() | animalpose | MMPose - 2D animal pose estimation | Pytorch | 1.2.7 and later | Aug 2019 | EN JP |
![]() | efficientpose | Code repo for EfficientPose | TensorFlow | 1.2.6 and later | Apr 2020 | |
![]() | blazepose | MediaPipePyTorch | Pytorch | 1.2.5 and later | Jun 2020 | |
![]() | mediapipe_holistic | MediaPipe Holistic | TensorFlow | 1.2.9 and later | Dec 2020 | |
![]() | movenet | Code repo for movenet | TensorFlow | 1.2.8 and later | May 2021 | EN JP |
![]() | ap-10k | AP-10K | Pytorch | 1.2.4 and later | Aug 2021 | |
![]() | e2pose | E2Pose | Tensorflow | 1.2.5 and later | Oct 2022 |
Pose estimation 3d
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | pose-hg-3d | Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach | Pytorch | 1.2.6 and later | Apr 2017 | |
![]() | 3d-pose-baseline | A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17. | TensorFlow | 1.2.3 and later | May 2017 | |
![]() | lightweight-human-pose-estimation-3d | Real-time 3D multi-person pose estimation demo in PyTorch. OpenVINO backend can be used for fast inference on CPU. | Pytorch | 1.2.1 and later | Dec 2017 | |
![]() | 3dmppe_posenet | PoseNet of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image" | Pytorch | 1.2.6 and later | Jul 2019 | |
![]() | gast | A Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video (GAST-Net) | Pytorch | 1.2.7 and later | Mar 2020 | EN JP |
![]() | blazepose-fullbody | MediaPipe | TensorFlow Lite | 1.2.5 and later | Jun 2020 | EN JP |
![]() | mediapipe_pose_world_landmarks | MediaPipe Pose real-world 3D coordinates | TensorFlow Lite | 1.2.10 and later | Jun 2022 |
Road detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | road-segmentation-adas | road-segmentation-adas-0001 | OpenVINO | 1.2.5 and later | Sep 2018 | |
![]() | codes-for-lane-detection | Codes-for-Lane-Detection | Pytorch | 1.2.6 and later | Aug 2019 | EN JP |
![]() | ultra-fast-lane-detection | Ultra-Fast-Lane-Detection | Pytorch | 1.2.6 and later | Apr 2020 | |
![]() | polylanenet | PolyLaneNet | Pytorch | 1.2.9 and later | Apr 2020 | |
![]() | roneld | RONELD-Lane-Detection | Pytorch | 1.2.6 and later | Oct 2020 | |
![]() | lstr | LSTR | Pytorch | 1.2.8 and later | Nov 2020 | |
![]() | yolop | YOLOP | Pytorch | 1.2.6 and later | Aug 2021 | |
![]() | cdnet | CDNet | Pytorch | 1.2.5 and later | Feb 2022 | |
![]() | hybridnets | HybridNets | Pytorch | 1.2.6 and later | Mar 2022 |
Rotation prediction
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | rotnet | CNNs for predicting the rotation angle of an image to correct its orientation | Keras | 1.2.1 and later | Mar 2018 |
Style transfer
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | adain | Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization | Pytorch | 1.2.1 and later | Mar 2017 | EN JP |
![]() | pix2pixHD | pix2pixHD: High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | Pytorch | 1.2.6 and later | Nov 2017 | |
![]() | beauty_gan | BeautyGAN | Pytorch | 1.2.7 and later | Jul 2018 | |
![]() | psgan | PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer | Pytorch | 1.2.7 and later | Sep 2019 | |
![]() | animeganv2 | PyTorch Implementation of AnimeGANv2 | Pytorch | 1.2.5 and later | Nov 2020 | |
![]() | EleGANt | EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer | Pytorch | 1.2.15 and later | Jul 2022 |
Super resolution
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | srresnet | Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | Pytorch | 1.2.0 and later | Sep 2016 | EN JP |
![]() | edsr | Enhanced Deep Residual Networks for Single Image Super-Resolution | Pytorch | 1.2.6 and later | Jul 2017 | EN JP |
![]() | han | Single Image Super-Resolution via a Holistic Attention Network | Pytorch | 1.2.6 and later | Aug 2020 | |
![]() | real-esrgan | Real-ESRGAN | Pytorch | 1.2.9 and later | Jul 2021 | JP |
![]() | swinir | SwinIR: Image Restoration Using Swin Transformer | Pytorch | 1.2.12 and later | Aug 2021 | |
![]() | rcan-it | Revisiting RCAN: Improved Training for Image Super-Resolution | Pytorch | 1.2.10 and later | Jan 2022 | |
![]() | Hat | Hat | Pytorch | 1.2.6 and later | May 2022 | |
![]() | SPAN | SPAN | Pytorch | 1.2.14 and later | Nov 2023 | JP |
Text detection
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | east | EAST: An Efficient and Accurate Scene Text Detector | TensorFlow | 1.2.6 and later | Apr 2017 | |
| pixel_link | Pixel-Link | TensorFlow | 1.2.6 and later | Jan 2018 | ||
![]() | craft_pytorch | CRAFT: Character-Region Awareness For Text detection | Pytorch | 1.2.2 and later | Apr 2019 |
Text recognition
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | etl | Japanese Character Classification | Keras | 1.1.0 and later | 1973 | JP |
![]() | crnn.pytorch | Convolutional Recurrent Neural Network | Pytorch | 1.2.6 and later | Jul 2015 | |
![]() | deep-text-recognition-benchmark | deep-text-recognition-benchmark | Pytorch | 1.2.6 and later | Apr 2019 | |
![]() | easyocr | Ready-to-use OCR with 80+ supported languages | Pytorch | 1.2.6 and later | Apr 2020 | |
![]() | paddleocr | PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle | Pytorch | 1.2.6 and later | Sep 2020 | EN JP |
![]() | donut | Donut | Pytorch | 1.2.16 and later | Nov 2021 | |
![]() | ndlocr_text_recognition | NDL OCR | Pytorch | 1.2.5 and later | Apr 2022 | |
![]() | paddleocr_v3 | PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle | Pytorch | 1.2.17 and later | Jun 2022 | JP |
Time-Series Forecasting
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| informer2020 | Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting (AAAI'21 Best Paper) | Pytorch | 1.2.10 and later | Dec 2020 | |
| timesfm | TimesFM | Pytorch | 1.2.16 and later | Oct 2023 | JP |
Vehicle recognition
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | vehicle-attributes-recognition-barrier | vehicle-attributes-recognition-barrier-0042 | OpenVINO | 1.2.5 and later | May 2018 | EN JP |
![]() | vehicle-license-plate-detection-barrier | vehicle-license-plate-detection-barrier-0106 | OpenVINO | 1.2.5 and later | May 2018 |
Vision Language Model
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog | |
|---|---|---|---|---|---|---|
![]() | llava | LLaVA | Pytorch | 1.2.16 and later | Apr 2023 | JP |
![]() | florence2 | Hugging Face - microsoft/Florence-2-base | Pytorch | 1.2.16 and later | Nov 2023 | JP |
![]() | mobilevlm | MobileVLM | Pytorch | 1.5.0 and later | Dec 2023 | |
![]() | llava-jp | LLaVA-JP | Pytorch | 1.5.0 and later | Jan 2024 | |
![]() | qwen2_vl | Qwen2-VL | Pytorch | 1.5.0 and later | Sep 2024 | JP |
Commercial model
| Model | Reference | Exported From | Supported Ailia Version | Date | Blog |
|---|---|---|---|---|---|
| acculus-pose | Acculus, Inc. | Caffe | 1.2.3 and later | May 2018 |
Other platforms
Prototype with ailia MODELS (Python), then deploy to production.

























































































































































































































































































































