A collection of services using third party DNN models.
For more details on how to publish and test a service, select it from the list below:
- real-time-voice-cloning (User's Guide) - This service uses Real-Time-Voice-Cloning to clone a voice from a 5 seconds audio file to generate arbitrary speech in real-time [Reference]
- cntk-image-recon (User's Guide) - This service uses ResNet152 model, trained to recognize different types of flowers and dog breeds. [Reference]
- deepfakes-faceswap (User's Guide) - This service uses the Deepfakes Faceswap, trained on two input videos A and B, to perform face swapping on videos. [Reference]
- siggraph-colorization (User's Guide) - This service learns to automatically color grayscale images with a deep network. [Reference]
@Article{IizukaSIGGRAPH2016, author = {Satoshi Iizuka and Edgar Simo-Serra and Hiroshi Ishikawa}, title = {{Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification}}, journal = "ACM Transactions on Graphics (Proc. of SIGGRAPH 2016)", year = 2016, volume = 35, number = 4, }
- yolov3-object-detection (User's Guide) - This service uses YOLOv3 model to detect objects on images. [Reference]
@article{yolov3, title={YOLOv3: An Incremental Improvement}, author={Redmon, Joseph and Farhadi, Ali}, journal = {arXiv}, year={2018} }
- places365-scene-recognition (User's Guide) - This service uses various convolutional neural networks trained on Places365 to perform scene recognition. [Reference]
@article{zhou2017places, title={Places: A 10 million Image Database for Scene Recognition}, author={Zhou, Bolei and Lapedriza, Agata and Khosla, Aditya and Oliva, Aude and Torralba, Antonio}, journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, year={2017}, publisher={IEEE} }
- pneumonia-diagnosis (User's Guide) - This service uses VGG19 to classify X-ray chest images. Based on Alishba Imran's work.
- i3d-video-action-recognition (User's Guide) - This service uses I3D model to recognize actions on videos (with 400 or 600 labels). [Reference]
- s2vt-video-captioning (User's Guide) - This service uses "Sequence to Sequence - Video to Text" to describe video content with natural language text. [Reference]
@inproceedings{venugopalan15iccv, title = {Sequence to Sequence -- Video to Text}, author = {Venugopalan, Subhashini and Rohrbach, Marcus and Donahue, Jeff and Mooney, Raymond and Darrell, Trevor and Saenko, Kate}, booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)}, year = {2015} }
- zeta36-chess-alpha-zero (User's Guide) - This service uses AlphaGo Zero methods to learn and play chess. [Reference]
Please read our guidelines before submitting an issue. If your issue is a bug, please use the bug template pre-populated here. For feature requests and queries you can use this template.
- Artur Gontijo - Maintainer - SingularityNET
- Ramon Durães - Maintainer - SingularityNET
This project is licensed under the MIT License - see the LICENSE file for details.
Each service is licensed as followed:
- cntk-image-recon - MIT License
- deepfakes-faceswap - GPL-3.0
- i3d-video-action-recognition - Apache License 2.0
- places365-scene-recognition - MIT License
- pneumonia-diagnosis - Attribution 4.0 International (CC BY 4.0)
- real-time-voice-cloning - MIT License
- s2vt-video-captioning - Attribution 4.0 International (CC BY 4.0)
- yolov3-object-detection - Public domain
- zeta36-chess-alpha-zero - MIT License