RobertBiehl/CLIP-tf2

📦 Open Source ProjectRobertBiehl

A complete implementation of OpenAI's CLIP model ported to TensorFlow 2 and Keras.

CLIP-tf2 bridges the gap between PyTorch-centric research and TensorFlow-based production environments. By converting the original OpenAI CLIP architecture into Keras-compatible layers, this project allows practitioners to utilize pre-trained weights for tasks like image classification, semantic search, and zero-shot object recognition within the TensorFlow ecosystem. The implementation maintains the integrity of the original CLIP model, featuring the dual-encoder architecture consisting of a vision transformer (ViT) or ResNet image encoder and a Transformer-based text encoder. This repository is particularly useful for developers who require high-performance vision-language models but are constrained by infrastructure or project requirements that mandate the use of TensorFlow 2. It supports standard Keras model loading and inference workflows, making it a drop-in solution for integrating state-of-the-art multimodal capabilities into legacy or TF-native AI applications.

💡Highlights

├─Native Keras/TF2 implementation
├─Zero-shot image classification
└─Supports ViT and ResNet backbones

🎯For

├─TensorFlow Developers
└─Computer Vision Engineers

🔗Links

└─GitHub Repository