DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
-
Updated
Sep 27, 2024 - Python
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
Integrate AI capabilities into a DevExpress-powered Office File API Web API application.
Data release for the ImageInWords (IIW) paper.
Testing the Moondream tiny vision model
This repo represents our machine learning project Image Description which is used to generate a description of an image based on activities and objects detected in the image.
NL Generation from structured inputs. Focuses on generating natural language descriptions for images by exploring the relationship between textual descriptions and image attributes. Leveraging an encoder-decoder architecture with LSTM cells, the system transforms normalized vector representations of attributes into fixed-length vector.
Trabalho de Conclusão de Curso de Engenharia de Computação (UTFPR): Descritor de imagem baseado em curvas de Hilbert
In this project, we use a Deep Recurrent Architecture, which uses CNN (VGG-16 Net) pretrained on ImageNet to extract 4096-Dimensional image feature Vector and an LSTM which generates a caption from these feature vectors.
Lucene Image Retrieval (LIRe) code to extract Open Access Series of Imaging Studies (OASIS) features.
Content-Based Image Retrieval System
Key Pointers/ Exhaustive Notes for various Machine Learning Research Papers
Add a description, image, and links to the image-descriptions topic page so that developers can more easily learn about it.
To associate your repository with the image-descriptions topic, visit your repo's landing page and select "manage topics."