We are Rodrigo and Javier, and we are currently looking for more talented individuals with expertise in Computer Vision (CV) and Natural Language Processing (NLP), particularly those familiar with Large Language Models (LLMs), to join us in an exciting and impactful project. Our goal is to develop an innovative system designed to automatically describe images for visually impaired individuals, enhancing their ability to understand and interact with the world around them.
The core of this project involves creating a seamless integration between CV and NLP technologies. Specifically, we plan to use advanced Computer Vision models to accurately identify and extract features, objects, and relevant contextual information from images. Once these visual elements are identified, the data will be fed into a sophisticated Large Language Model. The LLM will then generate detailed, coherent, and contextually appropriate descriptions of the images, effectively narrating the scene or conveying key topics present in the picture.
This system aims to provide visually impaired users with real-time, dynamic descriptions, making everyday experiences like navigating environments, understanding social media content, or even enjoying visual arts more accessible. We believe this project has the potential to significantly improve the quality of life for many people by making digital and real-world visual information more accessible.
If you are passionate about technology and accessibility, and have experience or interest in Computer Vision, NLP, and LLMs, we would love to hear from you. Join us in making a meaningful difference through technology!
This project sounds both impactful and inspiring. While Iām still building my experience in Computer Vision and NLP, Iām eager to contribute and learn more, especially when it comes to integrating LLMs with CV models. The idea of using technology to improve accessibility for visually impaired individuals really sounds exciting. Iād love the opportunity to get involved.
Hello, I am Pratik. Your project idea sounds quite interesting. I am currently learning AIML and have a keen interest in NLP. I would love to contribute to this project.
LinkedIn profile: https://www.linkedin.com/in/pratik-chakraborty-862a8228b/
Hi I am Sarthak Kapaliya, a graduate student with keen interest to work in CV and LLM domain. I am currently pursuing masters at McMaster University. I want to collaborate on this project.
Hello, I am Omorinsola Makinde. The idea of using Computer vision and NLP to solve problems and make it easier for the Visually impaired sounds amazing .I am still currently learning more on Computer vision and will love to contribute to this project. www.linkedin.com/in/omorinsola-makinde
Hi Rodrigo and Javier, this is Jue. I find your idea very inspiring and I believe there will be a huge market for it. I have completed Deep Learning Specialization and several other machine learning courses. The following is my LinkedIn address. Let me know how I can contribute. Thanks!
Hello, I am Diwakar. I am interested in this project and I would like to collaborate. I have completed deep learning and machine learning specializations and other AI courses.