# Starting My Journey into AI Research 🚀

Goutam Sharma — Wed, 27 May 2026 20:54:49 +0000

Hello everyone!

I am a 3rd year Computer Science student, and recently I started exploring the world of AI research, especially in:

Vision Language Models (VLMs)
Spatial AI
Semantic Segmentation
Multimodal Learning
Scene Understanding

At first, research looked very complicated to me because of research papers, mathematical concepts, and large AI architectures. But after reading papers and experimenting with datasets and models, I realized research is mainly about curiosity and solving problems.

What I Am Currently Exploring

Recently, I have been learning about:

How VLMs understand images and text together
Spatial reasoning in AI systems
Scene graph understanding
Data curation and annotation pipelines
Reducing hallucinations in multimodal models

My Goal

I want to work on advanced AI systems that can understand the real world more accurately, especially for applications like:

Robotics
Autonomous systems
Smart surveillance
Human-AI interaction

What I Learned So Far

One important thing I learned is:

Research is not about knowing everything.
It is about continuously learning and improving ideas.

Technologies I Am Using

Python
PyTorch
Hugging Face
OpenCV
Transformers

Final Thoughts

This is just the beginning of my research journey, and I am excited to keep learning, building, and sharing my progress with the community.

If you are also starting in AI research, feel free to connect with me!

ai #machinelearning #research #computervision