Goutam Sharma

# Starting My Journey into AI Research 🚀

Hello everyone!

I am a third-year Computer Science student, and I recently started exploring AI research, especially in:

  • Vision Language Models (VLMs)
  • Spatial AI
  • Semantic Segmentation
  • Multimodal Learning
  • Scene Understanding

At first, research looked very complicated to me because of the dense papers, mathematical concepts, and large AI architectures. But after reading papers and experimenting with datasets and models, I realized that research is mainly about curiosity and solving problems.

What I Am Currently Exploring

Recently, I have been learning about:

  • How VLMs understand images and text together
  • Spatial reasoning in AI systems
  • Scene graph understanding
  • Data curation and annotation pipelines
  • Reducing hallucinations in multimodal models

My Goal

I want to work on advanced AI systems that can understand the real world more accurately, especially for applications like:

  • Robotics
  • Autonomous systems
  • Smart surveillance
  • Human-AI interaction

What I Learned So Far

One important thing I learned is:

Research is not about knowing everything.
It is about continuously learning and improving ideas.

Technologies I Am Using

  • Python
  • PyTorch
  • Hugging Face
  • OpenCV
  • Transformers

Final Thoughts

This is just the beginning of my research journey, and I am excited to keep learning, building, and sharing my progress with the community.

If you are also starting in AI research, feel free to connect with me!

#ai #machinelearning #research #computervision