DEV Community

Aggrey Paintsil Ishmeal
Aggrey Paintsil Ishmeal

Posted on

✨ Built with Google Gemini: PaperLens AI — Chat With Research Papers

This is a submission for the Built with Google Gemini: Writing Challenge


What I Built with Google Gemini

As a computer science student working on research projects, I often face the same problem: research papers are difficult and time-consuming to understand. Papers can be long, technical, and full of complex equations, making it hard to quickly extract the key ideas.

To solve this, I built PaperLens AI, an application that allows users to upload a research paper PDF and interact with it using AI.

The app automatically:

  • Extracts text from the uploaded research paper
  • Splits the document into semantic chunks
  • Creates embeddings for each chunk using Google Gemini embeddings
  • Retrieves the most relevant sections of the paper
  • Uses Gemini to answer questions about the paper

In simple terms, PaperLens AI lets users chat with research papers.

For example, users can ask questions like:

  • What is the research gap in this paper?
  • What dataset was used?
  • What are the limitations of the method?

The AI then finds the most relevant sections of the paper and generates an answer based on those sections.


Architecture of the System

The system uses a simple Retrieval Augmented Generation (RAG) pipeline.

Research Paper PDF
        ↓
Text Extraction (PyPDF2)
        ↓
Chunking
        ↓
Gemini Embeddings
        ↓
Similarity Search
        ↓
Gemini generates answer
Enter fullscreen mode Exit fullscreen mode

Technologies used:

  • Python
  • Streamlit (UI)
  • Google Gemini API
  • NumPy / Scikit-learn
  • Vector similarity search

The goal was to create something lightweight but powerful enough to help students quickly understand complex research.


Demo

Here is the interface of PaperLens AI.

GitHub logo aggreypaintsil168 / paperlens-ai

Research papers are difficult and time-consuming to understand. Papers can be long, technical, and full of complex equations, making it hard to quickly extract the key ideas.


📄 PaperLens AI

Chat With Research Papers Using Google Gemini

Python Streamlit Google Gemini Status


🚀 Overview

PaperLens AI is an AI-powered application that allows users to upload research papers and interact with them conversationally.

Instead of reading long academic papers line by line, users can ask questions like:

What is the research gap in this paper?
What dataset was used?
What methodology does the paper propose?
What are the limitations of the study?

PaperLens AI retrieves the most relevant sections of the paper and uses Google Gemini to generate contextual answers.

This transforms static research documents into an interactive knowledge system.


🎥 Demo

Dashboard Screenshot

Example UI:

Upload Research Paper
↓
Extract + Chunk Paper
↓
Create Embeddings
↓
Ask Gemini

Example query:

What is the research gap of this paper?

Example output:

The research gap addressed by this paper is the inability of
traditional forecasting models to capture both temporal
dependencies and trend

Users can:

  1. Upload a research paper
  2. Extract and chunk the document
  3. Generate embeddings
  4. Ask questions about the paper

Example workflow:

Upload PDF
→ Extract + Chunk Paper
→ Create Embeddings
→ Ask Gemini
Enter fullscreen mode Exit fullscreen mode

Example question:

What is the research gap in this paper?
Enter fullscreen mode Exit fullscreen mode

The system retrieves relevant sections and Gemini generates an explanation.

(If deployed, the Cloud Run app would be embedded here.)


What I Learned

This project taught me several technical and conceptual lessons.

1. How Retrieval-Augmented Generation Works

Before this project, I had only read about RAG systems. Implementing it myself helped me understand how modern AI assistants work internally.

Instead of sending an entire document to the model, we:

  1. Convert text into embeddings
  2. Store them as vectors
  3. Retrieve the most relevant pieces of text
  4. Send those pieces to the AI model

This dramatically improves both accuracy and efficiency.


2. Building AI Applications Requires More Than Just Calling an API

The real engineering work was in:

  • Text chunking
  • Embedding generation
  • Similarity search
  • Prompt engineering

These components are what transform a basic model call into a useful application.


3. Debugging AI Infrastructure

While building PaperLens AI I encountered several issues, including:

  • API authentication errors
  • Deprecated Gemini models
  • Embedding model changes
  • API quota limitations

Debugging these issues taught me a lot about how real AI systems evolve quickly, and why developers must constantly adapt to new APIs and model updates.


4. Building AI Tools for Research

As someone interested in machine learning and research, building this project made me realize how powerful AI can be for assisting with scientific work.

Tools like this could help students:

  • Understand research papers faster
  • Identify research gaps
  • Generate ideas for new research directions

Google Gemini Feedback

Overall, Google Gemini made it relatively easy to build a powerful AI application.

What worked well

The embedding capabilities and text generation features made it possible to build a full RAG pipeline with relatively little code.

Gemini was particularly good at:

  • Explaining complex technical concepts
  • Summarizing research sections
  • Answering contextual questions about documents

Where I ran into friction

One challenge I encountered was related to model changes and deprecations.

During development, some models referenced in earlier documentation were no longer available, which required updating the application to use newer Gemini models.

Another challenge involved API quota limits, which temporarily prevented some requests from completing during testing.

These issues highlighted how quickly AI platforms evolve and the importance of keeping projects updated with the latest SDKs and models.


What I’d like to see improved

Two improvements that would make Gemini development even better are:

  • clearer versioning for model releases
  • easier visibility into quota usage during development

These would help developers avoid unexpected errors while building applications.


What’s Next

PaperLens AI is just the beginning.

Future improvements could include:

  • Multi-paper comparison
  • Automatic research gap detection
  • Citation extraction
  • Integration with academic databases

Ultimately, I’d love to turn PaperLens AI into a tool that helps students and researchers navigate the growing volume of scientific literature.


Final Thoughts

This project reminded me that the most exciting part of AI development isn’t just the models themselves — it’s how we apply them to real problems.

For students and researchers, understanding complex papers is often a major barrier. With tools like Google Gemini, we can start building systems that make knowledge more accessible.

And that’s something worth building.

Top comments (0)