<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Reel Crave</title>
    <description>The latest articles on DEV Community by Reel Crave (@reel_crave).</description>
    <link>https://dev.to/reel_crave</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3933759%2Fb16159df-5375-4c0a-90e8-88d863eca0fd.png</url>
      <title>DEV Community: Reel Crave</title>
      <link>https://dev.to/reel_crave</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/reel_crave"/>
    <language>en</language>
    <item>
      <title>Anyone else building training corpora from academic literature?</title>
      <dc:creator>Reel Crave</dc:creator>
      <pubDate>Fri, 15 May 2026 19:04:21 +0000</pubDate>
      <link>https://dev.to/reel_crave/anyone-else-building-training-corpora-from-academic-literature-2pfp</link>
      <guid>https://dev.to/reel_crave/anyone-else-building-training-corpora-from-academic-literature-2pfp</guid>
      <description>&lt;p&gt;Curious what your data collection pipeline looks like.&lt;/p&gt;

&lt;p&gt;I've been pulling from ScholarAPI for domain-specific RAG datasets.. medical, materials science, chemistry. The structured JSON + PDF access makes chunking for embeddings cleaner than parsing scraped HTML.&lt;/p&gt;

&lt;p&gt;Current setup: ScholarAPI → extract → chunk → embed into Chroma. Works well for domain-specific Q&amp;amp;A.&lt;/p&gt;

&lt;p&gt;What are you using?&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkmtzm4xtxlf5s2vaoj8j.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkmtzm4xtxlf5s2vaoj8j.png" alt=" " width="800" height="533"&gt;&lt;/a&gt;&lt;br&gt;
Genuinely curious if there's something better I'm missing for open-access coverage. &lt;/p&gt;

&lt;p&gt;(&lt;a href="https://scholarapi.net?via=-asig3" rel="noopener noreferrer"&gt;https://scholarapi.net?via=-asig3&lt;/a&gt;)&lt;/p&gt;

</description>
    </item>
    <item>
      <title>Resource for anyone building tools for systematic literature reviews/automating paper monitoring:
ScholarAPI provides access to 30M+ open-access papers, metadata, full-text, &amp; PDFs via REST API. Useful if you're building research tools. scholarapi.net</title>
      <dc:creator>Reel Crave</dc:creator>
      <pubDate>Fri, 15 May 2026 18:50:04 +0000</pubDate>
      <link>https://dev.to/reel_crave/resource-for-anyone-building-tools-for-systematic-literature-reviewsautomating-paper-monitoring-1ddh</link>
      <guid>https://dev.to/reel_crave/resource-for-anyone-building-tools-for-systematic-literature-reviewsautomating-paper-monitoring-1ddh</guid>
      <description></description>
      <category>api</category>
      <category>automation</category>
      <category>data</category>
      <category>science</category>
    </item>
  </channel>
</rss>
