Small Language Models are Going to Eat the World.

#llmops #promptengineering #softwareengineering #llm

Today, most Large Language Models (LLMs) are accessed over the internet. As prompt-based applications become ubiquitous, we will likely see a gradual transition from internet-hosted models to locally hosted ones.

Local models are nothing new. Google product users are often prompted to download local models for Google Maps, Google Translate, and Text-to-Speech. These models run locally for four primary reasons:

speed
reliability
privacy
cost

Benefits

Speed

Local models have no network latency. They run on the device itself, so instructions and data stay close to the application layer instead of making a round trip over the network, which shortens response times.
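As a rough illustration of the round-trip cost, the sketch below times the same work with and without a simulated network delay. The functions `local_model` and `hosted_model` are hypothetical stand-ins, not real inference calls, and the 50 ms round trip is an assumed figure chosen only for the example.

```python
import time


def time_call(fn, *args):
    """Return (result, elapsed milliseconds) for a single call."""
    start = time.perf_counter()
    result = fn(*args)
    elapsed_ms = (time.perf_counter() - start) * 1000
    return result, elapsed_ms


# Assumed 50 ms network round trip; illustrative only.
SIMULATED_ROUND_TRIP_S = 0.05


def local_model(prompt):
    # Hypothetical stand-in for on-device inference.
    return prompt.upper()


def hosted_model(prompt):
    # Same work as local_model, plus a simulated network round trip.
    time.sleep(SIMULATED_ROUND_TRIP_S)
    return prompt.upper()


_, local_ms = time_call(local_model, "hello")
_, hosted_ms = time_call(hosted_model, "hello")
print(f"local: {local_ms:.2f} ms, hosted: {hosted_ms:.2f} ms")
```

In a real comparison the inference time itself would also differ between a device and a data center, so the numbers printed here only show the network component.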

Reliability

Local models are self-reliant. They don't require additional computers to operate and don't depend on third-party service providers. They run stand-alone and won't break if internet connectivity is lost.
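One way an application might benefit from this is to keep a local model available as a fallback when a hosted one is unreachable. The following is a minimal sketch under that assumption; `complete_with_fallback`, `offline_remote`, and `tiny_local_model` are hypothetical names introduced for illustration, not part of any real SDK.

```python
from typing import Callable, Tuple


def complete_with_fallback(
    prompt: str,
    remote: Callable[[str], str],
    local: Callable[[str], str],
) -> Tuple[str, str]:
    """Try the hosted model first; use the local model if the network fails.

    Returns a (source, text) pair so the caller can see which model answered.
    """
    try:
        return "remote", remote(prompt)
    except (ConnectionError, TimeoutError):
        return "local", local(prompt)


def offline_remote(prompt: str) -> str:
    # Hypothetical hosted call that fails as if connectivity were lost.
    raise ConnectionError("no internet connectivity")


def tiny_local_model(prompt: str) -> str:
    # Hypothetical stand-in for an on-device model.
    return f"[local] echo: {prompt}"


source, text = complete_with_fallback("hello", offline_remote, tiny_local_model)
print(source, text)
```

An application that runs only the local model skips the `try` branch entirely, which is the stand-alone case described above.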

Privacy

Private information is processed locally and never shared with another provider. Inputs to these models may contain private or confidential information that an external processor should not handle.

Cost

Local models require no hosting. When models run frequently, the cost of regularly processing data at scale through a hosted service may become unaffordable, and may be better absorbed by the local device.
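To make the scaling argument concrete, here is a small back-of-the-envelope sketch. Every number in it (request volume, tokens per request, price per million tokens, device count) is an assumption picked for illustration, not a quoted price from any provider.

```python
def monthly_hosted_cost(requests_per_day, tokens_per_request,
                        usd_per_million_tokens, days=30):
    """Estimate the monthly bill for a usage-priced hosted model."""
    total_tokens = requests_per_day * tokens_per_request * days
    return total_tokens / 1_000_000 * usd_per_million_tokens


# Illustrative assumptions: 1,000 requests per day per device,
# 500 tokens per request, 2.00 USD per million tokens.
per_device = monthly_hosted_cost(1000, 500, 2.0)

# Assumed fleet of 10,000 devices making the same calls.
fleet = per_device * 10_000

print(f"per device: {per_device:.2f} USD/month")
print(f"fleet:      {fleet:.2f} USD/month")
```

Under these assumptions a modest per-device bill grows into a large recurring fleet-wide cost, which is the cost that on-device inference would move onto hardware that is already deployed.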

How can we make local models a reality?

Python is the language of choice for running LLMs. However, embedded devices, mobile apps, and web servers often rely on other languages to run efficiently.

To bridge the SDK gap across platforms, engineers should consider developing and integrating multi-language libraries and frameworks for large language models that are compatible with mobile, embedded, and diverse server environments. Innovation and flexibility are critical here, because large language models represent a new technological frontier rather than an enhancement of existing tools.

The original article can be found here: https://promptdesk.ai/articles/small-languages-models-are-going-to-eat-the-world
