DEV Community

Yodit Weldegeorgise
Yodit Weldegeorgise

Posted on

๐—š๐—ฒ๐˜๐˜๐—ถ๐—ป๐—ด ๐—ฆ๐˜๐—ฎ๐—ฟ๐˜๐—ฒ๐—ฑ ๐˜„๐—ถ๐˜๐—ต ๐—”๐—œ ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ๐—ถ๐—ป๐—ด

Recently, I got a recommendation for the book AI Engineering: Building Applications with Foundation Models by Chip Huyen, and I decided to give it a try.

I finished Chapter 1, and I really loved it. The examples, use cases, and overall flow make the concepts easy to understand.

To complement that, I also started preparing for the AWS AI Practitioner certification. My goal is to not only understand the concepts but also see how they apply in real world systems. I did not want to just read about AI, I wanted to connect it to how things are actually built and used.

๐—” ๐— ๐—ผ๐—ฟ๐—ฒ ๐—ฃ๐—ฟ๐—ฎ๐—ฐ๐˜๐—ถ๐—ฐ๐—ฎ๐—น ๐—ฉ๐—ถ๐—ฒ๐˜„ ๐—ผ๐—ณ ๐—”๐—œ ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ๐—ถ๐—ป๐—ด

One idea that stood out to me is how AI engineering is evolving in practice.

While traditional machine learning focuses heavily on model development and training, AI engineering today is increasingly centered around how we use and adapt existing foundation models in real world applications.

This shift moves the focus from building models to designing systems. It is less about training from scratch and more about how effectively we can guide models, provide the right context, and evaluate their outputs.

What I found particularly valuable is how this perspective aligns closely with software engineering principles. The emphasis is not just on the model itself, but on how it fits into a larger system and delivers value in a practical setting.

๐—จ๐—ป๐—ฑ๐—ฒ๐—ฟ๐˜€๐˜๐—ฎ๐—ป๐—ฑ๐—ถ๐—ป๐—ด ๐—ต๐—ผ๐˜„ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐—ฝ๐—ฟ๐—ผ๐—ฐ๐—ฒ๐˜€๐˜€ ๐—น๐—ฎ๐—ป๐—ด๐˜‚๐—ฎ๐—ด๐—ฒ

One concept that stood out to me was how models process text.

Instead of reading full words the way we do, they break text into smaller pieces called tokens. For example, a word like โ€œcookingโ€ can be split into โ€œcookโ€ and โ€œing.โ€ Even a completely new or made up word can still be interpreted because the model looks at its smaller components.

This approach allows models to capture meaning while remaining efficient, and it helps them handle words they have not explicitly seen before

It also explains why interactions with these models feel flexible and natural.

๐—›๐—ผ๐˜„ ๐—ฟ๐—ฒ๐˜€๐—ฝ๐—ผ๐—ป๐˜€๐—ฒ๐˜€ ๐—ฎ๐—ฟ๐—ฒ ๐—ด๐—ฒ๐—ป๐—ฒ๐—ฟ๐—ฎ๐˜๐—ฒ๐—ฑ

Another important idea is how these models generate responses.

They do not reason in the same structured way humans do. Instead, they predict what comes next based on the sequence of tokens they have already seen. Each response is built incrementally, one step at a time, based on learned patterns.

When viewed this way, it becomes clearer why interactions feel conversational. The model is continuously predicting the next most likely piece of text in a sequence

๐—™๐—ฟ๐—ผ๐—บ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ๐—ถ๐—ป๐—ด ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜๐—ผ ๐—ฎ๐—ฑ๐—ฎ๐—ฝ๐˜๐—ถ๐—ป๐—ด ๐˜๐—ต๐—ฒ๐—บ

A key takeaway for me is that building AI applications today does not always require training models from scratch.

Instead, much of the focus is on adapting existing models. This can be done by providing better instructions, supplying additional context through external data, or refining the model for specific use cases.

This shift lowers the barrier to entry and enables faster experimentation. It allows developers to focus more on solving problems and less on the complexity of model training

It feels great to understand the nuances behind what we use every day. I use LLMs on a daily basis, and getting to learn what is happening underneath has been fascinating so far.

From a software engineering perspective, this shift is particularly meaningful. It reinforces the importance of building, iterating, and integrating systems rather than waiting for complete theoretical understanding.

It also highlights that many core engineering skills remain relevant. Designing systems, thinking about user experience, and building reliable applications continue to be central, even as the tools evolve.

At this stage, I am continuing the book while also preparing for the AWS AI Practitioner certification. I want to build a solid foundation while staying grounded in practical application.

In parallel, I am exploring ways to apply these concepts in smaller projects. Even simple implementations can provide valuable insight into how these systems behave and how they can be improved.

This combination of learning and building feels like the most effective way to deepen my understanding.

AI is no longer just about models, it is about systems.

Understanding how to use, guide, and integrate those systems is what makes the difference.

Top comments (0)