Exploring the world of software development, technology, data science, and life

How Large Does a Large Language Model Need To Be?

Cyborg Immanuel Kant

Large language models can have billions, or even trillions, of parameters. But how big do they need to be to achieve acceptable performance? To test this, I experimented with several of Google’s Gemma 3 models, all small enough to run locally on a single GPU. Specifically I used the 1 […]

Making LLMs Useful with Function Calls and Embeddings

Large Language Model AIs like Google’s Gemini and Open-AI’s GPT can be interesting to play around with even in a simple chatbot like Chat-GPT. But those chatbots largely waste their potential. Their understanding of natural language is impressive, but their “knowledge” is limited to what they were trained on. But […]

Are QBs Overrated? What the Data Says.

It’s almost Super Bowl Sunday, one of the most sacred holidays here in the United States. It’s the day of the championship game between the winners of the NFC and AFC, this year the Philadelphia Eagles and Kansas City Chiefs. Yes, for those of you from outside of the US, […]

Goodbye 2024

Don’t let the door hit you in the way out. I don’t think I will miss this year too much. But there were some good things that happened. Ok, not much, but some good music came out. Here is last year’s playlist on Qobuz and on Spotify.

Something is in the sky

There is something weird going on the Eastern United States. Lots of people along the Atlantic coast have reported seeing weird things in the sky over the past few days. If this happened in the 20th century everyone would be saying they are UFOs. Today, they are assumed to be […]