Member-only story
How does ChatGPT works from a technical perspective?
Everyone is wanting to leverage ChatGPT these days.
But not many people know how it works.
Like really works.
I will explain it to you here.
The main technology behind ChatGPT is inside the name.
GPT.
“GPT” in ChatGPT stands for “Generative Pre-trained Transformer”.
It’s a type of artificial intelligence model designed for understanding and generating human-like text based on the input it receives.
“Generative” means it can create text, “Pre-trained” indicates that it has been trained on a large dataset before being fine-tuned for specific tasks, and “Transformer” refers to the underlying architecture that enables it to understand the context and relationships between words in a sentence.
So what does this really mean?
Specially the “Transformer” part.
The “Transformer” architecture is a deep learning model introduced in the paper “Attention is All You Need” by Vaswani et al. in 2017.