Who Invented Artificial Intelligence? The Fascinating History

The story of this field reads like a mix of myth and engineering. It traces ideas from ancient automata to modern large language models. Early work in the 1940s tied cybernetics, information theory, and digital computers into new research paths.

Alan Turing shaped the debate with his 1950 paper that proposed the Turing test, a practical way to ask whether a machine can act like a human. A few years later, John McCarthy coined the term “artificial intelligence” in his 1955 proposal that led to the Dartmouth workshop in 1956.

From symbolic programs in the 1950s and 1960s to today’s data-driven models, the field evolved through shifts in approach. We will break down how machine learning and deep learning grew from simple algorithms into systems that handle speech, images, chess, and language tasks you use every day.

Ancient Roots of Artificial Beings

From bronze giants to clever gears, early cultures explored what it meant to build life-like machines. These stories and devices set the tone for later work in artificial intelligence and computing.

  • Golden handmaid: mythic automaton serving a god
  • Programmable fountain: water-driven timing device by Al-Jazari
  • Talos: bronze guardian from Greek myth

Mythological Automata

The Greek word “automaton” means “acting of oneself.” Tales of Hephaestus’ golden workers and Talos show that people long imagined machines that act like humans.

Sacred statues in Egypt and Greece were sometimes treated as if they had minds. Those beliefs reveal how early societies mixed faith and early engineering.

The Legacy of Al-Jazari

In 1206, Al-Jazari wrote The Book of Knowledge of Ingenious Mechanical Devices. He documented programmable devices that used cams and water to time actions.

His analog systems influenced later makers, and historians note links to Leonardo da Vinci. By studying that work, researchers see a clear line from gears and pumps to modern artificial intelligence models and networked systems.

Early Concepts of Formal Reasoning

Long before modern computers, thinkers sketched systems to turn reasoning into repeatable steps.

Aristotle and other Greek philosophers laid out structured methods for argument and proof. Their rules for logic became a foundation for later work in formal systems.

In 1308, Ramon Llull completed the Ars magna, a paper-based method for combining concepts to generate new ideas. It reads like an early model for structured problem solving.

Gottfried Leibniz in the 1600s pushed the idea further with a universal symbolic language to reduce debate to calculation. That approach anticipated symbolic methods used in modern software and expert systems.

  • Ars magna: Llull’s combinatory logic device
  • Characteristica universalis: Leibniz’s universal notation idea
  • Boolean logic: Boole’s symbolic math for computing
  • Hilbert’s program: 20th-century push to formalize mathematics

George Boole’s 1854 laws of thought gave logic an algebraic form that a computer can evaluate directly. Later, Hilbert’s challenge to formalize mathematics helped lead to Turing’s model of computation and the idea that machines could follow algorithms to answer complex questions.
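
To make that link concrete, here is a minimal Python sketch of Boolean algebra driving a basic digital operation; the gate functions and the half-adder example are illustrative choices, not drawn from Boole’s text.

```python
# Boole's insight: logical statements reduce to algebra over {0, 1}.
# Digital computers build every operation from gates like these.

def AND(a, b): return a & b
def OR(a, b):  return a | b
def NOT(a):    return 1 - a

def half_adder(a, b):
    """Add two bits using only Boolean operations."""
    total = AND(OR(a, b), NOT(AND(a, b)))  # XOR built from AND/OR/NOT
    carry = AND(a, b)
    return total, carry

for a in (0, 1):
    for b in (0, 1):
        total, carry = half_adder(a, b)
        print(f"{a} + {b} -> sum={total}, carry={carry}")
```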

In short, the history shows a steady move from paper devices to symbolic systems and then to programmable machines that power today’s research, learning, and deep learning work.

The Mechanical Foundations of Computing

Before modern PCs filled labs, engineers built hulking machines that turned equations into electric signals.

The Atanasoff‑Berry Computer (ABC), built in 1939 at Iowa State College by John Vincent Atanasoff and Clifford Berry, was a clear leap forward.

The Atanasoff-Berry Computer

The ABC used binary digits—1s and 0s—to represent data. That choice became the standard for every digital computer that followed.

The machine relied on electronic circuits rather than gears. It weighed over 700 pounds and used roughly 300 vacuum tubes. At that scale it could solve up to 29 simultaneous linear equations.
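
To put that capability in modern terms, here is a small illustrative example of solving a system of simultaneous linear equations; the specific system is made up, and NumPy stands in for the ABC’s vacuum-tube arithmetic.

```python
import numpy as np

# A small system of three simultaneous linear equations, A @ x = b.
# The ABC solved systems like this (with up to 29 unknowns) in hardware.
A = np.array([[ 2.0,  1.0, -1.0],
              [-3.0, -1.0,  2.0],
              [-2.0,  1.0,  2.0]])
b = np.array([8.0, -11.0, -3.0])

x = np.linalg.solve(A, b)
print(x)  # -> [ 2.  3. -1.]
```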

  • Separated memory and processing, a design pattern still common in computer architecture.
  • Proved electronic computation could be faster and more reliable than mechanical approaches.
  • Laid hardware groundwork that researchers used to build software and algorithms aiming to mimic human intelligence.

  • Atanasoff-Berry Computer: early electronic binary computer (1939)
  • Vacuum tube array: about 300 tubes for computation
  • Linear solver: could handle up to 29 simultaneous equations

Even Ada Lovelace cautioned against exaggerated ideas of what Babbage’s Analytical Engine could do, an early sign of how the field has always balanced big ambitions against engineering limits.

In short, the ABC turned a philosophical idea into working hardware. That shift made modern artificial intelligence and deep learning possible by giving software reliable, high-speed systems to run on.

Who Invented Artificial Intelligence and the Dartmouth Workshop

In the summer of 1956 a single meeting changed how researchers approached making thinking machines. That gathering at Dartmouth College is widely seen as the official birth of the field called artificial intelligence.

  • Dartmouth Workshop: summer research meeting on machine intelligence (1956)
  • The Logic Theorist: early program that proved theorems using symbolic logic
  • Proposal document: John McCarthy’s 1955 plan that introduced the term

John McCarthy and the Terminology

John McCarthy wrote the 1955 proposal that named the field artificial intelligence. The term framed a new research agenda: build systems that simulate human problem solving.

The Logic Theorist

In 1956 Allen Newell and Herbert A. Simon presented the Logic Theorist. It was the first program to mimic how humans solve symbolic problems.

  • Example: It proved theorems from Principia Mathematica.
  • Impact: It showed a computer could model a human reasoning task.

Key Participants

Organizers included McCarthy, Marvin Minsky, Nathaniel Rochester, and Claude Shannon. Other early contributors were Ray Solomonoff, Oliver Selfridge, Trenchard More, and Arthur Samuel.

Together, these researchers set goals that guided decades of work. Their focus on models, data, and learning led to the networks and deep learning work we see today.

The Turing Test and Machine Intelligence

Turing turned a debate about thought into a practical experiment about machines and conversation.

In his 1950 paper “Computing Machinery and Intelligence,” Alan Turing introduced the imitation game, now called the Turing test. He framed the challenge to avoid arguing over the meaning of “thinking” and to focus on observable, conversational behavior instead.

The idea made a clear benchmark for researchers. If a computer’s replies could be mistaken for a human’s, the system earned a passing score. That practical approach guided early research into language, learning, and dialog systems.

  • Practical focus: tests behavior, not inner states.
  • Research tool: a benchmark for conversational AI development.
  • Lasting influence: shapes how we evaluate language models and expert systems today.

  • Imitation game: method for judging conversational indistinguishability
  • Turing’s 1950 paper: argued for behavior-based testing of machines
  • Conversational systems: applications that use language to interact with users
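
As a rough sketch of the imitation game’s structure (the responder functions and the judge below are invented placeholders, not real contestants), a judge exchanges text with an unseen party and guesses whether it is the machine:

```python
import random

# Placeholder responders: in a real test these would be a person and a chatbot.
def human_responder(question: str) -> str:
    return "I'd rather not do arithmetic in my head, honestly."

def machine_responder(question: str) -> str:
    return "I'd rather not do arithmetic in my head, honestly."

def imitation_game(judge_guess, rounds: int = 5) -> float:
    """Run text-only rounds; return the judge's accuracy at spotting the machine."""
    correct = 0
    for _ in range(rounds):
        is_machine = random.choice([True, False])
        responder = machine_responder if is_machine else human_responder
        answer = responder("Add 34957 to 70764.")  # a specimen question from Turing's 1950 paper
        if judge_guess(answer) == is_machine:
            correct += 1
    return correct / rounds

# A judge who does no better than chance means the machine has, in effect, passed.
print(imitation_game(lambda answer: random.choice([True, False])))
```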

Early Neural Networks and Cybernetic Robots

Small, noisy machines taught scientists a big lesson: learning can come from connections, not just code. In 1943 Walter Pitts and Warren McCulloch published a model that treated neurons as logical units. That work seeded later network research and early deep learning ideas.
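
Their threshold unit can be sketched in a few lines: a neuron “fires” when its weighted inputs reach a threshold, which is enough to implement logic gates. The weights and thresholds below are illustrative choices, not values from the 1943 paper.

```python
def mp_neuron(inputs, weights, threshold):
    """McCulloch-Pitts style unit: fire (1) if the weighted sum reaches the threshold."""
    return 1 if sum(i * w for i, w in zip(inputs, weights)) >= threshold else 0

# With the right threshold, the same unit acts as an AND gate or an OR gate.
AND = lambda a, b: mp_neuron([a, b], [1, 1], threshold=2)
OR  = lambda a, b: mp_neuron([a, b], [1, 1], threshold=1)

print([AND(a, b) for a in (0, 1) for b in (0, 1)])  # [0, 0, 0, 1]
print([OR(a, b)  for a in (0, 1) for b in (0, 1)])  # [0, 0, 1, 1]
```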

  • SNARC (1951): Marvin Minsky and Dean Edmonds’ neural network machine built from vacuum tubes
  • Perceptron (1958): Frank Rosenblatt’s pattern-recognition model that learned from data
  • Grey Walter’s tortoises: analog robots that navigated using simple circuits
  • Johns Hopkins Beast: experimental robot exploring autonomous behavior

  • Foundations: Pitts and McCulloch modeled networks that inspired later neural work.
  • Hardware experiments: SNARC and the Beast showed physical learning in machines.
  • Analog insight: Walter’s robots proved systems could behave without symbolic rules.
  • Pattern learning: Rosenblatt’s Perceptron made practical use of data for recognition.
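
Rosenblatt’s learning rule can be sketched minimally as weights that are nudged whenever a prediction is wrong; the tiny dataset below is invented for illustration.

```python
def train_perceptron(data, lr=0.1, epochs=20):
    """Perceptron rule: adjust weights and bias whenever the prediction misses."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in data:
            pred = 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0
            error = target - pred
            w = [wi + lr * error * xi for wi, xi in zip(w, x)]
            b += lr * error
    return w, b

# Toy, linearly separable data: the label is 1 only when both inputs are 1.
data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
w, b = train_perceptron(data)
print([1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0 for x, _ in data])  # [0, 0, 0, 1]
```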

These mid‑century projects shaped how researchers approach learning models today. They form a key chapter in the history of artificial intelligence and set the stage for the later rise of modern machine learning and deep learning work.

The Rise of Symbolic Reasoning

By the late 1950s, a clear school of thought took hold: thinking could be encoded as symbols and rules.

Symbolic reasoning dominated research from the mid‑1950s into the mid‑1990s. Teams built programs that manipulated high‑level symbols to model human problem solving.

Allen Newell and Herbert A. Simon created the General Problem Solver in 1957. It searched goals and subgoals to mimic human steps.

  • General Problem Solver: program that modeled stepwise human problem solving
  • LISP: John McCarthy’s 1958 language for symbolic work
  • Symbolic solvers: programs that proved theorems and solved word problems

Using LISP, researchers built complex systems with rules and symbolic representations. These programs solved algebra word problems and proved geometry theorems.
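
As a rough illustration of that symbolic style (the facts, rules, and goal names here are invented for the example and are not GPS’s actual operators), a program can reduce a goal to subgoals until it bottoms out in known facts:

```python
# A toy backward-chaining solver in the symbolic tradition:
# goals are reduced to subgoals via rules until they reach known facts.
facts = {"has_radius", "has_center"}
rules = {
    "is_circle":        ["has_radius", "has_center"],
    "can_compute_area": ["is_circle"],
}

def prove(goal, depth=0):
    """Return True if the goal is a fact or all of its subgoals can be proved."""
    print("  " * depth + "goal: " + goal)
    if goal in facts:
        return True
    subgoals = rules.get(goal)
    return subgoals is not None and all(prove(g, depth + 1) for g in subgoals)

print(prove("can_compute_area"))  # True: reduced to is_circle, then to the facts
```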

  • Belief: cognition reduces to symbol manipulation.
  • Strength: clear, rule‑based expert systems.
  • Legacy: frameworks and data structures still inform modern deep learning and hybrid systems.

The Cognitive Revolution in Science

A shift in the 1950s pulled psychologists, linguists, and computer scientists into one shared problem: modeling thought.

In 1956 figures like Noam Chomsky and George Miller helped turn attention to internal mental objects—thoughts, memories, and language structures—that behaviorism had ignored.

That interdisciplinary move treated the mind as an information system. Researchers began building software that could represent and manipulate symbolic mental items.

The result was new subfields: cognitive science, generative linguistics, and functionalist approaches that used computational models to explain human language and memory.

  • Practical outcome: scientists modeled how a machine might store and retrieve thoughts.
  • Funding focus: symbolic systems became a priority for research grants and expert systems work.
  • Long-term impact: these ideas set foundations for later data-driven and deep learning approaches.

  • Cognitive revolution: interdisciplinary shift to model mental processes
  • Generative linguistics: Chomsky’s framework for language structure
  • Cognitive science: field combining psychology, linguistics, and computing

Put simply, the cognitive revolution reframed research. It made it possible to design systems that mimic parts of human intelligence and paved the way for much of the modern work in computer learning and networks.

Early Successes and Optimism

The late 1950s produced programs that proved machines could handle tasks once seen as purely human. Early software solved algebra word problems, proved geometry theorems, and even learned simple English phrases.

With millions from DARPA, universities in the United States and Britain opened major labs. That funding let researchers build bigger systems and collect more data for experiments.

  • General Problem Solver: program modeling stepwise reasoning
  • Geometry Theorem Prover: automated proofs of geometric theorems
  • Early language programs: systems that parsed and generated simple English

  • Major impact: optimism grew that a machine could pass the Turing test within a short time.
  • Research surge: the world invested in labs, people, and networks that advanced future systems.
  • Legacy: those wins laid groundwork for expert systems, modern deep learning, and robotics.

The First AI Winter

By the early 1970s, high hopes around machine thinking collided with hard realities.

In 1973 James Lighthill told the British Science Research Council that early work had not met expectations. His report led to major cuts in funding for artificial intelligence research in the United Kingdom.

In 1974 the U.S. also scaled back support for undirected work after congressional scrutiny. The result was the first “AI winter” — a period when public faith, grants, and large projects all cooled.

  • Lighthill report: critical review that triggered UK funding cuts (1973)
  • US congressional review: pressure that reduced grants for undirected research (1974)
  • AI winter: period of reduced funding and slowed progress

  • Cause: researchers underestimated complexity; results lagged behind promises.
  • Effect: funding and public interest fell, slowing systems and language projects.
  • Legacy: work continued quietly and set foundations for later deep learning gains.

Expert Systems and Commercial Growth

A wave of practical systems in the 1980s turned academic ideas into business tools.

  • MYCIN: Stanford expert system for diagnosing bacterial infections and recommending antibiotics
  • XCON: Digital Equipment Corporation’s rule-based system for configuring computer orders (1980)
  • Fifth Generation Project: Japan’s $850M program (1981) to build logic-capable supercomputers
  • Commercial impact: expert systems drove new markets and large corporate investments in the 1980s

The Impact of MYCIN

MYCIN showed that a rule-based program could help doctors with diagnosis and treatment choices. It used a clear set of medical rules to recommend antibiotics based on patient data.
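
The flavor of such a system can be sketched as if-then rules fired over patient findings; the rules, findings, and drug names below are invented placeholders, not MYCIN’s actual knowledge base.

```python
# A toy rule-based recommender in the expert-system style: each rule pairs a
# condition over the findings with a suggested action. (Illustrative only.)
RULES = [
    (lambda f: f.get("gram_stain") == "negative" and f.get("morphology") == "rod",
     "organism may be a gram-negative rod; consider drug A"),
    (lambda f: f.get("site") == "blood" and f.get("fever") is True,
     "possible bacteremia; consider drug B"),
]

def recommend(findings: dict) -> list[str]:
    """Fire every rule whose condition matches the patient's findings."""
    return [advice for condition, advice in RULES if condition(findings)]

patient = {"gram_stain": "negative", "morphology": "rod", "site": "blood", "fever": True}
for advice in recommend(patient):
    print(advice)
```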

The success of systems like MYCIN and XCON convinced managers to fund deployments. Governments also joined in. Japan’s Fifth Generation Project pumped large sums into research and computer systems aimed at logical reasoning.

  • Breakthrough: expert systems simulated decision-making by encoding specialist rules.
  • Business result: the 1980s saw a boom in commercial adoption and investment.
  • Legacy: although deep learning later dominated, expert systems left durable tools and design lessons.

The Second AI Winter

A cascade of financial setbacks in 1987 cooled the fever around machine research. LISP‑hardware firms failed, stock prices dropped, and investors pulled back fast.

  • 1987 market crash: triggered funding cuts and shook investor confidence
  • LISP hardware failures: specialized computer vendors went bankrupt or pivoted
  • Press backlash: media criticism reduced public trust in the field’s promises

At the 1984 AAAI meeting, Roger Schank and Marvin Minsky warned that inflated promises would end badly. Their caution proved accurate as the 1990s brought a long period of waning public and private interest.

Researchers kept working, often under different labels and smaller grants. Labs shifted focus to core problems like language, data handling, and practical tools rather than grand claims.

  • The second AI winter shows how hype and slow returns can stall progress.
  • After a boom in expert systems, costs rose while benefits lagged.
  • Work from that era later fed the revival that led to modern deep learning.

The Machine Learning Renaissance

By the end of the 1990s, practical wins showed that data and math could reshape machine behavior.

  • Deep Blue (1997): IBM computer that defeated Garry Kasparov in a chess match
  • Dragon NaturallySpeaking (1997): commercial speech recognition software by Dragon Systems
  • Candide (IBM): statistical machine translation project that learned from data patterns

Deep Blue proved a machine could beat a world chess champion. That 1997 moment showed the power of focused algorithms and fast hardware.

At the same time, Dragon NaturallySpeaking made speech recognition practical for users. It handled fluent, continuous speech rather than requiring pauses between each word.

Researchers moved from rigid rule sets like older expert systems to methods that learn patterns in large data sets. IBM’s Candide highlighted this shift by using statistics for language translation.

  • Drivers: cheaper computing, vast data, and stronger math gave rise to modern machine learning.
  • Outcome: systems that learn from examples enabled practical tools in speech, translation, and more.

The renaissance reignited research and investment. It also laid the groundwork for the later rise of deep learning and many applications you use today.

The Deep Learning Breakthrough

The 2010s ushered in a clear turning point when multilayer networks learned complex patterns at scale.

  • Backpropagation (1986): algorithm formalized by Rumelhart, Hinton, and Williams that enabled weight updates in multilayer nets
  • LeCun’s ZIP code work: Yann LeCun’s team applied backpropagation to read handwritten ZIP codes, showing real gains
  • Transformer (2017): architecture that used attention and large datasets to scale generative models
  • GPT-3: large language model with 175 billion parameters that performed many tasks with minimal tuning

Backpropagation made deep networks trainable. It let layers adjust internal weights so models could learn complex mappings.

Backpropagation and Neural Networks

In 1986 Rumelhart, Hinton, and Williams formalized backpropagation for multilayer networks.

Later, Yann LeCun and colleagues used it in practical settings like ZIP code recognition. That work proved neural nets could solve real tasks on real computers.
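
A minimal sketch of the idea in NumPy appears below: a tiny two-layer network trained with backpropagation on XOR. The architecture, learning rate, and iteration count are arbitrary choices for illustration.

```python
import numpy as np

# Tiny two-layer network trained with backpropagation to learn XOR.
rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

for _ in range(10000):
    # Forward pass.
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # Backward pass: push the output error back through each layer.
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)
    # Gradient-descent weight updates.
    W2 -= 0.5 * h.T @ d_out
    b2 -= 0.5 * d_out.sum(axis=0)
    W1 -= 0.5 * X.T @ d_h
    b1 -= 0.5 * d_h.sum(axis=0)

print(out.round(2).ravel())  # typically close to [0, 1, 1, 0]; depends on the random seed
```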

The Role of Big Data

Big datasets and cheaper compute let models absorb vast amounts of information.

The transformer model of 2017 and systems like GPT-3 show how data plus architecture produce strong generative results.
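
At the core of the transformer is scaled dot-product attention. The sketch below shows only that kernel, with toy shapes and no learned projections or multiple heads.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Each query mixes the values, weighted by how well it matches each key."""
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # similarity of queries to keys
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the keys
    return weights @ V                               # weighted sum of the values

# Toy example: 3 tokens, each with a 4-dimensional representation.
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4))
print(scaled_dot_product_attention(x, x, x).shape)   # (3, 4)
```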

  • Outcome: deep learning eclipsed older expert systems for many tasks.
  • Impact: modern machine learning and language systems grew rapidly from this base.

The Modern Era of Generative AI

The 2020s opened a new chapter as transformer models turned raw text into coherent, creative output.

Large language models like ChatGPT display broad knowledge, sustained focus on a prompt, and creative flair. They help professionals write code, translate text, compose music, and generate images.

These systems scale quickly. Public releases and easy access drove massive investment and broad adoption across business and media.

  • Transformer: architecture that enabled scalable sequence modeling
  • ChatGPT: large model for conversational and generative tasks
  • Creative tools: applications for art, audio, and code generation

  • Impact: integration into software, translation, and creative fields.
  • Risk: ethical questions and governance debates grew louder.
  • Roots: advances rest on decades of machine learning and computer research.

We now face practical choices: how to use these tools well while managing their risks and measuring them against the Turing test and other standards of intelligence.

Conclusion

Looking back, the field’s long arc shows steady effort, setbacks, and fresh breakthroughs.

The history shows that our drive to build a thinking machine blends craft, math, and bold research. Early ideas led to rule‑based systems, then to data‑driven learning and modern language models.

Today, practical computer tools and large networks shape work, art, and daily life. We must pair innovation with ethics and clear governance so these systems help people fairly.

Understanding the past helps you see how future advances may unfold. Keep asking questions, support careful research, and use new tools with intent.

FAQ

Who coined the term "artificial intelligence" and when?

The phrase was coined by John McCarthy for the 1956 Dartmouth Workshop, where researchers gathered to explore whether machines could simulate aspects of human reasoning and learning.

What was the Dartmouth Workshop and why does it matter?

The Dartmouth Workshop was a foundational 1956 meeting that launched organized research into machine reasoning and learning, built around the idea that machines could perform tasks requiring humanlike thought.

How does the Turing Test relate to machine intelligence?

Proposed by Alan Turing in 1950, the test evaluates whether a machine’s conversational responses are indistinguishable from a human’s, offering a behavioral benchmark for evaluating machine intelligence.

What were early computing machines that influenced modern systems?

Pioneering devices like the Atanasoff-Berry Computer and later programmable electronic computers provided the mechanical and electronic foundations that made symbolic reasoning and machine learning possible.

What role did the Logic Theorist play in early research?

The Logic Theorist, developed by Newell and Simon in the 1950s, was one of the first programs to mimic human problem solving by proving theorems, showing that formal reasoning could be automated.

What caused the AI winters and how did research survive them?

Periods called “AI winters” followed unmet expectations and reduced funding. Research persisted through smaller academic projects, shifts to expert systems in industry, and gradual progress in algorithms and hardware.

How did expert systems shape commercial AI?

Expert systems like MYCIN demonstrated practical value by codifying domain knowledge for diagnosis and decision support, driving commercial interest and industry adoption in the 1970s and 1980s.

What triggered the modern machine learning and deep learning resurgence?

Advances in algorithms such as backpropagation, larger labeled datasets, and vastly improved computing power led to breakthroughs in neural networks and deep learning, enabling tasks like image and speech recognition at scale.

How does generative AI differ from earlier approaches?

Generative AI focuses on creating new content—text, images, audio—by learning patterns from large datasets. It builds on deep learning architectures and differs from symbolic systems that relied on explicit rules.
