Dialogue-based test of intelligence

Turing proposed that a human evaluator would judge natural language conversations between a human and a machine. The evaluator would be aware that one of the two partners in conversation was a machine. The conversation would be limited to a text-only channel, such as a computer keyboard and screen, so the result would not depend on the machine’s ability to render words as speech. If the evaluator could not reliably tell the machine from the human, the machine would be said to have passed the test

LLMs are one of the first breakthrough technologies to have passed this test by the traditional definition

Semantics

Are the things the machine refers to the same things we refer to? e.g. when talking about steak, are we talking about the same steak?

The machine doesn’t have any real input into the world, no sense organs. It is true that the machine can discourse beautifully about, say, the scenery in New England. But it could not recognize an apple tree or an apple, a mountain or a cow, a field or a steeple, if it were in front of one