python - 我的井字游戏告诉我平局时我赢了-6ren

python - 我的井字游戏告诉我平局时我赢了

转载作者：行者123 更新时间：2023-12-05 07:00:01

理论上，我的 minimax 算法应该可以正常工作。我找不到任何明显的问题。但是......每当我将它作为“O”运行时，我的算法都会说玩家将获胜。然而，井字棋是一种抽签游戏。我很困惑!

import time
positionsChecked = 0
maxDepthChecked = 0
drawn = 0
wonForAI = 0
wonForPlayer = 0

def areMovesLeft(board):
    for i in board:
        for j in i:
            if j == '-':
                return True
    return False


def evaluate(board, humanSprite, AISprite):
    movesLeft = areMovesLeft(board)
    if not movesLeft:
        return 0
    for i in range(3):
        if board[i][0] == board[i][1] and board[i][1] == board[i][2]:
            if board[i][0] == humanSprite:
                return -10
            elif board[i][0] == AISprite:
                return 10
        if board[0][i] == board[1][i] and board[1][i] == board[2][i]:
            if board[0][i] == humanSprite:
                return -10
            elif board[0][i] == AISprite:
                return 10
    if board[0][0] == board[1][1] and board[1][1] == board[2][2]:
        if board[0][0] == humanSprite:
            return -10
        elif board[0][0] == AISprite:
            return 10
    if board[0][2] == board[1][1] and board[1][1] == board[2][0]:
        if board[0][2] == humanSprite:
            return -10
        elif board[0][2] == AISprite:
            return 10
    return 0


def minimax(board, isMaxPlayer, depth, humanSprite, AISprite, alpha, beta):
    global positionsChecked, maxDepthChecked, drawn, wonForAI, wonForPlayer
    best = 0
    if depth > maxDepthChecked:
        maxDepthChecked = depth
    movesLeft = areMovesLeft(board)
    if not movesLeft:
        positionsChecked += 1
        return 0
    score = evaluate(board, humanSprite, AISprite)

    if score == 10:
        return score - depth
        positionsChecked += 1
    if score == -10:
        positionsChecked +=1
        return score + depth
    for i in range(3):
        for j in range(3):
            if board[i][j] == '-':
                movesLeft = True
    if isMaxPlayer:
        best = -1000
        for i in range(3):
            for j in range(3):
                if board[i][j] == '-':
                    board[i][j] = AISprite
                    var2 = minimax(board, not isMaxPlayer, depth + 1, humanSprite, AISprite, alpha, beta)
                    best = max(best, var2)
                    board[i][j] = '-'
                    alpha = max(alpha, best)
                    if alpha <= beta:
                        break
                        break


        return best
    else:
        best = 1000
        for i in range(3):
            for j in range(3):
                if board[i][j] == '-':
                    board[i][j] = humanSprite
                    var2 = minimax(board, not isMaxPlayer, depth + 1, humanSprite, AISprite, alpha, beta)
                    best = min(best, var2)
                    board[i][j] = '-'
                    beta = min(beta, best)
                    if beta <= alpha:
                        break
                        break

        print(best)
        if best > 0:
            wonForAI += 1
        elif best == 0:
            drawn +=1
        elif best < 0:
            wonForPlayer += 1
        return best


def calcBestMove(board, humanSprite, AISprite):
    global positionsChecked
    global maxDepthChecked
    maxDepthChecked = 0
    positionsChecked = 0
    print("Best move being calculated...")
    timeBeforeMove = time.time()  # I am using this to calculate how long the program takes to find the best move
    bestValue = -1000
    for i in range(3):
        for j in range(3):
            if board[i][j] == '-':
                board[i][j] = AISprite
                moveValue = minimax(board, False, 1, humanSprite, AISprite, -1000, 1000)
                board[i][j] = '-'
                if moveValue > bestValue:
                    bestValue = moveValue
                    bestRow = i
                    bestCol = j
    print(f"\nThe value of the best move is {bestValue}. The best move is ({bestRow + 1}, {bestCol + 1})")
    timeAfterMove = time.time()
    timeTaken = timeAfterMove - timeBeforeMove
    print(f"The time it took the AI to find the best move is {timeTaken} seconds.")
    print(f"The AI searched {positionsChecked} positions.")
    print(f"Deepest search is {maxDepthChecked}")
    print(f"Drawn is {drawn} and won for player is {wonForPlayer} and wonfor ai is {wonForAI}")
    board[bestRow][bestCol] = AISprite


def printBoard(board):  # this prints the board
    print("         |         |         ")
    print(f"    {board[0][0]}    |    {board[0][1]}    |    {board[0][2]}    ")
    print("         |         |         ")
    print("---------|---------|---------")
    print("         |         |         ")
    print(f"    {board[1][0]}    |    {board[1][1]}    |    {board[1][2]}    ")
    print("         |         |         ")
    print("---------|---------|---------")
    print("         |         |         ")
    print(f"    {board[2][0]}    |    {board[2][1]}    |    {board[2][2]}    ")
    print("         |         |         ")


def switchTurns(humanSprite, AISprite, isHumanTurn, board):  # controls the flow of the game
    printBoard(board)

    if isHumanTurn:
        while True:
            row = input("Enter row: ")
            column = input("Enter column: ")
            if (row == '1' or row == '2' or row == '3') and (column == '1' or column == '2' or column == '3'):
                row = int(row)-1
                column = int(column)-1
                if 0 <= row <= 2 and 0 <= column <= 2 and board[row][column] == '-':
                    board[row][column] = humanSprite
                    break
                else:
                    if not (0 <= row <= 2 and 0 <= column <= 2):
                        print("Out of range!")
                    else:
                        print("Box already filled")
            else:
                print("Not valid values!")
    else:
        calcBestMove(board, humanSprite, AISprite)
    gameStatus = evaluate(board, humanSprite, AISprite)

    if gameStatus == 0:
        movesLeft = False
        for i in range(3):
            for j in range(3):
                if board[i][j] == '-':
                    movesLeft = True
        if not movesLeft:
            print("It's a tie!")
        else:
            switchTurns(humanSprite, AISprite, not isHumanTurn, board)
    else:
        printBoard(board)
        if gameStatus == -10:
            print("Human wins!")
        elif gameStatus == 10:
            print("AI wins!")
        return


def chooseFirstOrSecond():
    playerGoesFirst = input("DO you want to be first or second? (f/s) ")
    if playerGoesFirst == 'f' or playerGoesFirst == 's':
        if playerGoesFirst == 'f':
            switchTurns('X', 'O', True, [['-', '-', '-'], ['-', '-', '-'], ['-', '-', '-']])
        if playerGoesFirst == 's':
            switchTurns('O', 'X', False, [['-', '-', '-'], ['-', '-', '-'], ['-', '-', '-']])
    else:
        print("Invalid!")
        chooseFirstOrSecond()


chooseFirstOrSecond()

举个我玩过的游戏的例子:

DO you want to be first or second? (f/s) s
         |         |         
    -    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
Best move being calculated...

The value of the best move is -4. The best move is (1, 1)
The time it took the AI to find the best move is 1.8505730628967285 seconds.
The AI searched 59492 positions.
Deepest search is 9
Drawn is 17158 and won for player is 20451 and wonfor ai is 2820
         |         |         
    X    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
Enter row: 1
Enter column: 3
         |         |         
    X    |    -    |    O    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
Best move being calculated...

The value of the best move is -4. The best move is (2, 1)
The time it took the AI to find the best move is 0.04171943664550781 seconds.
The AI searched 1322 positions.
Deepest search is 7
Drawn is 17529 and won for player is 20903 and wonfor ai is 2862
         |         |         
    X    |    -    |    O    
         |         |         
---------|---------|---------
         |         |         
    X    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    -    |    -    |    -    
         |         |         
Enter row: 3
Enter column: 1
         |         |         
    X    |    -    |    O    
         |         |         
---------|---------|---------
         |         |         
    X    |    -    |    -    
         |         |         
---------|---------|---------
         |         |         
    O    |    -    |    -    
         |         |         
Best move being calculated...

The value of the best move is -6. The best move is (2, 2)
The time it took the AI to find the best move is 0.0019884109497070312 seconds.
The AI searched 66 positions.
Deepest search is 5
Drawn is 17531 and won for player is 20939 and wonfor ai is 2862
         |         |         
    X    |    -    |    O    
         |         |         
---------|---------|---------
         |         |         
    X    |    X    |    -    
         |         |         
---------|---------|---------
         |         |         
    O    |    -    |    -    
         |         |         
Enter row: 2
Enter column: 3
         |         |         
    X    |    -    |    O    
         |         |         
---------|---------|---------
         |         |         
    X    |    X    |    O    
         |         |         
---------|---------|---------
         |         |         
    O    |    -    |    -    
         |         |         
Best move being calculated...

The value of the best move is 9. The best move is (3, 3)
The time it took the AI to find the best move is 0.0 seconds.
The AI searched 4 positions.
Deepest search is 3
Drawn is 17531 and won for player is 20941 and wonfor ai is 2862
         |         |         
    X    |    -    |    O    
         |         |         
---------|---------|---------
         |         |         
    X    |    X    |    O    
         |         |         
---------|---------|---------
         |         |         
    O    |    -    |    X    
         |         |         
AI wins!

(0 以上是 AI 胜利，0 以下是玩家胜利。)你可以清楚地看到，它说我一直赢到最后一个回合......我不知道为什么...

最佳答案

我认为您在 alpha-beta 修剪中翻转了比较运算符。具体来说，在 if isMaxPlayer: 分支中，

if alpha <= beta:
    break

应该是

if alpha >= beta:
    break

关于python - 我的井字游戏告诉我平局时我赢了，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/64226267/

文章推荐： python-3.x - 为 CLI 脚本构建 python 包

文章推荐： Angular FormArray 无效不会将表单标记为无效

文章推荐： algorithm - 解决递归方法

文章推荐： python - 使用 Numba 进行调试

java - 井字棋游戏预测平局(平局)
我正在为我的作业制作井字游戏，老师告诉我在只能打印领带的情况下打印领带。 (假设玩家不聪明)例如， x o x- - -o x o can only result in a tie so when t
math - 剪刀石头布。使用数学确定输赢/平局？
所以当我开始编写这个函数时，我正在编写一个石头剪刀布游戏: a是玩家自己的举动，b是玩家二的举动。我需要弄清楚的是，第一个玩家是赢了、输了还是平了。 //rock=0, paper=1, scisso
mysql - 我如何选择每支球队的最后五场比赛并确定胜/负/平局？
我知道用 PHP 可以做到这一点，但是有没有办法只用 MySQL 来做到这一点？我有这个数据库: --------------------------------------------------

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 我的井字游戏告诉我平局时我赢了