D-VisionDraughts: uma rede neural jogadora de damas que aprende por reforço em um ambiente
de computação distribuída

Barcelos, Ayres Roberto Araújo

Please use this identifier to cite or link to this item: https://repositorio.ufu.br/handle/123456789/12509

Full metadata record

DC Field	Value	Language
dc.creator	Barcelos, Ayres Roberto Araújo
dc.date.accessioned	2016-06-22T18:32:20Z	-
dc.date.available	2011-10-03
dc.date.available	2016-06-22T18:32:20Z	-
dc.date.issued	2011-02-23
dc.identifier.citation	BARCELOS, Ayres Roberto Araújo. D-VisionDraughts: uma rede neural jogadora de damas que aprende por reforço em um ambiente de computação distribuída. 2011. 139 f. Dissertação (Mestrado em Ciências Exatas e da Terra) - Universidade Federal de Uberlândia, Uberlândia, 2011.	por
dc.identifier.uri	https://repositorio.ufu.br/handle/123456789/12509	-
dc.description.abstract	The objetive of this work is to propose a draughts learning system, the D-VisionDraughts (Distributed VisionDraughts): a distributed draughts player agent based on neural networks that learns by reinforcement. The D-VisionDraughts is trained in a distributed processing environment in order to achieve a high level of play without expert game analysis and with minimal human intervention as possible (distinctly from the world draughts champion Chinook). The D-VisionDraughts corresponds to a distributed version of the eficient VisionDraughts player, where the latter corresponds to a MLP (multilayer perceptron) neural network that learns by means of temporal diferences. The role of the neural network is to evaluate how much a board state is favorable to the agent (prediction). This value will lead the search module to determine the best action (in this case, the best move) of the current board state of the game. Another factor that has an important impact on the search eciency, which is analyzed in this work, is the degree of ordering of the game tree. Thus, the main contributions of this work are: the replacement of the serial algorithm used in VisionDraughts, the minimax with alpha-beta pruning, by the distributed algorithm Young Brothers Wait Concept (YBWC); the use of heuristics for game tree ordering, that is essential for the proper performance of YBWC and alpha-beta pruning in general; the impact analysis of the high-performance processing environment on the unsupervised learning skills of the player. This work shows that with the techniques used, the time required to perform a game tree search was signicantly reduced and through tournaments played with VisionDraughts the overall performance of the distributed agent is improved.	eng
dc.format	application/pdf	por
dc.language	por	por
dc.publisher	Universidade Federal de Uberlândia	por
dc.rights	Acesso Aberto	por
dc.subject	Damas	por
dc.subject	Aprendizagem de máquina	por
dc.subject	Aprendizagem por reforço	por
dc.subject	Aprendizagem por diferenças temporais	por
dc.subject	Redes neurais articiais	por
dc.subject	Poda alpha-beta	por
dc.subject	Tabelas de transposição	por
dc.subject	Aprofundamento iterativo	por
dc.subject	Busca paralela	por
dc.subject	Draughts	eng
dc.subject	Machine learning	eng
dc.subject	Reinforcement learning	eng
dc.subject	Temporal differences learning	eng
dc.subject	Articial neural network	eng
dc.subject	Alpha-beta pruning	eng
dc.subject	Transposition table	eng
dc.subject	Iterative deepening	eng
dc.subject	Parallel search	eng
dc.title	D-VisionDraughts: uma rede neural jogadora de damas que aprende por reforço em um ambiente de computação distribuída	por
dc.type	Dissertação	por
dc.contributor.advisor1	Julia, Rita Maria da Silva
dc.contributor.advisor1Lattes	http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4788590Z8	por
dc.contributor.referee1	Matias Júnior, Rivalino
dc.contributor.referee1Lattes	http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4792617U6	por
dc.contributor.referee2	Bazzan, Ana Lucia Cetertich
dc.contributor.referee2Lattes	http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4723207J7	por
dc.creator.Lattes	http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4298322P6	por
dc.description.degreename	Mestre em Ciência da Computação	por
dc.description.resumo	O objetivo deste trabalho é propor um sistema de aprendizagem de damas, o DVisionDraughts (Distributed VisionDraughts): um agente distribuído jogador de damas baseado em redes neurais que aprende por reforço. O D-VisionDraughts é treinado em um ambiente de processamento distribuído de modo a alcançar um alto nível de jogo sem a análise de especialistas e com o mínimo de intervenção humana possível (diferentemente do agente campeão do mundo de damas Chinook). O D-VisionDraughts corresponde a uma versão distribuída do eciente jogador VisionDraughts, onde este último corresponde à uma rede neural MLP (multilayer perceptron) que aprende pelo método das diferenças temporais. O papel da rede neural é avaliar o quanto um estado de tabuleiro é favorável ao agente (valor de predição). Este valor irá guiar o módulo de busca na procura pela melhor ação (neste caso, o melhor movimento) correspondente ao estado de tabuleiro corrente do jogo. Outro fator que é importante na eciência da busca, e que foi analisado neste trabalho, é o grau de ordenação da árvore de jogo. Desta forma, as principais contribuições deste trabalho consistem em: substituir o algoritmo serial utilizado para a busca em árvore de jogos do VisionDraughts, o minimax com poda alpha-beta, pelo algoritmo distribuído Young Brothers Wait Concept (YBWC); o uso de heurísticas para ordenação da árvore de jogos, que é essencial para o bom desempenho do YBWC e da poda alpha-beta em geral; a análise do impacto do ambiente de processamento distribuído nas habilidades de aprendizado não supervisionadas do jogador. Este trabalho mostra que, com a aplicação das técnicas no D-VisionDraughts, reduzimos expressivamente o tempo necessário para a etapa de busca e, através de torneios realizados com o VisionDraughts, o desempenho geral do agente distribuído foi nitidamente melhor.	por
dc.publisher.country	BR	por
dc.publisher.program	Programa de Pós-graduação em Ciência da Computação	por
dc.subject.cnpq	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO	por
dc.publisher.department	Ciências Exatas e da Terra	por
dc.publisher.initials	UFU	por
dc.orcid.putcode	81752984	-
Appears in Collections:	DISSERTAÇÃO - Ciência da Computação

Files in This Item:

File	Description	Size	Format
Diss Ayres.pdf		3.47 MB	Adobe PDF	View/Open

Show simple item record