Search ThaiLIS Digital Collection 2019 x

แจ้งเอกสารไม่ครบถ้วน, ไม่ตรงกับชื่อเรื่อง หรือมีข้อผิดพลาดเกี่ยวกับเอกสาร ติดต่อที่นี่ ==>
หากไม่มีอีเมลผู้รับให้กรอก thailis-noc@uni.net.th

Seksan  Kiatsupaibul.  Algorithm comparison between Thompson sampling and upper confidence bound for sequential decision problem in the game of Rock-Paper-Scissors.  ().  King Mongkut's University of Technology North Bangkok. Central Library. : , 2022.

Title

Algorithm comparison between Thompson sampling and upper confidence bound for sequential decision problem in the game of Rock-Paper-Scissors

Creator

Name: Thanyavuth Akarasomcheep

Organization : Chulalongkorn University. Faculty of Commerce and Accountancy

Email : thanyavuth@outlook.com

Creator

Name: Seksan Kiatsupaibul

Organization : Chulalongkorn University. Faculty of Commerce and Accountancy

Email : seksan@cbs.chula.ac.th

Subject

keyword: Thompson Sampling

ThaSH: Reinforcement learning

; Upper Confidence Bound

ThaSH: Markov processes

ThaSH: Decision support systems

Description

Abstract: The Thompson sampling algorithm and the upper confidence bound algorithm are known to be two efficient algorithms for solving multi-armed bandit problems and some Markov decision processes. However, it is unclear how well those two algorithms perform when encountering human decisions that involve behavioral components. This study set up a simple human decision problem in the form of a series of simulated rock-paper-scissors games. Two human behavioral traits, a mixed clockwise strategy and a stop loss strategy, are simulated in the games. Two reinforcement learning agents, one with the Thompson sampling algorithm and another one with the upper confidence bound algorithm, are designed by modeling the behavioral problem as a Markov decision process. Under different parameter settings, the performances of the two agents solving the problem are compared, taking the cumulative reward as the performance measure. We find that the upper confidence bound agent outperforms the Thompson sampling agent in most of the experimental settings. The exception where the Thompson sampling outperforms the upper confidence bound is when there exists a strong behavioral pattern in the case of a long decision horizon.

Publisher

King Mongkut's University of Technology North Bangkok. Central Library

Address: BANGKOK

Email: library@kmutnb.ac.th

Date

Created: 2022

Modified: 2025-09-17

Issued: 2025-09-17

Type

บทความ/Article

Format

application/pdf

Identifier

BibliograpyCitation : In King Mongkut's University of Technology North Bangkok Faculty of Applied Science, Thai Statistical Association (TSA) and Statistics Cooperative Research Network (Statistics CRN). The Proceeding of International Conference on Applied Statistics (ICAS 2022) (pp.208-213). Bangkok : King Mongkut's University of Technology North Bangkok

Language

eng

Rights

RightsAccess:

ลำดับที่.	ชื่อแฟ้มข้อมูล	ขนาดแฟ้มข้อมูล	จำนวนเข้าถึง	วัน-เวลาเข้าถึงล่าสุด
1	ICAS 2022pp.208-213.pdf	1.41 MB

ใช้เวลา

0.024169 วินาที

Creator : Thanyavuth Akarasomcheep

Title	Contributor	Type
Algorithm comparison between Thompson sampling and upper confidence bound for sequential decision problem in the game of Rock-Paper-Scissors มหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ Thanyavuth Akarasomcheep;Seksan Kiatsupaibul		บทความ/Article

Creator : Seksan Kiatsupaibul

Title	Contributor	Type
An application of reinforcement learning to credit scoring based on the Logistic Bandit framework มหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ Kantapong Visantavarakul;Seksan Kiatsupaibul		บทความ/Article
An efficiency comparison of distance measures in K-Nearest Neighbors imputation for spatial data มหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ Prawit Banjong;Seksan Kiatsupaibul		บทความ/Article
Algorithm comparison between Thompson sampling and upper confidence bound for sequential decision problem in the game of Rock-Paper-Scissors มหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ Thanyavuth Akarasomcheep;Seksan Kiatsupaibul		บทความ/Article

You can access to TDC Database at URL http://www.thailis.or.th/tdc/ or http://dcms.thailis.or.th/tdc/ or http://tdc.thailis.or.th/tdc/

ThaiLIS is Thailand Library Integrated System
สนับสนุนโดย สำนักงานบริหารเทคโนโลยีสารสนเทศเพื่อพัฒนาการศึกษา
กระทรวงการอุดมศึกษา วิทยาศาสตร์ วิจัยและนวัตกรรม
328 ถ.ศรีอยุธยา แขวง ทุ่งพญาไท เขต ราชเทวี กรุงเทพ 10400 โทร. โทร. 02-232-4000

กำลัง ออน์ไลน์
ภายในเครือข่าย ThaiLIS จำนวน 0
ภายนอกเครือข่าย ThaiLIS จำนวน 1,813
รวม 1,813 คน

More info..

นอก ThaiLIS = 38,743 ครั้ง
มหาวิทยาลัยสังกัดทบวงเดิม = 31 ครั้ง
มหาวิทยาลัยราชภัฏ = 1 ครั้ง
หน่วยงานอื่น = 1 ครั้ง
รวม 38,776 ครั้ง

Database server :
Version 2.5 Last update 1-06-2018
Power By SUSE PHP MySQL IndexData Mambo Bootstrap
มีปัญหาในการใช้งานติดต่อผ่านระบบ UniNetHelp

Server : 8.199.134
Client : Not ThaiLIS Member
From IP : 216.73.216.60