571 999 090 8.00—15.30.

Doprava zdarma se Zásilkovnou nad 1 499 Kč

PPL Parcel Shop 54 Kč Balík do ruky 74 Kč Balíkovna 49 Kč PPL 99 Kč Zásilkovna 54 Kč

Kontakt

Jak nakupovat

Pomoc

Můj účet

▸ Prázdný :-(

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

Sebastian Bubeck

Nicolo Cesa-Bianchi

Jazyk

Angličtina

Kniha Brožovaná

Libristo kód: 04834934

Nakladatelství now publishers Inc, prosince 2012

A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem def... Celý popis

Libristo kód: 04834934

254 b

2 535 Kč

Skladem u dodavatele Odesíláme za 15-20 dnů

30 dní na vrácení zboží

Mohlo by vás také zajímat

Science of Cooking Stuart Farrimond

/ Pevná

560 Kč

Happiness 1 Shuzo Oshimi

/ Brožovaná

288 Kč

The Course of Love Alain de Botton

/ Brožovaná

254 Kč

Learn to Draw Manga Women

/ Brožovaná

412 Kč

Practical Veterinary Dental Radiography Brook A. Niemiec

/ Pevná

3 144 Kč

Colour Quest (R) Cityscapes John Woodcock

/ Brožovaná

357 Kč

Advanced Nutrition and Human Metabolism Sareen Gropper

/ Pevná

2 973 Kč

Dog Behaviour, Evolution, and Cognition Adam Miklosi

/ Brožovaná

1 689 Kč

Clean C++20 Stephan Roth

/ Brožovaná

1 036 Kč

Japanese Sake Bible Takashi Eguchi

/ Brožovaná

405 Kč

Chicken Soup for the Soul: Be You

/ Brožovaná

301 Kč

Albert Camus: A Life Olivier Todd

/ Brožovaná

592 Kč

Guess What! Level 4 Flashcards Spanish Edition REED SUSANNAH

/ Karty

1 153 Kč

Reverie: The Art of Sibylline Meynet

/ Pevná

1 003 Kč

Design Principles for Photography Jeremy Webb

/ Brožovaná

1 162 Kč

Battles of Tolkien David Day

/ Brožovaná

385 Kč

A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maximize the total payoff obtained in a sequence of allocations. The name bandit refers to the colloquial term for a slot machine (a "one-armed bandit" in American slang). In a casino, a sequential allocation problem is obtained when the player is facing many slot machines at once (a "multi-armed bandit"), and must repeatedly choose where to insert the next coin. Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that might give higher payoffs in the future. Although the study of bandit problems dates back to the 1930s, exploration-exploitation trade-offs arise in several modern applications, such as ad placement, website optimization, and packet routing. Mathematically, a multi-armed bandit is defined by the payoff process associated with each option. In this book, the focus is on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it also analyzes some of the most important variants and extensions, such as the contextual bandit model. This monograph is an ideal reference for students and researchers with an interest in bandit problems.

Informace o knize

Plný název Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

Autor Sebastian Bubeck, Nicolo Cesa-Bianchi

Jazyk

Angličtina

Vazba Kniha - Brožovaná

Datum vydání 2012

Počet stran 138

EAN 9781601986269

ISBN 1601986262

Libristo kód 04834934

Nakladatelství now publishers Inc

Váha 208

Rozměry 234 x 159 x 8

Kategorie

Výpočetní a informační technologie > Informatika > Teorie algoritmů

Darujte tuto knihu ještě dnes

Je to snadné

1 Přidejte knihu do košíku a zvolte doručit jako dárek 2 Obratem vám zašleme poukaz 3 Kniha dorazí na adresu obdarovaného

Doručení

Nákupní rádce