next up previous
Next: Conclusions Up: Rational and Bounded-Rational Players Previous: The Amoeba algorithm

Applying Amoeba to Increase Profit

In the model delineated in this paper, the producer faces a profit maximization problem with two independent variables: the normalized subscription fee f and the price-per-item tex2html_wrap_inline1038 . For this problem, the amoeba employs a two dimensional simplex (i.e., a triangle) to search in the profit landscape. In the scenario in which the consumers know their valuations and the producer knows both g and h, the producer can compute the profit landscape and use amoeba offline to compute the optimal f and tex2html_wrap_inline1038 to several digits of accuracy within a few dozen iterations. This takes little more than a second on a modest processor. It then sets f and tex2html_wrap_inline1038 to their computed values. If the consumers know their valuations, but the producer does not know g and h, the producer can still use amoeba, but it must do so in an online configuration. In this case, each iteration of the amoeba samples the profit landscape by experimentation. Thus each step is slow and costly, and the producer has to expect non-optimal profit during the exploration phase of the algorithm. Nonetheless, after a few dozen iterations, the producer will have found near-optimal settings of f and tex2html_wrap_inline1038 , and will enjoy the same steady-state profits as a producer that was fully informed from the beginning.

The situation becomes more interesting when the producer uses amoeba in a market in which the consumers learn their own valuations by sampling items and updating their estimates via Eq. 14. The producer's knowledge of g and h is now immaterial: its profit landscape is determined by the consumers' beliefs about their valuations. As has been seen, the distribution of estimated tex2html_wrap_inline1520 can diverge substantially from tex2html_wrap_inline1370 , resulting in a profit landscape that shifts dramatically over time. Can the amoeba algorithm cope with a dynamic landscape?

Fig. 6a illustrates what happens when the amoeba algorithm is started with very small, low-profit values of f and tex2html_wrap_inline1038 . Quickly, it finds profitable values for these parameters. Having reached a high-profit region, the amoeba begins to contract its simplex. However, once the price parameters stabilize, consumer leakage begins to set in, and the amoeba is unable to escape from an ever-deepening trough similar in nature to that of Fig. 5c. Amoeba falls into this trap because it never bothers to resample vertices that it has already evaluated. In particularly, it fails to consider that the profit obtained at the best vertex in its simplex may change over time -- it (falsely) believes that the best vertex is an ever-sharper needle sitting in a ever-deepening trough in the profit landscape. Once the amoeba finds a high-profit region, it is compelled to shrink its simplex, and it is completely blind to the ever-diminishing profits that occur throughout that simplex.

 
Figure 6: (a)Profit tex2html_wrap_inline1042 (solid line; normalized to ``ideal'' value of 0.41367) and proportion of subscribed consumers m (dashed line) vs. time (in subscription periods) and (b) f (solid) and tex2html_wrap_inline1038 (dashed) vs. time when the producer uses amoeba for online learning and the consumers estimate their tex2html_wrap_inline1056 values with flightiness parameter tex2html_wrap_inline1058 . The horizontal dashed lines indicate the optimal f and tex2html_wrap_inline1038 values for fully informed consumers. As before, the market consists of M=10000 consumers and one seller offering N=10 articles per subscription period. For a simplex in two dimensions (a triangle), we have used the standard coefficients tex2html_wrap_inline1128 for the amoeba algorithm. In this and all subsequent experiments involving the amoeba algorithm, the initial simplex consists of the following tex2html_wrap_inline1130 values: (0.10, 0.10), (0.10, 0.15), and (0.15, 0.10).  

The amoeba algorithm can be improved considerably for this application by having it occasionally resample previously sampled points and by conditionally expanding its simplex from time to time. Specifically, we propose a modified amoeba that is more suitable for dynamic landscapes by replacing Step 5 (Shrink Simplex) in the amoeba algorithm as follows:

5. Resample best vertex. Evaluate tex2html_wrap_inline1802 at tex2html_wrap_inline1760 . Denote the new evaluate as tex2html_wrap_inline1874 .
If tex2html_wrap_inline1876 then expand simplex, by computing the n new vertices for tex2html_wrap_inline1806 :

equation566

Else, (i.e., tex2html_wrap_inline1882 ) shrink simplex, and compute the n new vertices for tex2html_wrap_inline1806 :

equation531

Fig. 7 illustrates that the modified amoeba algorithm avoids long-term profit and consumer leakage. Leakage is thwarted because the modified amoeba briefly lowers its prices to recapture disenfranchised consumers, and then raises them again to enjoy increased profits from consumers who have learned again to appreciate the value of the producer's wares. In steady state, the average profit (computed between iterations 50 and 200 to avoid transient effects) is 0.31683, or 0.766 of the ideal profit. This is approximately a 12% improvement over the highest profit that can be sustained with fixed prices, which was tex2html_wrap_inline1888 .

Fig. 8 illustrates how the distribution of tex2html_wrap_inline1520 responds to lowering of prices by the modified amoeba. In iteration 40, the producer makes a profit tex2html_wrap_inline1892 by setting tex2html_wrap_inline1894 . In Fig. 8a, less than 65% of the consumer population subscribes -- the ones whose estimates tex2html_wrap_inline1520 are below the critical threshold for those price parameters, tex2html_wrap_inline1898 .

The situation changes dramatically in iteration 41 when the producer sets tex2html_wrap_inline1900 . While the profit decreases temporarily ( tex2html_wrap_inline1902 ), all consumers are now encouraged to subscribe because the critical threshold has risen to tex2html_wrap_inline1904 . The consumers re-estimate their tex2html_wrap_inline1056 values, and the overall distribution now shifts to that illustrated in Fig. 8b. This sets the stage for the producer to obtain profit tex2html_wrap_inline1908 in the next four time steps by increasing f (but reducing tex2html_wrap_inline1038 ).

Fig. 9 depicts the corresponding profit landscapes at iterations a) 40 and b) 41 conditioned on the distribution of tex2html_wrap_inline1520 . While the profit landscape still shifts continually, the modified amoeba manages to set prices in such a way as to avoid the formation of long-lived troughs. Additionally, Fig. 9c, which shows the profit landscape at iteration 200, indicates that the algorithm can be be successful for long periods of time. Ironically, it is the modified amoeba's recognition that the landscape may be dynamic that helps stabilize that landscape sufficiently to yield long-term profits.

 
Figure: (a)Profit tex2html_wrap_inline1042 (solid line; normalized to ``ideal'' value of 0.41367) and proportion of subscribed consumers m (dashed line) vs. time (in subscription periods) and (b) f (solid) and tex2html_wrap_inline1038 (dashed) vs. time when the producer uses the modified amoeba algorithm for online learning. The parameter settings remain unchanged from Fig. 6.  

 
Figure 8: Estimated tex2html_wrap_inline1056 distributions at (a) time 40 and (b) time 41 (in subscription periods) when the producer uses modified amoeba for online learning. The dashed line represents the initial distribution of tex2html_wrap_inline1520 .  

 
Figure: Profit landscape showing the mean profit per article per consumer as a function of f and tex2html_wrap_inline1038 conditioned on the distribution of tex2html_wrap_inline1520 a) at iteration 40, b) at iteration 41, and c) at iteration 200 when the producer uses modified amoeba for online learning. The parameter settings remain unchanged from Fig. 6.  


next up previous
Next: Conclusions Up: Rational and Bounded-Rational Players Previous: The Amoeba algorithm

kephart
Thu Nov 18 11:46:57 EST 1999