Codeforces 148D Bag of mice (概率dp)

177 篇文章 0 订阅
19 篇文章 0 订阅


D. Bag of mice
time limit per test:2 seconds
memory limit per test:256 megabytes

The dragon and the princess are arguing about what to do on the New Year's Eve. The dragon suggests flying to the mountains to watch fairies dancing in the moonlight, while the princess thinks they should just go to bed early. They are desperate to come to an amicable agreement, so they decide to leave this up to chance.

They take turns drawing a mouse from a bag which initially containsw white andb black mice. The person who is the first to draw a white mouse wins. After each mouse drawn by the dragon the rest of mice in the bag panic, and one of them jumps out of the bag itself (the princess draws her mice carefully and doesn't scare other mice).Princess draws first. What is the probability of the princess winning?

If there are no more mice in the bag and nobody has drawn a white mouse, the dragon wins. Mice which jump out of the bag themselves are not considered to be drawn (do not define the winner). Once a mouse has left the bag, it never returns to it. Every mouse is drawn from the bag with the same probability as every other one, and every mouse jumps out of the bag with the same probability as every other one.

Input

The only line of input data contains two integersw andb (0 ≤ w, b ≤ 1000).

Output

Output the probability of the princess winning. The answer is considered to be correct if its absolute or relative error does not exceed10 - 9.

Sample test(s)
Input
1 3
Output
0.500000000
Input
5 5
Output
0.658730159
Note

Let's go through the first sample. The probability of the princess drawing a white mouse on her first turn and winning right away is 1/4. The probability of the dragon drawing a black mouse and not winning on his first turn is 3/4 * 2/3 = 1/2. After this there are two mice left in the bag — one black and one white; one of them jumps out, and the other is drawn by the princess on her second turn. If the princess' mouse is white, she wins (probability is 1/2 * 1/2 = 1/4), otherwise nobody gets the white mouse, so according to the rule the dragon wins.


题目链接:http://codeforces.com/contest/148/problem/d


题目大意:公主和龙抓老鼠,老鼠有w只白的,b只黑的,先抓到白老鼠的赢,龙每次抓完一只老鼠后会多跑出去一只老鼠,一旦没有老鼠或者没人抓到白老鼠,则算公主输,公主和龙抓的老鼠是黑或白是等可能的,龙抓完跑掉的老鼠是黑或白也是等可能的,现在公主先抓,要求公主赢的概率


题目分析:比较好想的一道概率dp,dp[i][j]表示还剩i只白老鼠和j只黑老鼠时公主赢的概率,这里只需要推公主赢的情况即可,显然有三种情况公主会赢或者还有赢的可能

注意每一轮是按顺序执行的,即公主先抓,龙再抓,老鼠再跑,注意每次还剩多少

第一:公主抓到白老鼠  dp[i][j] += i / (i + j)

第二:公主和龙都抓到黑老鼠,跑出去的是黑老鼠  dp[i][j] = ( j / (i + j) ) * ( (j - 1) / (i + j - 1) * (j - 2) / (i + j - 2) ) * dp[i][j - 3] (减3是因为这种情况下需要至少三只黑老鼠)

第三:公主和龙都抓到黑老鼠,跑出去的是白老鼠  dp[i][j] = ( j / (i + j) ) * ( (j - 1) / (i + j - 1) * i / (i + j - 2) ) * dp[i - 1][j - 2]

注意初始化,没有白老鼠,dp[0][i] = 0,没有黑老鼠dp[i][0] = 1,一只老鼠都没有dp[0][0] = 0

最后答案是dp[w][b]


#include <cstdio>
#include <cstring>
int const MAX = 1005;
double dp[MAX][MAX];

int main()
{
    int w, b;
    memset(dp, 0, sizeof(dp));
    scanf("%d %d", &w, &b);
    for(int i = 0; i <= b; i ++)
        dp[0][i] = 0;
    for(int i = 1; i <= w; i++)
        dp[i][0] = 1.0;
    for(int i = 1; i <= w; i++)
    {
        for(int j = 1; j <= b; j++)
        {
            dp[i][j] += (double) i / (i + j);
            if(j >= 2)
                dp[i][j] += (double) j / (i + j) * (double) (j - 1) / (i + j - 1) * (double) i / (i + j - 2) * dp[i - 1][j - 2];
            if(j >= 3)
                dp[i][j] += (double) j / (i + j) * (double) (j - 1) / (i + j - 1) * (double) (j - 2) / (i + j - 2) * dp[i][j - 3];
        }
    }
    printf("%.9f\n", dp[w][b]);
}


  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
区间DP是一种动态规划的方法,用于解决区间范围内的问题。在Codeforces竞赛中,区间DP经常被用于解决一些复杂的字符串或序列相关的问题。 在区间DP中,dp[i][j]表示第一个序列前i个元素和第二个序列前j个元素的最优解。具体的转移方程会根据具体的问题而变化,但是通常会涉及到比较两个序列的元素是否相等,然后根据不同的情况进行状态转移。 对于区间长度为1的情况,可以先进行初始化,然后再通过枚举区间长度和区间左端点,计算出dp[i][j]的值。 以下是一个示例代码,展示了如何使用区间DP来解决一个字符串匹配的问题: #include <cstdio> #include <cstring> #include <string> #include <iostream> #include <algorithm> using namespace std; const int maxn=510; const int inf=0x3f3f3f3f; int n,dp[maxn][maxn]; char s[maxn]; int main() { scanf("%d", &n); scanf("%s", s + 1); for(int i = 1; i <= n; i++) dp[i][i] = 1; for(int i = 1; i <= n; i++) { if(s[i] == s[i - 1]) dp[i][i - 1] = 1; else dp[i][i - 1] = 2; } for(int len = 3; len <= n; len++) { int r; for(int l = 1; l + len - 1 <= n; l++) { r = l + len - 1; dp[l][r] = inf; if(s[l] == s[r]) dp[l][r] = min(dp[l + 1][r], dp[l][r - 1]); else { for(int k = l; k <= r; k++) { dp[l][r] = min(dp[l][r], dp[l][k] + dp[k + 1][r]); } } } } printf("%d\n", dp[n]); return 0; } 希望这个例子能帮助你理解区间DP的基本思想和应用方法。如果你还有其他问题,请随时提问。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值