cross_entropy中ignore——index

官方文档

https://pytorch.org/docs/master/nn.functional.html#torch.nn.functional.cross_entropy

作用

ignore_index用于忽略ground-truth中某些不需要参与计算的类。假设有两类{0:背景,1:前景},若想在计算交叉熵时忽略背景(0)类,则可令ignore_index=0(同理忽略前景计算可设ignore_index=1)。

代码示例

import torch
import torch.nn.functional as F
pred = torch.Tensor(
    [
        [0.9, 0.1],
        [0.8, 0.2],
        [0.7, 0.3]
    ]
)  # shape=(N,C)=(3,2),N为样本数,C为类数
label = torch.LongTensor([1, 0, 1])  # shape=(N)=(3),3个样本的label分别为1,0,1
out = F.cross_entropy(pred, label, ignore_index=0)  # 忽略0类
print(out)

   
   
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12

输出

tensor(1.0421)

   
   
  • 1

验证

pytorch的CrossEntropy使用公式:
在这里插入图片描述
计算:

        l
       
       
        o
       
       
        s
       
       
        s
       
      
     
    
    
     
      
       
       
        =
       
       
        
         1
        
        
         2
        
       
       
        ×
       
       
        {
       
       
        [
       
       
        −
       
       
        0.1
       
       
        +
       
       
        l
       
       
        n
       
       
        (
       
       
        
         e
        
        
         0.9
        
       
       
        +
       
       
        
         e
        
        
         0.1
        
       
       
        )
       
       
        ]
       
       
        +
       
       
        [
       
       
        −
       
       
        0.3
       
       
        +
       
       
        l
       
       
        n
       
       
        (
       
       
        
         e
        
        
         0.7
        
       
       
        +
       
       
        
         e
        
        
         0.3
        
       
       
        )
       
       
        ]
       
       
        }
       
      
     
    
   
   
    
     
      
     
    
    
     
      
       
       
        =
       
       
        
         1
        
        
         2
        
       
       
        ×
       
       
        (
       
       
        1.1711
       
       
        +
       
       
        0.9130
       
       
        )
       
      
     
    
   
   
    
     
      
     
    
    
     
      
       
       
        =
       
       
        1.0421
       
      
     
    
   
  
  
   <span class="MathJax_Preview" style="color: inherit; display: none;"></span><div class="MathJax_Display"><span class="MathJax MathJax_FullWidth" id="MathJax-Element-1-Frame" tabindex="0" data-mathml="<math xmlns=&quot;http://www.w3.org/1998/Math/MathML&quot; display=&quot;block&quot;><mtable columnalign=&quot;right left right left right left right left right left right left&quot; rowspacing=&quot;3pt&quot; columnspacing=&quot;0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em&quot; displaystyle=&quot;true&quot;><mtr><mtd><mi>l</mi><mi>o</mi><mi>s</mi><mi>s</mi></mtd><mtd><mi></mi><mo>=</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mo>&amp;#x00D7;</mo><mo fence=&quot;false&quot; stretchy=&quot;false&quot;>{</mo><mo stretchy=&quot;false&quot;>[</mo><mo>&amp;#x2212;</mo><mn>0.1</mn><mo>+</mo><mi>l</mi><mi>n</mi><mo stretchy=&quot;false&quot;>(</mo><msup><mi>e</mi><mrow class=&quot;MJX-TeXAtom-ORD&quot;><mn>0.9</mn></mrow></msup><mo>+</mo><msup><mi>e</mi><mrow class=&quot;MJX-TeXAtom-ORD&quot;><mn>0.1</mn></mrow></msup><mo stretchy=&quot;false&quot;>)</mo><mo stretchy=&quot;false&quot;>]</mo><mo>+</mo><mo stretchy=&quot;false&quot;>[</mo><mo>&amp;#x2212;</mo><mn>0.3</mn><mo>+</mo><mi>l</mi><mi>n</mi><mo stretchy=&quot;false&quot;>(</mo><msup><mi>e</mi><mrow class=&quot;MJX-TeXAtom-ORD&quot;><mn>0.7</mn></mrow></msup><mo>+</mo><msup><mi>e</mi><mrow class=&quot;MJX-TeXAtom-ORD&quot;><mn>0.3</mn></mrow></msup><mo stretchy=&quot;false&quot;>)</mo><mo stretchy=&quot;false&quot;>]</mo><mo fence=&quot;false&quot; stretchy=&quot;false&quot;>}</mo></mtd></mtr><mtr><mtd /><mtd><mi></mi><mo>=</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mo>&amp;#x00D7;</mo><mo stretchy=&quot;false&quot;>(</mo><mn>1.1711</mn><mo>+</mo><mn>0.9130</mn><mo stretchy=&quot;false&quot;>)</mo></mtd></mtr><mtr><mtd /><mtd><mi></mi><mo>=</mo><mn>1.0421</mn></mtd></mtr></mtable></math>" role="presentation" style="position: relative;"><nobr aria-hidden="true"><span class="math" id="MathJax-Span-1" style="width: 100%; display: inline-block; min-width: 27.058em;"><span style="display: inline-block; position: relative; width: 100%; height: 0px; font-size: 102%;"><span style="position: absolute; clip: rect(3.397em, 1026.45em, 9.376em, -999.997em); top: -6.635em; left: 0em; width: 100%;"><span class="mrow" id="MathJax-Span-2"><span style="display: inline-block; position: relative; width: 100%; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1026.45em, 9.376em, -999.997em); top: -6.635em; left: 50%; margin-left: -13.272em;"><span class="mtable" id="MathJax-Span-3" style="padding-left: 0.155em;"><span style="display: inline-block; position: relative; width: 26.349em; height: 0px;"><span style="position: absolute; clip: rect(3.245em, 1001.67em, 8.362em, -999.997em); top: -5.824em; left: 0em;"><span style="display: inline-block; position: relative; width: 1.725em; height: 0px;"><span style="position: absolute; width: 100%; clip: rect(3.144em, 1001.67em, 4.157em, -999.997em); top: -5.723em; left: 0em;"><span class="mtd" id="MathJax-Span-4"><span class="mrow" id="MathJax-Span-5"><span style="display: inline-block; position: relative; width: 100%; height: 0px;"><span style="position: absolute; clip: rect(3.144em, 1001.67em, 4.157em, -999.997em); top: -4em; left: 50%; margin-left: -0.859em;"><span class="mi" id="MathJax-Span-6" style="font-family: MathJax_Math-italic;">l</span><span class="mi" id="MathJax-Span-7" style="font-family: MathJax_Math-italic;">o</span><span class="mi" id="MathJax-Span-8" style="font-family: MathJax_Math-italic;">s</span><span class="mi" id="MathJax-Span-9" style="font-family: MathJax_Math-italic;">s</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; clip: rect(3.853em, 1000em, 4.157em, -999.997em); top: -3.392em; right: 0em;"><span class="mtd" id="MathJax-Span-61"><span class="mrow" id="MathJax-Span-62"></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; clip: rect(3.853em, 1000em, 4.157em, -999.997em); top: -1.619em; right: 0em;"><span class="mtd" id="MathJax-Span-76"><span class="mrow" id="MathJax-Span-77"></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span><span style="display: inline-block; width: 0px; height: 5.829em;"></span></span><span style="position: absolute; clip: rect(3.245em, 1024.58em, 9.021em, -999.997em); top: -6.483em; left: 1.725em;"><span style="display: inline-block; position: relative; width: 24.626em; height: 0px;"><span style="position: absolute; width: 100%; clip: rect(2.536em, 1024.58em, 4.866em, -999.997em); top: -5.723em; left: 0em;"><span class="mtd" id="MathJax-Span-10"><span class="mrow" id="MathJax-Span-11"><span style="display: inline-block; position: relative; width: 100%; height: 0px;"><span style="position: absolute; clip: rect(2.536em, 1024.58em, 4.866em, -999.997em); top: -4em; left: 50%; margin-left: -12.309em;"><span class="mi" id="MathJax-Span-12"></span><span class="mo" id="MathJax-Span-13" style="font-family: MathJax_Main; padding-left: 0.256em;">=</span><span class="mfrac" id="MathJax-Span-14" style="padding-left: 0.256em;"><span style="display: inline-block; position: relative; width: 0.611em; height: 0px; margin-right: 0.104em; margin-left: 0.104em;"><span style="position: absolute; clip: rect(3.195em, 1000.41em, 4.157em, -999.997em); top: -4.659em; left: 50%; margin-left: -0.251em;"><span class="mn" id="MathJax-Span-15" style="font-family: MathJax_Main;">1</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; clip: rect(3.195em, 1000.46em, 4.157em, -999.997em); top: -3.291em; left: 50%; margin-left: -0.251em;"><span class="mn" id="MathJax-Span-16" style="font-family: MathJax_Main;">2</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; clip: rect(0.864em, 1000.61em, 1.219em, -999.997em); top: -1.264em; left: 0em;"><span style="display: inline-block; overflow: hidden; vertical-align: 0em; border-top: 1.3px solid; width: 0.611em; height: 0px;"></span><span style="display: inline-block; width: 0px; height: 1.067em;"></span></span></span></span><span class="mo" id="MathJax-Span-17" style="font-family: MathJax_Main; padding-left: 0.205em;">×</span><span class="mo" id="MathJax-Span-18" style="font-family: MathJax_Main; padding-left: 0.205em;">{</span><span class="mo" id="MathJax-Span-19" style="font-family: MathJax_Main;">[</span><span class="mo" id="MathJax-Span-20" style="font-family: MathJax_Main;">−</span><span class="mn" id="MathJax-Span-21" style="font-family: MathJax_Main;">0.1</span><span class="mo" id="MathJax-Span-22" style="font-family: MathJax_Main; padding-left: 0.205em;">+</span><span class="mi" id="MathJax-Span-23" style="font-family: MathJax_Math-italic; padding-left: 0.205em;">l</span><span class="mi" id="MathJax-Span-24" style="font-family: MathJax_Math-italic;">n</span><span class="mo" id="MathJax-Span-25" style="font-family: MathJax_Main;">(</span><span class="msubsup" id="MathJax-Span-26"><span style="display: inline-block; position: relative; width: 1.472em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.41em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mi" id="MathJax-Span-27" style="font-family: MathJax_Math-italic;">e</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; top: -4.405em; left: 0.459em;"><span class="texatom" id="MathJax-Span-28"><span class="mrow" id="MathJax-Span-29"><span style="display: inline-block; position: relative; width: 0.915em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.86em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mn" id="MathJax-Span-30" style="font-size: 70.7%; font-family: MathJax_Main;">0.9</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span><span class="mo" id="MathJax-Span-31" style="font-family: MathJax_Main; padding-left: 0.205em;">+</span><span class="msubsup" id="MathJax-Span-32" style="padding-left: 0.205em;"><span style="display: inline-block; position: relative; width: 1.472em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.41em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mi" id="MathJax-Span-33" style="font-family: MathJax_Math-italic;">e</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; top: -4.405em; left: 0.459em;"><span class="texatom" id="MathJax-Span-34"><span class="mrow" id="MathJax-Span-35"><span style="display: inline-block; position: relative; width: 0.915em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.86em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mn" id="MathJax-Span-36" style="font-size: 70.7%; font-family: MathJax_Main;">0.1</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span><span class="mo" id="MathJax-Span-37" style="font-family: MathJax_Main;">)</span><span class="mo" id="MathJax-Span-38" style="font-family: MathJax_Main;">]</span><span class="mo" id="MathJax-Span-39" style="font-family: MathJax_Main; padding-left: 0.205em;">+</span><span class="mo" id="MathJax-Span-40" style="font-family: MathJax_Main; padding-left: 0.205em;">[</span><span class="mo" id="MathJax-Span-41" style="font-family: MathJax_Main;">−</span><span class="mn" id="MathJax-Span-42" style="font-family: MathJax_Main;">0.3</span><span class="mo" id="MathJax-Span-43" style="font-family: MathJax_Main; padding-left: 0.205em;">+</span><span class="mi" id="MathJax-Span-44" style="font-family: MathJax_Math-italic; padding-left: 0.205em;">l</span><span class="mi" id="MathJax-Span-45" style="font-family: MathJax_Math-italic;">n</span><span class="mo" id="MathJax-Span-46" style="font-family: MathJax_Main;">(</span><span class="msubsup" id="MathJax-Span-47"><span style="display: inline-block; position: relative; width: 1.472em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.41em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mi" id="MathJax-Span-48" style="font-family: MathJax_Math-italic;">e</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; top: -4.405em; left: 0.459em;"><span class="texatom" id="MathJax-Span-49"><span class="mrow" id="MathJax-Span-50"><span style="display: inline-block; position: relative; width: 0.915em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.91em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mn" id="MathJax-Span-51" style="font-size: 70.7%; font-family: MathJax_Main;">0.7</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span><span class="mo" id="MathJax-Span-52" style="font-family: MathJax_Main; padding-left: 0.205em;">+</span><span class="msubsup" id="MathJax-Span-53" style="padding-left: 0.205em;"><span style="display: inline-block; position: relative; width: 1.472em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.41em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mi" id="MathJax-Span-54" style="font-family: MathJax_Math-italic;">e</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; top: -4.405em; left: 0.459em;"><span class="texatom" id="MathJax-Span-55"><span class="mrow" id="MathJax-Span-56"><span style="display: inline-block; position: relative; width: 0.915em; height: 0px;"><span style="position: absolute; clip: rect(3.397em, 1000.86em, 4.157em, -999.997em); top: -4em; left: 0em;"><span class="mn" id="MathJax-Span-57" style="font-size: 70.7%; font-family: MathJax_Main;">0.3</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span><span class="mo" id="MathJax-Span-58" style="font-family: MathJax_Main;">)</span><span class="mo" id="MathJax-Span-59" style="font-family: MathJax_Main;">]</span><span class="mo" id="MathJax-Span-60" style="font-family: MathJax_Main;">}</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; width: 100%; clip: rect(2.536em, 1010.74em, 4.866em, -999.997em); top: -3.392em; left: 0em;"><span class="mtd" id="MathJax-Span-63"><span class="mrow" id="MathJax-Span-64"><span style="display: inline-block; position: relative; width: 100%; height: 0px;"><span style="position: absolute; clip: rect(2.536em, 1010.74em, 4.866em, -999.997em); top: -4em; left: 50%; margin-left: -5.419em;"><span class="mi" id="MathJax-Span-65"></span><span class="mo" id="MathJax-Span-66" style="font-family: MathJax_Main; padding-left: 0.256em;">=</span><span class="mfrac" id="MathJax-Span-67" style="padding-left: 0.256em;"><span style="display: inline-block; position: relative; width: 0.611em; height: 0px; margin-right: 0.104em; margin-left: 0.104em;"><span style="position: absolute; clip: rect(3.195em, 1000.41em, 4.157em, -999.997em); top: -4.659em; left: 50%; margin-left: -0.251em;"><span class="mn" id="MathJax-Span-68" style="font-family: MathJax_Main;">1</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; clip: rect(3.195em, 1000.46em, 4.157em, -999.997em); top: -3.291em; left: 50%; margin-left: -0.251em;"><span class="mn" id="MathJax-Span-69" style="font-family: MathJax_Main;">2</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; clip: rect(0.864em, 1000.61em, 1.219em, -999.997em); top: -1.264em; left: 0em;"><span style="display: inline-block; overflow: hidden; vertical-align: 0em; border-top: 1.3px solid; width: 0.611em; height: 0px;"></span><span style="display: inline-block; width: 0px; height: 1.067em;"></span></span></span></span><span class="mo" id="MathJax-Span-70" style="font-family: MathJax_Main; padding-left: 0.205em;">×</span><span class="mo" id="MathJax-Span-71" style="font-family: MathJax_Main; padding-left: 0.205em;">(</span><span class="mn" id="MathJax-Span-72" style="font-family: MathJax_Main;">1.1711</span><span class="mo" id="MathJax-Span-73" style="font-family: MathJax_Main; padding-left: 0.205em;">+</span><span class="mn" id="MathJax-Span-74" style="font-family: MathJax_Main; padding-left: 0.205em;">0.9130</span><span class="mo" id="MathJax-Span-75" style="font-family: MathJax_Main;">)</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span><span style="position: absolute; width: 100%; clip: rect(3.195em, 1004em, 4.157em, -999.997em); top: -1.619em; left: 0em;"><span class="mtd" id="MathJax-Span-78"><span class="mrow" id="MathJax-Span-79"><span style="display: inline-block; position: relative; width: 100%; height: 0px;"><span style="position: absolute; clip: rect(3.195em, 1004em, 4.157em, -999.997em); top: -4em; left: 50%; margin-left: -2.024em;"><span class="mi" id="MathJax-Span-80"></span><span class="mo" id="MathJax-Span-81" style="font-family: MathJax_Main; padding-left: 0.256em;">=</span><span class="mn" id="MathJax-Span-82" style="font-family: MathJax_Main; padding-left: 0.256em;">1.0421</span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span></span></span><span style="display: inline-block; width: 0px; height: 4.005em;"></span></span></span><span style="display: inline-block; width: 0px; height: 6.488em;"></span></span></span></span><span style="display: inline-block; width: 0px; height: 6.64em;"></span></span></span></span><span style="display: inline-block; width: 0px; height: 6.64em;"></span></span></span><span style="display: inline-block; overflow: hidden; vertical-align: -2.685em; border-left: 0px solid; width: 0px; height: 5.894em;"></span></span></nobr><span class="MJX_Assistive_MathML MJX_Assistive_MathML_Block" role="presentation"><math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mtr><mtd><mi>l</mi><mi>o</mi><mi>s</mi><mi>s</mi></mtd><mtd><mi></mi><mo>=</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mo>×</mo><mo fence="false" stretchy="false">{</mo><mo stretchy="false">[</mo><mo>−</mo><mn>0.1</mn><mo>+</mo><mi>l</mi><mi>n</mi><mo stretchy="false">(</mo><msup><mi>e</mi><mrow class="MJX-TeXAtom-ORD"><mn>0.9</mn></mrow></msup><mo>+</mo><msup><mi>e</mi><mrow class="MJX-TeXAtom-ORD"><mn>0.1</mn></mrow></msup><mo stretchy="false">)</mo><mo stretchy="false">]</mo><mo>+</mo><mo stretchy="false">[</mo><mo>−</mo><mn>0.3</mn><mo>+</mo><mi>l</mi><mi>n</mi><mo stretchy="false">(</mo><msup><mi>e</mi><mrow class="MJX-TeXAtom-ORD"><mn>0.7</mn></mrow></msup><mo>+</mo><msup><mi>e</mi><mrow class="MJX-TeXAtom-ORD"><mn>0.3</mn></mrow></msup><mo stretchy="false">)</mo><mo stretchy="false">]</mo><mo fence="false" stretchy="false">}</mo></mtd></mtr><mtr><mtd></mtd><mtd><mi></mi><mo>=</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mo>×</mo><mo stretchy="false">(</mo><mn>1.1711</mn><mo>+</mo><mn>0.9130</mn><mo stretchy="false">)</mo></mtd></mtr><mtr><mtd></mtd><mtd><mi></mi><mo>=</mo><mn>1.0421</mn></mtd></mtr></mtable></math></span></span></div><script type="math/tex; mode=display" id="MathJax-Element-1">\begin{aligned} loss&=\frac{1}{2}\times\{[-0.1+ln(e^{0.9}+e^{0.1})]+[-0.3+ln(e^{0.7}+e^{0.3})]\}\\ &=\frac{1}{2}\times(1.1711+0.9130)\\ &=1.0421 \end{aligned}</script>
  
 
</span><span class="katex-html"><span class="base"><span class="strut" style="height: 6.11488em; vertical-align: -2.80744em;"></span><span class="mord"><span class="mtable"><span class="col-align-r"><span class="vlist-t vlist-t2"><span class="vlist-r"><span class="vlist" style="height: 3.30744em;"><span class="" style="top: -5.30744em;"><span class="pstrut" style="height: 3.32144em;"></span><span class="mord"><span class="mord mathdefault" style="margin-right: 0.01968em;">l</span><span class="mord mathdefault">o</span><span class="mord mathdefault">s</span><span class="mord mathdefault">s</span></span></span><span class="" style="top: -3em;"><span class="pstrut" style="height: 3.32144em;"></span><span class="mord"></span></span><span class="" style="top: -1.174em;"><span class="pstrut" style="height: 3.32144em;"></span><span class="mord"></span></span></span><span class="vlist-s">​</span></span><span class="vlist-r"><span class="vlist" style="height: 2.80744em;"><span class=""></span></span></span></span></span><span class="col-align-l"><span class="vlist-t vlist-t2"><span class="vlist-r"><span class="vlist" style="height: 3.30744em;"><span class="" style="top: -5.30744em;"><span class="pstrut" style="height: 3.32144em;"></span><span class="mord"><span class="mord"></span><span class="mspace" style="margin-right: 0.277778em;"></span><span class="mrel">=</span><span class="mspace" style="margin-right: 0.277778em;"></span><span class="mord"><span class="mopen nulldelimiter"></span><span class="mfrac"><span class="vlist-t vlist-t2"><span class="vlist-r"><span class="vlist" style="height: 1.32144em;"><span class="" style="top: -2.314em;"><span class="pstrut" style="height: 3em;"></span><span class="mord"><span class="mord">2</span></span></span><span class="" style="top: -3.23em;"><span class="pstrut" style="height: 3em;"></span><span class="frac-line" style="border-bottom-width: 0.04em;"></span></span><span class="" style="top: -3.677em;"><span class="pstrut" style="height: 3em;"></span><span class="mord"><span class="mord">1</span></span></span></span><span class="vlist-s">​</span></span><span class="vlist-r"><span class="vlist" style="height: 0.686em;"><span class=""></span></span></span></span></span><span class="mclose nulldelimiter"></span></span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">×</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mopen">{<!-- --></span><span class="mopen">[</span><span class="mord">−</span><span class="mord">0</span><span class="mord">.</span><span class="mord">1</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">+</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mord mathdefault" style="margin-right: 0.01968em;">l</span><span class="mord mathdefault">n</span><span class="mopen">(</span><span class="mord"><span class="mord mathdefault">e</span><span class="msupsub"><span class="vlist-t"><span class="vlist-r"><span class="vlist" style="height: 0.864108em;"><span class="" style="top: -3.113em; margin-right: 0.05em;"><span class="pstrut" style="height: 2.7em;"></span><span class="sizing reset-size6 size3 mtight"><span class="mord mtight"><span class="mord mtight">0</span><span class="mord mtight">.</span><span class="mord mtight">9</span></span></span></span></span></span></span></span></span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">+</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mord"><span class="mord mathdefault">e</span><span class="msupsub"><span class="vlist-t"><span class="vlist-r"><span class="vlist" style="height: 0.864108em;"><span class="" style="top: -3.113em; margin-right: 0.05em;"><span class="pstrut" style="height: 2.7em;"></span><span class="sizing reset-size6 size3 mtight"><span class="mord mtight"><span class="mord mtight">0</span><span class="mord mtight">.</span><span class="mord mtight">1</span></span></span></span></span></span></span></span></span><span class="mclose">)</span><span class="mclose">]</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">+</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mopen">[</span><span class="mord">−</span><span class="mord">0</span><span class="mord">.</span><span class="mord">3</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">+</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mord mathdefault" style="margin-right: 0.01968em;">l</span><span class="mord mathdefault">n</span><span class="mopen">(</span><span class="mord"><span class="mord mathdefault">e</span><span class="msupsub"><span class="vlist-t"><span class="vlist-r"><span class="vlist" style="height: 0.864108em;"><span class="" style="top: -3.113em; margin-right: 0.05em;"><span class="pstrut" style="height: 2.7em;"></span><span class="sizing reset-size6 size3 mtight"><span class="mord mtight"><span class="mord mtight">0</span><span class="mord mtight">.</span><span class="mord mtight">7</span></span></span></span></span></span></span></span></span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">+</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mord"><span class="mord mathdefault">e</span><span class="msupsub"><span class="vlist-t"><span class="vlist-r"><span class="vlist" style="height: 0.864108em;"><span class="" style="top: -3.113em; margin-right: 0.05em;"><span class="pstrut" style="height: 2.7em;"></span><span class="sizing reset-size6 size3 mtight"><span class="mord mtight"><span class="mord mtight">0</span><span class="mord mtight">.</span><span class="mord mtight">3</span></span></span></span></span></span></span></span></span><span class="mclose">)</span><span class="mclose">]</span><span class="mclose">}</span></span></span><span class="" style="top: -3em;"><span class="pstrut" style="height: 3.32144em;"></span><span class="mord"><span class="mord"></span><span class="mspace" style="margin-right: 0.277778em;"></span><span class="mrel">=</span><span class="mspace" style="margin-right: 0.277778em;"></span><span class="mord"><span class="mopen nulldelimiter"></span><span class="mfrac"><span class="vlist-t vlist-t2"><span class="vlist-r"><span class="vlist" style="height: 1.32144em;"><span class="" style="top: -2.314em;"><span class="pstrut" style="height: 3em;"></span><span class="mord"><span class="mord">2</span></span></span><span class="" style="top: -3.23em;"><span class="pstrut" style="height: 3em;"></span><span class="frac-line" style="border-bottom-width: 0.04em;"></span></span><span class="" style="top: -3.677em;"><span class="pstrut" style="height: 3em;"></span><span class="mord"><span class="mord">1</span></span></span></span><span class="vlist-s">​</span></span><span class="vlist-r"><span class="vlist" style="height: 0.686em;"><span class=""></span></span></span></span></span><span class="mclose nulldelimiter"></span></span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">×</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mopen">(</span><span class="mord">1</span><span class="mord">.</span><span class="mord">1</span><span class="mord">7</span><span class="mord">1</span><span class="mord">1</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mbin">+</span><span class="mspace" style="margin-right: 0.222222em;"></span><span class="mord">0</span><span class="mord">.</span><span class="mord">9</span><span class="mord">1</span><span class="mord">3</span><span class="mord">0</span><span class="mclose">)</span></span></span><span class="" style="top: -1.174em;"><span class="pstrut" style="height: 3.32144em;"></span><span class="mord"><span class="mord"></span><span class="mspace" style="margin-right: 0.277778em;"></span><span class="mrel">=</span><span class="mspace" style="margin-right: 0.277778em;"></span><span class="mord">1</span><span class="mord">.</span><span class="mord">0</span><span class="mord">4</span><span class="mord">2</span><span class="mord">1</span></span></span></span><span class="vlist-s">​</span></span><span class="vlist-r"><span class="vlist" style="height: 2.80744em;"><span class=""></span></span></span></span></span></span></span></span></span></span></span></p> 

torch.nn.CrossEntropyLoss 同理。

  • 1
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值