映射標題（4.0 版）

發行項
01/23/2024

影像分析 4.0 中的影像標題可透過 Caption 和 Dense Captions 功能取得。

Caption 會為所有影像內容產生一個句子描述。除了描述整個影像之外，密集標題還提供更多詳細數據，方法是產生最多10個影像區域的一句描述。密集輔助字幕也會傳回所描述影像區域的週框方塊座標。這兩項功能都使用以佛羅倫薩為基礎的 AI 模型。

目前，影像標題僅適用於英文版。

重要

影像分析 4.0 中的影像標題僅適用於下列 Azure 數據中心區域：美國東部、法國中部、韓國中部、北歐、東南亞、西歐、美國西部、東亞。您必須使用位於其中一個區域的視覺資源，從 Caption 和 Dense Captions 功能取得結果。

如果您必須使用這些區域以外的視覺資源來產生影像標題，請使用所有 Azure AI 視覺區域中可用的影像分析 3.2。

使用 Vision Studio 快速且輕鬆地在瀏覽器中試用影像標題功能。

試用 Vision Studio

性別中性標題

標題預設包含性別詞彙（“man”、“woman”、“boy” 和 “girl”）。您可以選擇在結果中以「人員」取代這些詞彙，並接收性別中性標題。若要這麼做，您可以在要求 URL 中將選擇性 API 要求參數性別中性標題 設定true為。

下列 JSON 回應說明在根據視覺功能描述範例影像時，「分析 4.0 API」傳回的內容。

Photo of a man pointing at a screen

"captions": [
    {
        "text": "a man pointing at a screen",
        "confidence": 0.4891590476036072
    }
]

下列 JSON 回應說明 Analysis 4.0 API 在產生範例影像的密集標題時所傳回的內容。

Photo of a tractor on a farm

{
  "denseCaptionsResult": {
    "values": [
      {
        "text": "a man driving a tractor in a farm",
        "confidence": 0.535620927810669,
        "boundingBox": {
          "x": 0,
          "y": 0,
          "w": 850,
          "h": 567
        }
      },
      {
        "text": "a man driving a tractor in a field",
        "confidence": 0.5428450107574463,
        "boundingBox": {
          "x": 132,
          "y": 266,
          "w": 209,
          "h": 219
        }
      },
      {
        "text": "a blurry image of a tree",
        "confidence": 0.5139822363853455,
        "boundingBox": {
          "x": 147,
          "y": 126,
          "w": 76,
          "h": 131
        }
      },
      {
        "text": "a man riding a tractor",
        "confidence": 0.4799223840236664,
        "boundingBox": {
          "x": 206,
          "y": 264,
          "w": 64,
          "h": 97
        }
      },
      {
        "text": "a blue sky above a hill",
        "confidence": 0.35495415329933167,
        "boundingBox": {
          "x": 0,
          "y": 0,
          "w": 837,
          "h": 166
        }
      },
      {
        "text": "a tractor in a field",
        "confidence": 0.47338250279426575,
        "boundingBox": {
          "x": 0,
          "y": 243,
          "w": 838,
          "h": 311
        }
      }
    ]
  },
  "modelVersion": "2024-02-01",
  "metadata": {
    "width": 850,
    "height": 567
  }
}

使用 API

影像標題
密集標題

影像標題功能是分析影像 API 的一部分。包含在Caption功能查詢參數中。然後，當您取得完整的 JSON 回應時，剖析區段內容的 "captionResult" 字串。

映射標題（4.0 版）

性別中性標題

Caption 和 Dense Captions 範例

使用 API

下一步

其他資源

映射 標題 （4.0 版）

性別中性 標題

Caption 和 Dense Captions 範例

使用 API

下一步

其他資源

映射標題（4.0 版）

性別中性標題