Hello Mateo !
Thank you for posting your question on Microsoft Learn.
Yes, adding ImageAnalysisSkill
can help. It allows you to analyze images for objects, text, and visual features (like colors and shapes). However, this skill alone may not fully extract numerical values or labels from a pie chart.
{
"@odata.type": "#Microsoft.Skills.Vision.ImageAnalysisSkill",
"visualFeatures": ["Categories", "Description", "Color", "Tags", "Objects"]
}
-
Color
: Helps to detect dominant colors. -
Objects
: Can identify elements like "pie chart" or "bar graph". -
Tags
: Provides descriptive labels like "finance", "diagram", etc.
To better extract structured data from graphs, use Custom Vision. You can train a model with example pie charts, histograms, and line graphs to recognize:
- Labels (for example "Banks", "Asset Managers")
- Proportions (for example 34%, 20%, etc.)
- Colors associated with categories
Azure Form Recognizer (Custom Model) can extract structured information from complex layouts:
- It can detect graph legends, values, and category labels.
- Supports tables, key-value pairs, and bounding boxes.
You can use Azure OpenAI GPT-4V (if available in your region) to reason about the extracted image content, drawing insights like:
- Matching colors to categories.
- Understanding relative proportions in pie charts.