Szerkesztés

Megosztás a következőn keresztül:


Audio Processing Object Architecture

Audio processing objects (APOs), provide customizable software based digital signal processing for Windows audio streams.

Audio Processing Objects Overview

Windows allows OEMs and third-party audio hardware manufacturers to include custom digital signal processing effects as part of their audio driver's value-added features. These effects are packaged as user-mode system effect Audio Processing Objects (APOs).

Audio processing objects (APOs), provide software based digital signal processing for Windows audio streams. An APO is a COM host object that contains an algorithm that is written to provide a specific Digital Signal Processing (DSP) effect. This capability is known informally as an "audio effect." Examples of APOs include graphic equalizers, reverb, tremolo, Acoustic Echo Cancellation (AEC) and Automatic Gain Control (AGC). APOs are COM-based, real-time, in-process objects.

Note   The descriptions and terminology in this documentation refers mostly to output devices, such as speakers. However, the technology is symmetric and works essentially in reverse for input devices.

Software APOs vs. Hardware DSP

A hardware digital signal processor (DSP) is a specialized microprocessor (or a SIP block), with its architecture optimized for the operational needs of digital signal processing. There can be significant advantages to implement audio processing in purpose built hardware vs. using a software APO. One advantage is that the CPU use and associated power consumption may be lower with a hardware implemented DSP.

There are other advantages and disadvantages to consider, specific to your projects goals and constraints that you will want to consider before implementing a software based APO.

Software based effects are inserted in the software device pipe on stream initialization. These solutions do all their effects processing on the main CPU and do not rely on external hardware. This type of solution is best for traditional Windows audio solutions such as HDAudio, USB and Bluetooth devices when the driver and hardware only support RAW processing. For more information about RAW processing, see Audio Signal Processing Modes.

Proxy APO for Hardware DSP

Any effects applied in hardware DSP need to be advertised via a proxy APO. Microsoft provides a default proxy APO (MsApoFxProxy.dll). To use the Microsoft provided APO, this property set and property must be supported.

Optionally, you can implement your own proxy APO.

Windows Provided (System) APOs

Windows installs a default set of APOs, that provide a number of different audio effects. For a list of the system provided APO effects, see Audio Signal Processing Modes.

OEMs can include all of the system supplied APOs or replace some or all of them with custom APOs.

Custom APOs

It is possible to create custom APOs to enhance the Windows audio experience by adding additional audio effects.

The OEM can include any combination of the provided Windows APOs and custom APOs when they ship Windows.

A custom APO can be installed by an OEM or a third-party to enhance the audio experience after the device has been purchased. When users install an audio device driver by using the standard INF file, they automatically have access to the system's APOs. Independent hardware vendors (IHVs) and original equipment manufacturers (OEMs) can provide additional custom system effects while still using the Microsoft class drivers. They do so by packaging their DSP algorithms as APOs and modifying the standard INF file to insert their APOs into the audio engine’s signal processing graph.

For more information on creating custom APOs see, Implementing Audio Processing Objects.

Custom APO Support App

To allow the user to configure settings associated with your custom APO the recommended approach is to create a Hardware Support App. For more information, see Hardware Support App (HSA): Steps for Driver Developers.

Custom APO Tests and Requirements

The Microsoft HLK provides tests that can be used with APOs. For more information about audio tests see, Device.Audio Testing and Device.Audio Tests.

These two tests can be particularly useful when working with APOs.

Verify Audio EffectsDiscovery (Manual) - Certification

SysFX Test

For information on audio requirements to support APOs, see Device.Audio Requirements.

Custom APO Tools and Utilities

You can use the "Audio Effects Discovery Sample" to explore the available audio effects. This sample demonstrates how to query audio effects on render and capture audio devices and how to monitor changes with the audio effects. It is included as part of the SDK samples and can be downloaded using this link:

Audio effects discovery sample

Application Audio Effect Awareness

Applications have the ability to call APIs to determine which audio effects are currently active on the system. For more information on the audio effects awareness APIs, see AudioRenderEffectsManager class.

Audio Processing Objects Architecture

Placement of Audio Effects

There are three different locations for audio effects implemented as APOs. They are in Stream effects (SFX), Mode effects (MFX), and Endpoint effects (EFX).

Stream Effect (SFX)

A stream effect APO has an instance of the effect for every stream. Stream effects are before the mix (render) or after the tee (capture) for a given mode and can be used for changing channel count before the mixer. Stream effects are not used for raw streams.

Some versions of Windows, as an optimization, do not load SFX or MFX APOs in RAW mode.

  • Windows 8.1 does not load RAW SFX or RAW MFX
  • Windows 10 loads RAW MFX but not RAW SFX

Mode Effect (MFX)

Mode effects (MFX) are applied to all streams that are mapped to the same mode. Mode effects are applied after the mix (render) or before the tee (capture) for a given mode, but before the mix (render) or after the tee (capture) of all modes. Any scenario specific effect or effect that doesn’t need the specifics of the stream effect should be placed here. It is more power efficient to use a mode effect since there is one instance for multiple streams that share the same characteristics like periodicity and format.

Endpoint Effect (EFX)

Endpoint Effect (EFX) are applied to all streams that use the same endpoint. An endpoint effect is always applied, even to raw streams. That is, it is after the mix (render) or before the tee (capture) of all modes. The endpoint effects should be used with care and when in doubt an effect should be place in the Mode area. Some effects that should be placed in the endpoint area are speaker protection and speaker compensation.

This diagram shows the possible locations for stream (SFX), mode (MFX) and endpoint (EFX) effects for Windows 10.

Diagram illustrating the placement of stream, mode, and endpoint effects in Windows 10.

Multiple Custom APO Effects

It is possible to configure multiple APO based effects to work with different applications.

This diagram illustrates how multiple applications can access multiple combinations of stream, mode and endpoint APO effects. All of the APOs are COM based and run in user mode. In this scenario, none of the effects are running in hardware or in kernel mode.

Diagram showing how multiple applications can access various combinations of stream, mode, and endpoint APO effects.

Note  You can use the scroll bar at the very bottom of this page to view all of this diagram.

Software Mode Effects and Hardware Endpoint Effects for Render and Capture

This diagram illustrates software mode effects and hardware endpoint effects for render and capture.

Diagram displaying software mode effects and hardware endpoint effects for render and capture processes.

DSP Equipped System with Hardware Effects

This diagram illustrates a DSP equipped system that implements effects in hardware. In this scenario, a Proxy APO should be created to inform the apps of the effects that are implemented in hardware.

Diagram depicting a DSP-equipped system implementing effects in hardware.

Windows Audio Processing Objects

Hardware Offloaded Audio Driver Implementation