Skip to content

Amazon AVS and Google Cast accessory certification

Guide - AVS, Alexa Built-in, Cast, Chromecast built-in

Integrating Amazon Alexa or Google Cast into a smart speaker, a connected display, a TV or a home accessory adds a programme-certification layer on top of regulatory radio and EMC certification. Amazon structures its programme around three tracks, Alexa Built-in with the AVS Device SDK, Alexa Connect Kit (ACK) for simpler IoT devices, and Alexa Built-in Lite for limited integrations. Google structures its programme around Chromecast built-in for audio, Chromecast built-in for TVs, and Google Assistant built-in, with extensions emerging around Matter casting. Both programmes share the same constraint families: cloud-connection conformity, acoustic front end and far-field microphone test plan for voice products, OAuth and credential management, privacy and microphone-indicator rules, DRM for video. This page presents the institutional map of the two programmes, the AFE and acoustic test-plan logic, the wake-word and DSP IP layer, the mDNS-SD and DIAL discovery stack on the Cast side, the Widevine track for video, and the cross-programme constraints with regulatory radio certification.

The two programmes are run by Amazon and Google as private programmes, separate from public regulatory regimes. The certificate they issue is contractual: it authorises use of the Alexa Built-in or Chromecast built-in brand and access to the corresponding cloud, under the terms of the developer agreement.

ActorScopeType of decision
Amazon DeveloperEnrolment, SDK access, Alexa cloud account managementAccount creation, contractual access to AVS or ACK
Amazon Alexa certification teamFunctional, acoustic and privacy review for Alexa Built-inAcoustic and functional test-plan pass, brand authorisation
Google DevelopersCast SDK and Assistant SDK access, account managementAccount creation, contractual access to the Cast cloud
Google certification teamFunctional, acoustic and DRM review for Cast and AssistantCast or Assistant certification pass, brand authorisation
Accredited acoustic laboratoriesAnechoic chamber, diffuse field, multi-talker conditionsTest reports compliant with the Amazon or Google test plan
Widevine and DRM partnersHardware Widevine L1 integration, KeyBox provisioningIntegration certificate for premium video

The procedural counterpart for the manufacturer is, at Amazon, the Alexa Built-in or ACK certification team accessed through the Amazon Developer portal, and at Google, the Cast or Assistant certification team accessed through Google Developers. Direct enrolment without a developer account is not possible.

Amazon AVS: Alexa Built-in, ACK, Alexa Built-in Lite

Section titled “Amazon AVS: Alexa Built-in, ACK, Alexa Built-in Lite”

The Amazon Alexa accessory programme comprises three certification tracks, defined by the functional depth of the Alexa integration and the location of the cloud logic.

Alexa Built-in covers products that embed the full Alexa client on the device. The technical foundation is the AVS Device SDK (C++ on Linux or RTOS, Java derivatives on Android-based platforms) or the Alexa Voice Service Integration Kit (AVS-IK) for accelerated integration on partner SoCs. The device handles:

  • wake-word detection locally, by an embedded engine (Sensory TrulyHandsfree, Amazon proprietary engine, or equivalent),
  • streaming to AVS of audio captured after wake-word, over HTTP/2 with OAuth authentication through Login With Amazon,
  • rendering of audio responses, with possibly a graphical interface on a display,
  • device-side capabilities declared via Alexa Smart Home or Alexa Skill API.

This track imposes the highest level of functional and acoustic conformance.

ACK targets simpler IoT devices: connected appliances, plugs, sensors, kitchen accessories. The Alexa client logic does not run on the device. The product integrates a small Amazon-supplied module (Wi-Fi plus MCU) preloaded with Amazon-hosted firmware. The device exposes its capabilities through an Amazon API, and Amazon manages the cloud part, OTA updates and account integration.

This track has no acoustic test plan, because the device typically has no far-field microphone array. Functional certification covers integration with the Amazon API and conformance to the Amazon Privacy and Security Requirements for connected devices.

Lite covers an intermediate scope: limited Alexa integration on devices that have a microphone but not a far-field array, typical of headphones or wearable accessories. The acoustic test plan is reduced compared with the full Built-in version, and the supported Alexa feature set is more limited.

TrackCloud logicMicrophoneTest-plan depth
Alexa Built-inOn device, AVS Device SDKFar-field array typicalFull acoustic and functional test plan
ACKAmazon-hosted firmware on dedicated moduleTypically noneFunctional only, no acoustic
Alexa Built-in LiteOn device, reduced scopeMono or close-talkReduced acoustic test plan

Functional certification for Alexa Built-in rests on an acoustic test plan, the Alexa Acoustic Test Plan, complemented by the Alexa Functional Test Plan. The objective is to validate that the device's acoustic front end (AFE) lets the wake-word engine and the AVS-streamed dialogue work in realistic listening conditions.

The AFE is the chain that picks up the user's voice, conditions it, applies acoustic echo cancellation when the device itself plays audio, and supplies a clean signal to the wake-word engine and the cloud. Its components:

  • far-field microphone array (typically 2 to 7 microphones depending on the device class), beamforming-capable,
  • AEC (Acoustic Echo Cancellation) for music playback or rendered audio responses,
  • noise suppression in diffuse fields, music and multi-talker environments,
  • beamforming and source localisation to focus on the active speaker,
  • automatic gain control to absorb proximity variations from the user.

DSP IP licensing is a recurring point: several actors (Cirrus Logic, Knowles, DSP Group, Synaptics-CES heritage) offer commercial AFE bricks pre-tuned for Alexa Built-in or Cast. The choice of brick conditions the SNR achieved and the test-plan pass rate.

Anechoic chamber and diffuse-field testing

Section titled “Anechoic chamber and diffuse-field testing”

Acoustic tests are conducted in an accredited laboratory equipped with an anechoic chamber able to simulate reverberation profiles representative of domestic environments. The instruments are supplied by specialists such as Audio Precision, Klippel, Listen Inc. The standard sequence:

  • wake-word detection at variable distances (typically up to several metres),
  • multi-talker with competing voice sources at different angles,
  • AEC during music playback at various levels,
  • noise suppression with diffuse noise injection,
  • false-accept rate with non-target audio content.

The exact target values are defined in the Amazon test plan and are subject to change. The order of magnitude of effort, when starting from a non-tuned AFE, is several iterations of physical testing, with hardware adjustments (mic placement, acoustic ports, internal damping) between sessions.

The wake-word engine is a critical brick. Two patterns:

  • embedded third-party engine, the most common case for non-Amazon products (Sensory TrulyHandsfree is the historical reference),
  • proprietary Amazon engine, available under licence for certain partners.

The wake-word activation rate, the latency between user utterance and AVS streaming, and the false-accept rate per hour are all metrics covered by the test plan. The engine and its tuning ship as a deliverable that travels through certification.

Cloud connection and credential management

Section titled “Cloud connection and credential management”

The AVS cloud connection works on a specific stack.

  • HTTP/2 endpoints on the Alexa cloud, with multiplexing for low-latency audio,
  • OAuth 2.0 through Login With Amazon for user authentication on the device,
  • refresh tokens managed by the device, with rotation according to Amazon's rules.

A common bug is refresh-token rotation that breaks after a device reboot: the device stores a refresh token, the reboot loses the rotated state, the cloud refuses the next refresh request, the user is logged out without explanation. This pattern is caught in pre-cert review when it appears.

The audio uploaded to AVS is typically encoded in Opus at 16 kHz mono, sometimes in PCM depending on the configuration. The choice between Opus and PCM influences bandwidth and codec latency, both factors weighed by Amazon during functional testing.

The Amazon Privacy and Security Requirements for Alexa Built-in impose strict rules on user privacy.

  • physical mute button required: a hardware control that physically disconnects or visibly inhibits the microphone array,
  • LED indicator when the microphone is actively streaming audio to the cloud, with an always-on policy during streaming,
  • microphone state signalled at every interaction, no silent listening,
  • factory reset that fully clears stored credentials and personal data.

The indicator-LED rule is regularly underestimated: a device that emits the LED only at the start of streaming and not for its full duration is non-compliant. Pre-cert review verifies the LED is on for the entire streaming window.

Google Cast: Chromecast built-in and Assistant built-in

Section titled “Google Cast: Chromecast built-in and Assistant built-in”

Google's accessory programme covers two main product tracks, plus an emerging Matter extension.

Audio-only products: smart speakers, connected sound bars, audio receivers. The integration goes through the Google Cast SDK on the receiver side, with a sender on phone, tablet or browser (the Cast app and the Chrome Cast extension). The architecture:

  • mDNS-SD (Multicast DNS Service Discovery) for discovering receivers on the local network,
  • DIAL (Discovery And Launch) to launch the receiver application,
  • Cast Application Framework (CAF) for session and message management between sender and receiver.

The product also typically integrates Google Assistant built-in when it has a microphone, with an acoustic test plan close in spirit to Amazon's.

TVs and connected displays. Add to the audio track:

  • video rendering with H.264, H.265 and AV1 codec support depending on the product class,
  • Widevine DRM, in L1 for HD or UHD premium content, in L3 for lower resolutions,
  • HDR pipeline with Dolby Vision or HDR10 plus depending on commercial positioning,
  • Dolby Atmos pass-through for compatible audio outputs.

Hardware requirements (RAM, CPU, AES decoder support, Wi-Fi class) are defined by Google in published documents, with minimums that evolve by SoC generation. The order of magnitude is a Wi-Fi 5 minimum with Wi-Fi 6 preferred on recent generations, but the actual figures applicable to a given project must be verified with Google.

A certification layer separate from Cast, dedicated to voice integration. The functional and acoustic test plan resembles in spirit the Amazon plan, with its own metrics: wake-word detection rate ("Hey Google" or "OK Google"), far-field SNR, AEC during music playback, false-accept rate. A product can be Cast-only, Assistant-only or both, depending on the targeted scope.

The 2024 to 2026 horizon introduces Matter casting: casting roles (Casting Source and Casting Target) on the Matter fabric. The point is to converge with the Matter ecosystem on the discovery and session-management layer, while keeping the Cast SDK on the application and media layer. The exact integration depends on Matter version revisions and ecosystem actor decisions: at the time of writing, the Cast over Matter angle is being structured.

TrackTypeFunctional certAcoustic certDRM
Chromecast built-in for audioAudio receiverYesOften paired with Assistant if mic presentLower constraint
Chromecast built-in for TVsVideo receiverYesIf embedded micWidevine L1 for premium HD or UHD
Google Assistant built-inVoice integrationYesYes, full acoustic test planNot applicable
Matter castingCasting extension over MatterEmergingDepends on product typeDepends on product type

For Cast products that render premium video, Widevine integration is a critical brick.

  • Widevine L1 (hardware-backed): keys handled in a Trusted Execution Environment (TEE), content decoding running in a protected path. Required for HD and UHD premium content from Netflix, Disney+ and equivalents.
  • Widevine L3 (software): keys and decoding in software. Acceptable for lower resolutions or specific configurations, insufficient for premium HD or UHD content.

L1 integration requires loading a KeyBox in the SoC at the factory, in a process supervised by the SoC vendor and Google. The KeyBox is unit-specific and not transferable. Forgetting the KeyBox at factory, or losing it during repair flows, is a recurring failure mode that results in a non-playable device.

A frequent pre-cert symptom is frame drops at the Widevine handshake when the stream starts or at key rotation. The cause is typically a timing problem in the protected pipeline or a slow key derivation that does not absorb in the audio-video synchronisation budget. Resolution involves measurements internal to the SoC and tuning of the protected decoder.

For high-end audio products, no equivalent DRM layer applies: the audio chain does not carry equivalent content protection.

Several constraints apply to both AVS and Cast.

AVS and Cast certifications do not replace radio regulatory certification. A connected speaker for the European market needs:

For the United States, FCC Part 15 takes the role of CE under RED. For Japan, MIC certification with the giteki marking, and so on for the other regulatory regions. The radio layers stack on top of the programme layers, not in substitution.

Privacy and security requirements imposed by Amazon and Google on connected accessories partly overlap with public referentials such as NIST IR 8425 and EN 303 645, notably on update mechanisms, default-password management and vulnerability handling. The convergence is not total, but well-organised cybersecurity work covers both the Amazon or Google ask and the public referentials.

The choice of AFE brick (DSP, wake-word engine, beamforming) involves third-party licensing. The relevant actors at the time of writing include Cirrus Logic, Knowles, DSP Group, and Synaptics with the CES Conexant heritage. The licensing structure and the technology roadmap evolve, and a current verification with each vendor is the only solid source.

Matter is gradually introducing itself as a unifying layer for the connected home, with extensions on the casting side for Google and on the smart-home side for both Amazon and Google. The Cast over Matter angle and Alexa Smart Home Matter support are tracked in parallel by both programmes. Matter does not replace the voice or media layer: it adds to it.

PitfallConsequence
AFE with insufficient microphone SNRWake-word detection rate below acoustic-test-plan threshold
AEC failing during music playbackWake-word masked, dialogue blocked while music plays
OAuth refresh-token rotation broken after rebootUser logged out, cloud connection lost without explanation
Cast video frame drops at Widevine handshakeStream rejected, premium content non-playable on the device
Mic mute LED not always on while streamingPrivacy rule breach, fails Alexa Privacy and Security review
Factory reset that removes Cast keys without re-provisioningDevice unusable as Cast receiver after reset
Confusing Alexa Built-in and ACK during scopingWrong SDK chosen, integration restarted at programme switch
Wake-word IP not licensed at industrialisationForced licence under time pressure or engine reselection
Underestimating Wi-Fi Alliance and Bluetooth SIG certifications on topProgramme certification obtained but product not radio-conformant
Forgetting the Widevine KeyBox at factoryL1 chain non-operational, premium HD or UHD non-playable

Step-by-step procedure for a smart-speaker product with Alexa Built-in

Section titled “Step-by-step procedure for a smart-speaker product with Alexa Built-in”

The typical sequence for a manufacturer integrating Alexa Built-in into a new product.

  1. Open the Amazon Developer account and sign the Alexa Built-in agreement, with selection of the target track (Built-in, ACK, Built-in Lite).
  2. Choose the SoC and the AFE brick, validate DSP IP licensing (DSP supplier, wake-word engine, beamforming).
  3. Integrate the AVS Device SDK or AVS-IK, including OAuth via Login With Amazon and refresh-token management.
  4. Acoustic and mechanical design: microphone array placement, internal damping, acoustic ports, mute button and LED.
  5. Internal acoustic tuning with provisional measurements in an in-house or low-cost chamber, before passing to an accredited laboratory.
  6. Pre-cert acoustic testing in an accredited laboratory (Audio Precision, Klippel, Listen Inc.), wake-word detection rate, AEC, multi-talker.
  7. Pre-cert functional testing on the Alexa Functional Test Plan: dialogues, capabilities, response rendering.
  8. Privacy and security review against the Amazon Privacy and Security Requirements: mute, LED, factory reset, data clearing.
  9. Wi-Fi Alliance and Bluetooth SIG certifications in parallel, on the regulatory radio side.
  10. Regional radio certification: FCC, CE under RED, MIC, KC and so on according to the target markets.
  11. Submit the full dossier to Amazon, with acoustic and functional reports, privacy declarations, regulatory references.
  12. Iterate on Amazon comments, conduct any complementary testing, get the Alexa Built-in pass.
  13. Brand authorisation and access to commercial Alexa Built-in deliverables (logo, marketing, registry on the Amazon site).
  14. Maintenance: track Alexa SDK updates, Amazon test-plan revisions, Wi-Fi Alliance and Bluetooth SIG roadmap.

For cross-cutting orders of magnitude per phase, see certification timeline.

CriterionAlexa Built-in (Amazon)Chromecast and Assistant built-in (Google)
Cloud entry pointAVS Device SDK or AVS-IK, HTTP/2 + OAuth via LWACast SDK or Assistant SDK, mDNS-SD + DIAL + CAF
Wake-word"Alexa", local engine + acoustic test plan"Hey Google" / "OK Google", local engine + test plan
Acoustic test planAlexa Acoustic Test Plan, accredited laboratoryEquivalent Google acoustic test plan, accredited laboratory
Video DRMNot applicable for typical audio productsWidevine L1 for premium HD or UHD on TV products
IoT lite trackACK (Alexa Connect Kit) on Amazon-hosted moduleNo strict equivalent, indirect via Google Home and Matter
Privacy and indicatorMute, always-on LED while streaming, factory resetEquivalent rules under Google policy
BrandAlexa Built-in logo and brand authorisationChromecast built-in / Google Assistant built-in logos

The two programmes share a strong family resemblance on the AFE, voice cloud and privacy parts. They diverge on the DRM for video angle (specific to Cast on TVs) and on the IoT-lite track (where Amazon has structured ACK with no exact symmetric on the Google side at the time of writing).

Sources & references

  1. Amazon Developer, Alexa Voice Service overview , Amazon developer.amazon.com/en-US/alexa/alexa-voice-service
  2. Amazon Developer, Alexa Connect Kit , Amazon developer.amazon.com/en-US/alexa/connected-devices/alexa-connect-kit
  3. Google Developers, Cast SDK overview , Google developers.google.com/cast
  4. Google Developers, Google Assistant SDK , Google developers.google.com/assistant/sdk
  5. Widevine, DRM for premium content , Google www.widevine.com/
  6. Connectivity Standards Alliance, Matter specification , CSA csa-iot.org/all-solutions/matter/