What is Automatic Gain Control (AGC)?

VoIP audio often sounds fine for one person, then too quiet for the next. Users move, face away from the mic, or shout into it. Without control, the far end keeps adjusting volume and still misses words.

Table of Contents hide

1 How AGC works inside VoIP phones and intercoms

1.1 The three blocks: measure, decide, smooth

1.2 Slow rider + fast protector (common in good designs)

1.3 Where AGC belongs in the chain

2 What’s the difference between AGC, limiter, and compressor?

2.1 Compressor: shape dynamics above a threshold

2.2 Limiter: protect against clipping

2.3 AGC: aim for a target level

3 How should I set AGC target level, attack, and release?

3.1 Target level: choose comfort with headroom

3.2 Attack: fast enough to protect, not so fast it “ducks” words

3.3 Release: slow enough to avoid breathing noise

3.4 Max gain and hold: the real anti-pumping controls

4 Why does AGC pump noise or clip speech, and how to fix it?

4.1 Pumping: AGC follows noise instead of voice

4.2 Clipping: gain rides too high or reacts too slow

4.3 A practical troubleshooting map

5 Conclusion

6 Footnotes

Automatic Gain Control (AGC) is an audio process that automatically changes amplification to keep speech near a target loudness. It boosts quiet talkers and attenuates loud talkers so VoIP phones and intercoms sound consistent and intelligible.

Automatic gain control optimizing SIP intercom audio levels between office user and remote caller — SIP intercom AGC

How AGC works inside VoIP phones and intercoms

The three blocks: measure, decide, smooth

A practical AGC has three core pieces:

1) Level detector
It measures signal level, often using root mean square (RMS) ¹ (loudness-like) or peak (clip protection). Some designs use both.

2) Control law
It compares measured level to a target and decides a gain change. This may behave like a compressor curve with ratio and knee, or like a simple “move toward target” rule.

3) Smoothing
It applies attack and release time constants ² so gain does not jump every frame. Many systems also include hold/hangover so gain does not chase noise between speech bursts.

In VoIP, AGC often runs on 10–30 ms frames, similar to VAD. It updates gain gradually to avoid audible stepping. If you’re building on open stacks, the WebRTC GainControl API ³ is a good reference point for common knobs and behaviors.

Slow rider + fast protector (common in good designs)

Strong voice pipelines often use two stages:

A slow AGC that rides overall speech level toward target over hundreds of milliseconds to seconds.
A fast limiter after it to catch sudden peaks (door slams, loud shouts) and prevent clipping.

This avoids the ugly tradeoff where one fast AGC tries to do both loudness and clip protection. When AGC is too fast, it sounds like pumping. When it is too slow, loud spikes clip.

Where AGC belongs in the chain

Placement decides what AGC accidentally amplifies. In voice endpoints, a safe order is often:

Mic capture
Acoustic Echo Cancellation (AEC) ⁴ (so echo is not boosted)
Noise suppression (or moderate suppression)
AGC (so codec sees stable levels)
Limiter (final clip protection)
Encoder

Some platforms put AGC before noise suppression so the suppressor works on a stable level. Others put suppression after AGC to avoid boosting noise. In practice, the correct choice depends on the suppression design and whether AGC is noise-gated. The key rule is: do not let AGC boost echo or boost pure background noise during silence.

AGC element	What it controls	What users hear when wrong
Target level	Average loudness	Too quiet or always too loud
Max gain	How much it can boost	Background hiss gets loud
Attack	How fast it reduces gain	Clip or sudden “ducking”
Release	How fast it increases gain	Pumping or breathing noise
Hold/hangover	Delay gain changes in silence	Less pumping, fewer artifacts

AGC is not “make it louder.” It is “make it consistent.” Once that idea is clear, the next common confusion is the difference between AGC, a compressor, and a limiter.

Many installers mix these terms and tune the wrong knob.

What’s the difference between AGC, limiter, and compressor?

People often call everything “AGC.” In reality, these tools solve different parts of the loudness problem.

AGC aims for a target loudness over time. A compressor reduces dynamic range above a threshold. A limiter is a fast, high-ratio compressor that prevents peaks from exceeding a ceiling to avoid clipping.

Automatic gain control signal flow with level detector error control law and smoothing limiter block — AGC block diagram

Compressor: shape dynamics above a threshold

A compressor reduces gain when the signal rises above a set threshold. It usually leaves quiet signals alone. It is used to smooth loudness differences and keep speech more even. It can be gentle or aggressive depending on ratio and knee.

In VoIP, compression can help, but it can also make background noise more audible if used without noise gating.

Limiter: protect against clipping

A limiter is like a compressor with a very high ratio and fast time constants. Its purpose is protection. It prevents digital clipping in the ADC, DSP chain, or encoder input. In paging and intercoms, a limiter is valuable because sudden sounds are common.

A limiter does not make quiet speech louder by itself. It only stops peaks from going too high.

AGC: aim for a target level

AGC is the “automatic operator.” It moves gain toward a target over time, both up and down. It can increase gain during quiet talk, and it can decrease gain during loud talk. That makes it different from a typical compressor that only pushes down above threshold.

A simple way to remember:

Compressor/limiter reacts to “too loud.”
AGC reacts to “not at target,” both directions.

Tool	Main purpose	Typical speed	Best use in VoIP
AGC	Maintain consistent loudness	Slow to medium	Phones, intercoms with varying talk distance
Compressor	Reduce dynamic range	Medium	Smooth speech, reduce level jumps
Limiter	Prevent clipping	Fast	Protect codec input from peaks

In well-designed VoIP endpoints, AGC and limiter work together. AGC makes speech comfortable. Limiter prevents overload.

Now the next question is the one that matters in real deployments: how to set target, attack, and release so speech is clear without pumping.

How should I set AGC target level, attack, and release?

If AGC is tuned wrong, it sounds unnatural. If it is tuned right, nobody notices it. That is the goal.

Set AGC target with headroom (often around −18 to −12 dBFS short-term), use a medium attack so loud bursts do not clip, and use a slower release so gain does not jump during pauses. Pair AGC with a limiter ceiling to prevent peaks.

Graph comparing AGC compressor and limiter curves for threshold target loudness ratio and ceiling — AGC compressor limiter

Target level: choose comfort with headroom

Digital voice systems need headroom for consonants and sudden peaks. A common practical target is in the −18 to −12 dBFS (decibels relative to full scale) ⁵ range for short-term speech level. This keeps average speech strong without riding near 0 dBFS.

If you want a standards-based way to think about “speech level,” the active speech level method (ITU-T P.56) ⁶ is a common reference used across speech systems.

In SIP intercoms, talkers can be closer or farther than a phone. So max gain and target must be chosen carefully. Too high a target plus high max gain makes background noise loud when nobody talks.

Attack: fast enough to protect, not so fast it “ducks” words

Attack controls how quickly gain is reduced when the signal gets loud. If attack is too slow, loud talkers clip before AGC reacts. If attack is too fast, the first syllable can sound squashed and unnatural, especially with plosives (“p,” “b”).

A common approach is moderate attack for the slow rider, then a fast limiter to catch true peaks.

Release: slow enough to avoid breathing noise

Release controls how quickly gain rises after the signal gets quieter. If release is too fast, the system turns up room noise between words. That creates “pumping” or “breathing.” If release is too slow, quiet talkers stay too quiet for too long.

In noisy sites, slower release plus a hold time often sounds better than fast release.

Max gain and hold: the real anti-pumping controls

Max gain limits how much AGC can boost. This is critical. If max gain is too high, the device becomes a noise amplifier. Hold/hangover prevents gain from rising immediately when speech stops.

For SIP phones, max gain can be moderate because talk distance is stable. For SIP intercoms, max gain may need to be higher, but hold and speech gating become more important.

Parameter	If too low	If too high	Practical bias for intercoms
Target	Speech still quiet	Speech sounds “hot” and clips	Moderate target with limiter
Max gain	Far talkers too quiet	Noise pumping and hiss	Limit gain, improve mic placement
Attack	Clips on loud talkers	Over-ducking, unnatural speech	Medium attack + fast limiter
Release	Speech recovers slowly	Noise breathing between words	Slower release + hold
Hold	No effect	Sluggish feel after speech	Medium hold to reduce pumping

A field habit that helps: tune in the real environment, with real distance and real background noise. Lab tuning in silence almost always leads to over-boosting noise on site.

Now the last section addresses the two most common complaints: “AGC pumps noise” and “AGC clips speech.”

Both are predictable failure modes.

Why does AGC pump noise or clip speech, and how to fix it?

When users complain about “robotic” audio or “volume going up and down,” AGC is often involved. Pumping and clipping come from the same root: gain changes happening at the wrong time and for the wrong signal.

AGC pumps noise when it raises gain during silence or background-only frames. It clips speech when target is too high, max gain is too high, attack is too slow, or there is no limiter. Fix it with speech gating/hold, lower max gain, slower release, correct target, and a brick-wall limiter.

AGC target level chart for quiet office intercom lobby and noisy factory floor environments — AGC environment targets

Pumping: AGC follows noise instead of voice

Pumping happens when the detector thinks the background is the signal to normalize. Common causes:

No speech detector or weak VAD gating inside AGC
Release too fast
Hold too short
Max gain too high
Noise suppression placed after AGC, so AGC boosts noise then suppression fights it

Fix steps that usually work:
1) Reduce max gain.
2) Add or increase hold/hangover.
3) Slow the release.
4) Enable speech gating if the device supports it.
5) Improve BNE or VAD baseline so the system knows what “noise-only” looks like.

Clipping: gain rides too high or reacts too slow

Clipping can be digital (0 dBFS) or analog (mic preamp saturating). Causes include:

Target level too high
Attack too slow
No limiter, or limiter ceiling too high
Mic gain too high before DSP
Speakerphone echo path causing extra energy at mic

Fix steps:
1) Lower input gain at the mic/preamp stage if possible.
2) Lower AGC target or reduce ratio.
3) Speed up attack slightly, but keep it natural.
4) Add a limiter with a sensible ceiling (leave headroom).
5) Check AEC so far-end playback is not inflating mic level.

A practical troubleshooting map

Symptom	Likely cause	Fast fix
Noise gets louder between words	Release too fast / hold too short	Slow release, increase hold
Constant hiss even when nobody speaks	Max gain too high	Reduce max gain, improve noise suppression/BNE
Loud talkers distort	Attack too slow or no limiter	Add limiter, lower target, adjust attack
Soft talkers still quiet	Max gain too low or target too low	Increase max gain slightly, improve mic placement
“Volume rides” during crowd noise	Speech detector confused	Strengthen gating, tune BNE, slow release

In SIP intercom deployments, a common real fix is not only DSP. It is also physical:

Improve mic placement to increase speech-to-noise ratio.
Reduce wind and vibration.
Avoid mounting that causes resonance.
When SNR improves, AGC can be less aggressive, and pumping disappears naturally.

AGC should be the helper, not the star. When it is tuned with headroom and restraint, speech becomes consistent and the call feels stable. For WebRTC-style endpoints, the baseline expectations for processing features are summarized in RFC 7874 WebRTC audio processing requirements ⁷.

Conclusion

AGC keeps speech near a target loudness by adjusting gain over time. Pair it with a limiter, use sensible target/headroom, and prevent noise pumping with max-gain limits, hold, and speech-aware gating.

Footnotes

Quick refresher on RMS and why it’s used for loudness-style level detection. ↩ ↩
Explains compressor/limiter concepts and how attack/release timing shapes audible pumping. ↩ ↩
WebRTC’s GainControl interface shows common AGC knobs and behaviors used in many VoIP stacks. ↩ ↩
ITU-T G.168 outlines echo canceller performance requirements relevant to speakerphones and intercoms. ↩ ↩
Definition of dBFS and why 0 dBFS is the digital ceiling before clipping. ↩ ↩
Describes the active speech level method often used to reference consistent speech loudness targets. ↩ ↩
Summarizes recommended audio processing features (AGC, NS, AEC) for WebRTC interoperability. ↩ ↩

About The Author

DJSLink R&D Team

DJSLink China's top SIP Audio And Video Communication Solutions manufacturer & factory .
Over the past 15 years, we have not only provided reliable, secure, clear, high-quality audio and video products and services, but we also take care of the delivery of your projects, ensuring your success in the local market and helping you to build a strong reputation.

Request A Quote Today!

Your email address will not be published. Required fields are marked *. We will contact you within 24 hours！

Kindly Send Us Your Project Details

We Will Quote for You Within 24 Hours .

Get a Free Quote

DJSLink experts Will Quote for You Within 24 Hours .

What is Automatic Gain Control (AGC)?

How AGC works inside VoIP phones and intercoms

The three blocks: measure, decide, smooth

Slow rider + fast protector (common in good designs)

Where AGC belongs in the chain

What’s the difference between AGC, limiter, and compressor?

Compressor: shape dynamics above a threshold

Limiter: protect against clipping

AGC: aim for a target level

How should I set AGC target level, attack, and release?

Target level: choose comfort with headroom

Attack: fast enough to protect, not so fast it “ducks” words

Release: slow enough to avoid breathing noise

Max gain and hold: the real anti-pumping controls

Why does AGC pump noise or clip speech, and how to fix it?

Pumping: AGC follows noise instead of voice

Clipping: gain rides too high or reacts too slow

A practical troubleshooting map

Conclusion

Footnotes

DJSLink R&D Team

Request A Quote Today!

Kindly Send Us Your Project Details

Recent Products

DJSlink EH238-3P One MachineScreens Dispatching Console Phone

DJSlink EH238-2P One MachineTwo Screens Dispatching Console Phone

DJSlink YE-EH238 Dual Handle Dispatch Console Phone

DJSlink YE-EH215V Video Touch Dispatch Console Phone

DJSlink YE-EH215 Dispatch Console Phone

Recent Posts

What Is the Keypad Lifecycle Rating for a Weatherproof Telephone?

What Is the Hook Switch Lifecycle Rating for a Weatherproof Telephone?

What Cable Gland Materials Are Suitable for a Weatherproof Telephone?

Is C5-M Marine-Grade Protection Supported for a Weatherproof Telephone?

Get a Free Quote

Ready to Partner with a Manufacturing Leader?

Quick Link

products

Contact Info