- Joined
- Mar 9, 2026
- Messages
- 24
Welcome back to another edition of Tech Tuesday!
When everything is working, lossless streaming feels effortless. When it’s not, the symptoms can be subtle: intermittent dropouts, devices drifting out of sync, or inconsistent device visibility. Let’s dig into what’s happening under the hood, specifically packet loss, error handling, and how your network (Ethernet or Wi-Fi) manages communication.
Below are a few common settings and behaviors to look for on routers, mesh systems, and switches.
Many routers and mesh systems include a setting that prevents wireless devices from talking to each other directly.
WiiM devices rely on multicast for discovery and synchronization. Some networks try to “optimize” multicast in ways that can interfere with this.
Look for settings like:
If you’re using managed switches, Spanning Tree Protocol (STP) is running in the background to prevent loops.
Settings to look for:
Small improvements in how your network handles multicast, device communication, and latency consistency can make a noticeable difference, especially as you scale from a single room to a full home setup.
Packet Loss, Retransmission, and Audio Integrity
At a protocol level, most streaming relies on TCP (Transmission Control Protocol), which is designed to prioritize accuracy over timing. If a packet is lost, it is retransmitted. If packets arrive out of order, they are reassembled correctly. If the network is congested, the sender slows down. The result is bit-perfect delivery, but not necessarily real-time delivery.
That tradeoff is usually invisible because buffering absorbs it. A single device can quietly wait a few extra milliseconds (or even seconds) and you’ll never notice. In multiroom playback, though, timing starts to matter. If one device is waiting on retransmitted data while another is not, they drift apart, and the system has to correct.
Under the hood, not all streaming behaves the same way. Services like Spotify, Qobuz, and TIDAL typically use HTTP (Hypertext Transfer Protocol) over TCP (Transmission Control Protocol) with chunked delivery, while local playback via DLNA (Digital Living Network Alliance) or SMB (Server Message Block) is also TCP-based but served from your own network. AirPlay is the outlier, using RTP (Real-time Transport Protocol) over UDP (User Datagram Protocol) with tighter timing expectations.
In all of these cases, the system is constantly balancing accuracy and timing. TCP ensures the data is correct. The buffer determines when it gets played.
With Spotify Connect, your phone acts as a controller. It tells the WiiM device what to play, but the device itself pulls the stream directly from Spotify’s servers after that. Once playback starts, that connection is independent of your phone.
This is why you can leave your house, lose Wi-Fi, and the music keeps playing. It’s also why opening Spotify when on the go can unintentionally drop you back into the same session and accidentally wind up blast music back at home.
Local playback behaves differently. When you’re using Server Message Block (SMB) or Digital Living Network Alliance (DLNA), the WiiM device is pulling data directly from your Network-Attached Storage (NAS) or server. In that case, your network and your server’s responsiveness become part of the playback chain.
Devices are sharing playback state, buffer levels, and timing information. In a multiroom group, they’re also coordinating who leads, who follows, and how to stay aligned. None of this uses much bandwidth, but it is extremely sensitive to delay.
If these control signals arrive late, nothing sounds “bad,” but devices can react at slightly different times. That’s where sync issues start.
Ethernet vs Wi-Fi: What Actually Changes
This isn’t really about raw speed. Lossless audio typically requires well under 10 Mbps even for high-resolution formats, which is far below the capacity of modern Ethernet and Wi-Fi networks. The difference is consistency.
Ethernet is full-duplex and deterministic. Data flows in both directions simultaneously, with very little variation in timing. Wi-Fi, on the other hand, is a shared medium. Devices take turns transmitting, and that introduces variability, especially as more devices compete for airtime.
In practice:
Multiroom playback is where the network becomes more visible. Each device maintains its own buffer and experiences its own network conditions, then continuously aligns with the rest of the group. Small differences in latency or retransmissions between devices require ongoing correction.
In buffered streaming scenarios (Spotify, Qobuz, local libraries), this correction is usually subtle because each device has enough buffer to absorb fluctuations.
For low-latency inputs (HDMI, Optical In, Line In, Bluetooth), the behavior changes. These inputs use much smaller buffers to keep audio in sync with the source. With less buffering available, network variability has less room to be absorbed.
In these cases, if network throughput fluctuates or a device has a weaker connection, follower devices are more likely to exhibit stuttering rather than gradual drift.
Very resilient. Differences between Ethernet and Wi-Fi are rarely noticeable unless the connection is unstable.
Further Reading
If you want to see how these network concepts show up in real-world setups, these threads are worth a read:
When everything is working, lossless streaming feels effortless. When it’s not, the symptoms can be subtle: intermittent dropouts, devices drifting out of sync, or inconsistent device visibility. Let’s dig into what’s happening under the hood, specifically packet loss, error handling, and how your network (Ethernet or Wi-Fi) manages communication.
This article is more technical than most. Please ask questions in the comments.
TL : DR Optimizing Your Network for Multiroom Playback
Most playback issues in multiroom setups aren’t about bandwidth, they’re about how your network handles device-to-device communication and timing signals. Audio quality is usually protected by buffering. What you’re optimizing here is consistency and coordination between devices.Below are a few common settings and behaviors to look for on routers, mesh systems, and switches.
Check for Wireless Isolation / Client Isolation
Many routers and mesh systems include a setting that prevents wireless devices from talking to each other directly.- Often labeled: Access Point Isolation (AP Isolation), Client Isolation, or Guest Network Isolation
Multicast Handling (Internet Group Management Protocol (IGMP) Snooping / Multicast Enhancement)
WiiM devices rely on multicast for discovery and synchronization. Some networks try to “optimize” multicast in ways that can interfere with this.Look for settings like:
- IGMP Snooping
- Multicast Enhancement
- Multicast to Unicast Conversion
Spanning Tree / Bridge Protocol Data Units (BPDUs) Behavior (Managed Switches)
If you’re using managed switches, Spanning Tree Protocol (STP) is running in the background to prevent loops.Settings to look for:
- STP / RSTP enabled (usually good)
- BPDU Guard / Filtering / Flooding
Small improvements in how your network handles multicast, device communication, and latency consistency can make a noticeable difference, especially as you scale from a single room to a full home setup.
Now let's get into it!
Packet Loss, Retransmission, and Audio Integrity
At a protocol level, most streaming relies on TCP (Transmission Control Protocol), which is designed to prioritize accuracy over timing. If a packet is lost, it is retransmitted. If packets arrive out of order, they are reassembled correctly. If the network is congested, the sender slows down. The result is bit-perfect delivery, but not necessarily real-time delivery.
That tradeoff is usually invisible because buffering absorbs it. A single device can quietly wait a few extra milliseconds (or even seconds) and you’ll never notice. In multiroom playback, though, timing starts to matter. If one device is waiting on retransmitted data while another is not, they drift apart, and the system has to correct.
Under the hood, not all streaming behaves the same way. Services like Spotify, Qobuz, and TIDAL typically use HTTP (Hypertext Transfer Protocol) over TCP (Transmission Control Protocol) with chunked delivery, while local playback via DLNA (Digital Living Network Alliance) or SMB (Server Message Block) is also TCP-based but served from your own network. AirPlay is the outlier, using RTP (Real-time Transport Protocol) over UDP (User Datagram Protocol) with tighter timing expectations.
In all of these cases, the system is constantly balancing accuracy and timing. TCP ensures the data is correct. The buffer determines when it gets played.
Who’s Actually in Control of Playback?
One of the more confusing aspects of streaming is that your phone usually isn’t doing the streaming.With Spotify Connect, your phone acts as a controller. It tells the WiiM device what to play, but the device itself pulls the stream directly from Spotify’s servers after that. Once playback starts, that connection is independent of your phone.
Local playback behaves differently. When you’re using Server Message Block (SMB) or Digital Living Network Alliance (DLNA), the WiiM device is pulling data directly from your Network-Attached Storage (NAS) or server. In that case, your network and your server’s responsiveness become part of the playback chain.
Why Bidirectional Communication Matters
Even though we think of streaming as “sending audio to a speaker,” there’s constant back-and-forth communication happening.Devices are sharing playback state, buffer levels, and timing information. In a multiroom group, they’re also coordinating who leads, who follows, and how to stay aligned. None of this uses much bandwidth, but it is extremely sensitive to delay.
- Playback position and transport state (play, pause, seek)
- Buffer fill level and underrun risk
- Clock synchronization and drift correction
- Group membership and leader/follower coordination
Ethernet vs Wi-Fi: What Actually Changes
This isn’t really about raw speed. Lossless audio typically requires well under 10 Mbps even for high-resolution formats, which is far below the capacity of modern Ethernet and Wi-Fi networks. The difference is consistency.
Ethernet is full-duplex and deterministic. Data flows in both directions simultaneously, with very little variation in timing. Wi-Fi, on the other hand, is a shared medium. Devices take turns transmitting, and that introduces variability, especially as more devices compete for airtime.
- Ethernet tends to deliver packets at consistent intervals
- Wi-Fi delivers packets in bursts, especially under load
Where You’ll Actually Notice This?
For single-room playback, most of this doesn’t matter. Buffered playback hides retransmissions, and the device can wait briefly if needed without affecting what you hear. Even with some network variability, audio remains stable.Multiroom playback is where the network becomes more visible. Each device maintains its own buffer and experiences its own network conditions, then continuously aligns with the rest of the group. Small differences in latency or retransmissions between devices require ongoing correction.
In these cases, if network throughput fluctuates or a device has a weaker connection, follower devices are more likely to exhibit stuttering rather than gradual drift.
Single Room vs Multiroom: Where Trouble Shows Up
Single Device Playback
- Buffer hides most network imperfections
- Retransmissions are absorbed silently
- Timing is local to one device
Very resilient. Differences between Ethernet and Wi-Fi are rarely noticeable unless the connection is unstable.
Multiroom Playback
- Multiple devices must maintain a shared timeline
- Each device experiences different network conditions
- Clock alignment (multicast timing signals)
- Buffer adjustments (speed up / slow down slightly)
|
Technical: Multicast, BPDU, and Keeping Devices in Sync
There are three parallel systems running at once:
Within an active multiroom group, one device typically operates as the timing reference, while others act as followers. These follower devices continuously align their playback using a combination of clock synchronization and buffer management. Multicast enables efficient one-to-many distribution of timing and coordination signals, allowing all devices in the group to receive updates simultaneously without duplicating traffic across individual connections. When multicast behavior is altered by the network, the underlying coordination mechanisms must compensate. Increased latency in multicast delivery introduces additional delay in timing updates, requiring follower devices to adjust playback more frequently. Packet loss can interrupt synchronization signals, forcing devices to rely more heavily on local clock estimation until the next update is received. In environments where multicast traffic is converted to unicast, coordination messages are replicated per device, increasing overall network load and introducing variability in delivery timing between endpoints. |
Further Reading
If you want to see how these network concepts show up in real-world setups, these threads are worth a read:
- How to adjust Qobuz streaming quality in the WiiM Home app
- How to add custom album artwork in your SMB music library
- Adding and accessing SMB shared files in the WiiM Home app
- Help me! WiiM Amp – FLAC streaming
- Spotify Lossless Not Working
- Intermittent Stuttering with WiiM Ultra
- Trouble Adding FLAC Stream (DNS / IPv6 Discussion)
- Can’t Get WiiM Pro to Play Hi-Res Radio