Please Note: I was quite hesitant to publish this article, because I don't know exactly how this proxy detection method works (or if it works at all). I need your help to review this; please leave a comment if you know more.

Also visit my Proxy/VPN Detection Page

Introduction

The goal of this article is to reliably detect proxies using JavaScript (more accurately: by measuring latencies with JavaScript). There are many legitimate reasons for proxy usage, but unfortunately, criminals also tend to camouflage their Internet identity (IP address) by abusing proxies.

I hate dealing with latencies and deriving conclusions from them. It's improper science, and it's quite dirty, to be frank. But I fear that there is some truth hidden in latencies and that they must be considered when trying to detect sneaky bots and malicious traffic.

Let's put it this way: Have you ever attempted to browse a website through a proxy? Have you noticed that everything tends to load a bit slower? Probably!

Why should it be impossible to detect this delay and slowness using JavaScript? After all, JavaScript gives us a full programming language with (almost) limitless possibilities!

Looking at latencies is just another side-channel attack that JavaScript gives us. There are almost limitless possibilities when every website is permitted to execute arbitrary (sandboxed) code on client machines.

Mad Scientist
Me trying to detect proxies with latencies (Source: https://de.wikipedia.org/wiki/Verr%C3%BCckter_Wissenschaftler#/media/Datei:Mad_scientist.svg)

What do I even mean when I speak of detecting proxies with latency measurements?

There are essentially two ways bots use proxies in the real world:

  1. They make use of the SOCKS protocol. SOCKS5, the latest version of the protocol, is defined in RFC 1928. This RFC is from March 1996, so it has seen quite some days. SOCKS allows you to route any connection, either TCP or UDP, through a proxy server.
  2. Then there are HTTP proxies, implemented with the HTTP CONNECT method. This behavior is specified in RFC 7231, 4.3.6. CONNECT.

For RFC 1928, the most important part is:

When a TCP-based client wishes to establish a connection to an object that is reachable only via a firewall (such determination is left up to the implementation), it must open a TCP connection to the appropriate SOCKS port on the SOCKS server system. The SOCKS service is conventionally located on TCP port 1080. If the connection request succeeds, the client enters a negotiation for the authentication method to be used, authenticates with the chosen method, then sends a relay request. The SOCKS server evaluates the request, and either establishes the appropriate connection or denies it.

As you can read above, the SOCKS specification states that a SOCKS server basically has to glue two TCP streams together ("sends a relay request").
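
To make that concrete, here is a minimal sketch (in Node.js, all details hypothetical) of the relaying part that both SOCKS and CONNECT proxies have in common: accept a client connection, open a second TCP connection to the target, and glue the two streams together. Handshake parsing and error handling are omitted, and the target is hard-coded for illustration:

const net = require('net');

net.createServer((clientSocket) => {
  // A real SOCKS/CONNECT server would parse the target from the
  // client's relay request; it is hard-coded here for illustration.
  const upstream = net.connect(443, 'example.com', () => {
    clientSocket.pipe(upstream);   // client -> target
    upstream.pipe(clientSocket);   // target -> client
  });
  upstream.on('error', () => clientSocket.destroy());
  clientSocket.on('error', () => upstream.destroy());
}).listen(1080); // SOCKS is conventionally located on TCP port 1080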

For HTTP proxies, the most important part from RFC 7231, 4.3.6. CONNECT reads:

The CONNECT method requests that the recipient establish a tunnel to the destination origin server identified by the request-target and, if successful, thereafter restrict its behavior to blind forwarding of packets, in both directions, until the tunnel is closed. Tunnels are commonly used to create an end-to-end virtual connection, through one or more proxies, which can then be secured using TLS (Transport Layer Security, [RFC5246]).

My conclusion: Each TCP/IP stream, especially its connection establishment, costs several RTTs and thus time. A packet transmitted through two interconnected TCP streams from host A to host B takes more time than a packet flowing through a single TCP connection from host A to host B.
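
As a rough model (all numbers below are made up for illustration): a proxied round trip pays for both legs of the path, which is usually at least as much as the direct path alone:

// Rough latency model with made-up numbers -- illustration only.
const rttClientProxy  = 30; // ms, client <-> proxy
const rttProxyTarget  = 40; // ms, proxy <-> target
const rttClientTarget = 50; // ms, client <-> target, direct path

// A single proxied round trip crosses both legs; on top of that, the
// proxied path needs an extra TCP handshake to the proxy itself.
const directRoundTrip  = rttClientTarget;                 // 50 ms
const proxiedRoundTrip = rttClientProxy + rttProxyTarget; // 70 ms
console.log(directRoundTrip, proxiedRoundTrip);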

The Idea

The idea I had is quite simple:

I want to measure the RTT/network latency to three different IP addresses:

  1. The non-routable meta-address 0.0.0.0
  2. The IPv4 loopback address 127.0.0.1
  3. A differential public IP address: an Internet-reachable server, for example with the IP address 167.99.241.135

The port does not matter here. I chose an arbitrary port that is closed with high confidence. Of course, we cannot be sure that the port is closed, but the likelihood of a service listening on 127.0.0.1:43983 is rather small.

Conjecture:

If the latency measured with JavaScript to the address 0.0.0.0 is significantly larger than the latency to 127.0.0.1, the browser might be using a proxy.

Additionally, the latency to 0.0.0.0 must be in the same range as the latency to 167.99.241.135. Otherwise, the collected data is likely invalid.
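
Expressed as code, the conjecture could look roughly like this. This is only a sketch; the threshold factors are made-up assumptions, not calibrated values:

function median(values) {
  const s = [...values].sort((a, b) => a - b);
  const mid = Math.floor(s.length / 2);
  return s.length % 2 ? s[mid] : (s[mid - 1] + s[mid]) / 2;
}

// lat0, lat127, latPub: arrays of latency samples (in ms) measured
// to 0.0.0.0, 127.0.0.1 and the public differential IP, respectively.
function looksLikeProxy(lat0, lat127, latPub) {
  const m0   = median(lat0);
  const m127 = median(lat127);
  const mPub = median(latPub);
  // 0.0.0.0 significantly slower than 127.0.0.1 ...
  const discrepancy = m0 > 5 * m127;
  // ... and in the same range as the public IP (sanity check).
  const sameRange = m0 > mPub / 3 && m0 < 3 * mPub;
  return discrepancy && sameRange;
}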

Explanation

But why should the latency to 0.0.0.0 be significantly larger than the latency to 127.0.0.1 when we use a proxy?

Honestly, I don't know for sure (!).

According to this Hacker News thread, the IP 0.0.0.0 should not point to localhost when entered into the Chromium address bar. But according to this Chromium bug tracker discussion, that is exactly what it does:

Allow explicit navigations to "0.0.0.0" to support systems where this performs a navigation to localhost (in defiance of specs... but seemingly common).

We can take a quick look at autocomplete_input.cc in the Chromium code base, where input to the Chromium address bar is handled:

/* components/omnibox/autocomplete_input.cc */

// For hostnames that look like IP addresses, distinguish between IPv6
// addresses, which are basically guaranteed to be navigations, and IPv4
// addresses, which are much fuzzier.
if (host_info.family == url::CanonHostInfo::IPV6)
  return metrics::OmniboxInputType::URL;
if (host_info.family == url::CanonHostInfo::IPV4) {
  // The host may be a real IP address, or something that looks a bit like it
  // (e.g. "1.2" or "3232235521").  We check whether it was convertible to an
  // IP with a non-zero first octet; IPs with first octet zero are "source
  // IPs" and are almost never navigable as destination addresses.
  //
  // The one exception to this is 0.0.0.0; on many systems, attempting to
  // navigate to this IP actually navigates to localhost.  To support this
  // case, when the converted IP is 0.0.0.0, we go ahead and run the "did the
  // user actually type four components" test in the conditional below, so
  // that we'll allow explicit attempts to navigate to "0.0.0.0".  If the
  // input was anything else (e.g. "0"), we'll fall through to returning QUERY
  // afterwards.
  if ((host_info.address[0] != 0) ||
      ((host_info.address[1] == 0) && (host_info.address[2] == 0) &&
        (host_info.address[3] == 0))) {
    // This is theoretically a navigable IP.  We have four cases.  The first
    // three are:
    // * If the user typed four distinct components, this is an IP for sure.
    // * If the user typed two or three components, this is almost certainly a
    //   query, especially for two components (as in "13.5/7.25"), but we'll
    //   allow navigation for an explicit scheme or trailing slash below.
    // * If the user typed one component, this is likely a query, but could be
    //   a non-dotted-quad version of an IP address.
    // Unfortunately, since we called CanonicalizeHost() on the
    // already-canonicalized host, all of these cases will have been changed
    // to have four components (e.g. 13.2 -> 13.0.0.2), so we have to call
    // CanonicalizeHost() again, this time on the original input, so that we
    // can get the correct number of IP components.
    //
    // The fourth case is that the user typed something ambiguous like ".1.2"
    // that fixup converted to an IP address ("1.0.0.2").  In this case the
    // call to CanonicalizeHost() will return NEUTRAL here.  Since it's not
    // clear what the user intended, we fall back to our other heuristics.
    net::CanonicalizeHost(base::UTF16ToUTF8(original_host), &host_info);
    if ((host_info.family == url::CanonHostInfo::IPV4) &&
        (host_info.num_ipv4_components == 4))
      return metrics::OmniboxInputType::URL;
  }

The important part from the comment reads:

The one exception to this is 0.0.0.0; on many systems, attempting to navigate to this IP actually navigates to localhost.

According to Wikipedia, the address 0.0.0.0 has a special meaning in routing contexts:

In the context of routing tables, a network destination of 0.0.0.0 is used with a network mask of 0 to depict the default route as a destination subnet. This destination is expressed as 0.0.0.0/0 in CIDR notation. It matches all addresses in the IPv4 address space and is present on most hosts, directed towards a local router.

Maybe when Chrome uses a proxy server, it routes requests going to 0.0.0.0 through the proxy server, but not requests going to 127.0.0.1. This discrepancy could be measurable. Could this be correct? I DON'T KNOW.
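
If that guess were right, the discrepancy should disappear once 0.0.0.0 is explicitly excluded from proxying. Chromium accepts a bypass list on the command line, so one way to test this hypothesis (untested, just an idea) would be:

chromium-browser --proxy-server=http://localhost:8947 --proxy-bypass-list="0.0.0.0" https://incolumitas.com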

Experimental Setup

In order to take latency measurements, I used the following JavaScript code (using fetch()):

function measureImageLatenciesFetch(url) {
  const N = 5;
  return new Promise((resolve) => {
    const ts = [];
    // Push one measurement; resolve once all N requests have come back.
    const record = (t0) => {
      const elapsed = performance.now() - t0;
      ts.push(parseFloat(elapsed.toFixed(3)));
      if (ts.length === N) {
        resolve(ts);
      }
    };
    for (let i = 0; i < N; i++) {
      const t0 = performance.now();
      // Closed port plus a timestamped path as cache buster, so every
      // request actually hits the network stack.
      const fullUrl = url + (44435 + i) + '/' + Date.now() + '.png';
      // The request is expected to fail (closed port), so the latency is
      // normally recorded in catch(); recording in then() as well keeps
      // the promise from hanging if a request unexpectedly succeeds.
      fetch(fullUrl).then(() => record(t0)).catch(() => record(t0));
    }
  });
}

for (let url of ['https://0.0.0.0:', 'https://127.0.0.1:', 'https://167.99.241.135:']) {
  measureImageLatenciesFetch(url).then((latencies) => {
    console.log(url, latencies)
  });
}

No Proxy Server

When I do not use a proxy with my Google Chrome browser, I get the following latencies:

https://0.0.0.0: [21.9, 21.6, 22, 21.6, 21.3]

https://127.0.0.1: [23.4, 22.9, 22.4, 22, 26.3]

https://167.99.241.135: [65.6, 64.3, 75.5, 77.3, 79.8]

Observation: The 0.0.0.0 latencies are in the same range as the 127.0.0.1 latencies.

With Proxy Server

However, when I start my Chrome browser with a local forwarding proxy server (which in turn uses a remote SOCKS5 upstream proxy) using the command

chromium-browser --proxy-server=http://localhost:8947 https://incolumitas.com

I obtain the following latencies:

https://127.0.0.1: [36.9, 37, 38.3, 37.9, 37.4]

https://0.0.0.0: [262.6, 264.8, 325.1, 327.2, 330.4]

https://167.99.241.135: [379.2, 407.9, 410.3, 424, 428.8]

Observation: The 0.0.0.0 latencies are significantly higher than the 127.0.0.1 latencies, and they are in the same range as the 167.99.241.135 latencies.

With VPN Enabled

And when I use a VPN server (with my Android smartphone and OpenVPN Connect), this is what I obtain:

https://127.0.0.1: [15, 21, 24, 24]

https://0.0.0.0: [18, 18, 24, 24]

https://167.99.241.135: [81, 85, 86, 87]

Observation: No discrepancy when using a VPN.

More Data from the Real World

Because I record visitors on my Proxy/VPN Detection page, I managed to collect some real-world data to check whether this method works. In most of the cases below, I had sufficient grounds to believe that the visitors were in fact using a proxy, since other proxy detection tests indicated proxy usage:

"0.0.0.0": [
  330.7,
  334.4,
  339.8,
  351.5
],
"127.0.0.1": [
  24,
  24.9,
  22.4,
  21.5
],
"167.99.241.135": [
  525.1,
  1145.5,
  1161.8,
  4161.2
],
"0.0.0.0": [
  320.8,
  323.8,
  328.5,
  331
],
"127.0.0.1": [
  13.8,
  14.6,
  14.8,
  17.5
],
"167.99.241.135": [
  789,
  793.7,
  796.5,
  798.4
],
"0.0.0.0": [
  138.2,
  138.5,
  143.1,
  143.3
],
"127.0.0.1": [
  3.6,
  3.9,
  3.9,
  3.9
],
"167.99.241.135": [
  144,
  144.2,
  144.3,
  145.2
],
"0.0.0.0": [
  138.8,
  143.5,
  144,
  144.4
],
"127.0.0.1": [
  3.6,
  3.8,
  3.8,
  3.9
],
"167.99.241.135": [
  137.8,
  142.8,
  143.6,
  145.1
],
"0.0.0.0": [
  209.2,
  210.3,
  251.4,
  252
],
"127.0.0.1": [
  4.3,
  4.6,
  4.7,
  4.8
],
"167.99.241.135": [
  351.3,
  357.6,
  409.9,
  410.1
],
"0.0.0.0": [
  646.5,
  662.5,
  663.1,
  665.2
],
"127.0.0.1": [
  4,
  4,
  4.3,
  4.3
],
"167.99.241.135": [
  337.5,
  343.9,
  350.4,
  351.1
],

Bad Data / Rubbish

Then again, I sometimes recorded very weird data from otherwise normal browsers. Why are the latencies so huge? I could not find an explanation. First, I assumed that maybe the JavaScript main thread was slow, so I removed all other JavaScript code, but I still sometimes got weirdly large latency measurements.

"0.0.0.0": [
  5.4,
  5.3,
  5,
  5.1
],
"127.0.0.1": [
  2004.4,
  2004.3,
  2004.7,
  2006.4
],
"167.99.241.135": [
  3420.8,
  3486.6,
  3485.6,
  3486.5
],
"0.0.0.0": [
  3.4,
  3.3,
  3.2,
  3.1
],
"127.0.0.1": [
  2017.3,
  2017.3,
  2017.1,
  2017
],
"167.99.241.135": [
  2206.9,
  2240.6,
  2240.6,
  2241
],
"0.0.0.0": [
  3,
  3,
  3,
  2.9
],
"127.0.0.1": [
  2032.1,
  2031.7,
  2048.1,
  2048.3
],
"167.99.241.135": [
  2272.2,
  2329.2,
  2329.9,
  2330.2
],

And absolute rubbish such as:

"0.0.0.0": [
  393.2,
  413.6,
  2472.2,
  2876.9
],
"127.0.0.1": [
  2048,
  2047.9,
  2048.6,
  2048.5
],
"167.99.241.135": [
  396.6,
  4074,
  4468.9,
  7171.9
],

Somewhat Rubbish Data

And then there are cases where I just don't know what to say.

I am quite confident that this latency measurement is from a browser that uses a proxy (again, mostly because of other proxy detection tests):

"0.0.0.0": [
  95.2,
  95.2,
  95.5,
  96.3
],
"127.0.0.1": [
  2031.7,
  2047.3,
  2048,
  2047.8
],
"167.99.241.135": [
  182.4,
  182.9,
  183.6,
  183
],

The 0.0.0.0 latencies are too large and in a similar range to the 167.99.241.135 latencies. But the 127.0.0.1 latencies are just unreasonably huge?! Like WTF?

Help Needed

Do you think my suggested method to detect proxies works? I need your help!

Why are latencies to 0.0.0.0 significantly larger than latencies to 127.0.0.1 when using a SOCKS / HTTP proxy?

Maybe because the browser routes requests to 0.0.0.0 through the upstream proxy, but not requests to 127.0.0.1? The only way to find out would be to take a look at the Chrome proxy implementation that is used when Chrome is invoked with the command line flag --proxy-server=http://localhost:8947.