High frequency trading in Node.js

Imagine you need to choose a technology for a high-frequency trading project that executes trades on crypto exchanges. What would you go with?

This is exactly the situation we found ourselves in two years ago. We had several heated discussions about it among our developers. There were many opinions and arguments, a lot of languages were explored and considered, and their pros and cons were voiced repeatedly.

When the dust settled, we had identified two and a half requirements that resonated most:

First, it’s in the name – high-frequency trading technology: we need speed. Not the Wall Street high-frequency trading speed, but a very decent speed. The system today processes more than 10k updates/second on a single machine. It also does lots of I/O—our database tortures the disk with 100MB/s of I/O, and machines running our code can easily top 200GB of logs per day.

Second, we need extreme agility. The crypto space is often compared to the Wild West, and everything can change in a heartbeat. New exchanges pop up and go bankrupt every month. Also, there are only two kinds of APIs: brittle and extremely brittle.

Finally, we like to write code in a functional way, because it’s the easiest way to avoid bugs by design.

As you probably already guessed, we chose Node.js and TypeScript.

Performance: Node.js is faster than you’d think

Backed by V8, the JavaScript engine of Google Chrome, Node.js can in many cases perform in the same ballpark as native code, especially if you focus on the hot path and let the JIT do its work there. Node.js is certainly much faster than, for instance, Python, which is what Alameda Research (a big market maker in the crypto space) used for their first prototypes.

Compared to C(++), Java, or even Python, Node.js has another nice property: it is designed around a non-blocking API first, and instead of using a huge number of threads, it runs a single event loop that schedules your asynchronous tasks. In the world of streaming updates from crypto exchanges, this is a very good fit.

General tips on how to optimize Node.js performance

Before we move on, here is a quick recap of some best practices and Node.js performance tips:

Use monitoring tools like Clinic.js, the Node.js built-in profiler, and Chrome DevTools to gather detailed insights before you start optimizing Node.js performance.
Keep an eye on memory and CPU management to improve Node.js performance by regularly checking process metrics and employing garbage collection strategies to optimize resource usage.
Don’t overlook the built-in tooling: the Node.js profiler and debugger can be crucial for identifying performance bottlenecks and optimizing code execution in real-time environments.
Optimize asynchronous operations – Node.js operates on an event-driven, non-blocking I/O model. You can manage asynchronous operations smartly using Promises and async/await to keep your code clean and avoid blocking the event loop.
Apply effective memory management practices using tools like heapdump to monitor and prevent memory leaks.
Optimize database interactions through smart querying and pagination to efficiently manage data flow and minimize unnecessary data fetching.
Enhance communication protocols by implementing HTTP/2, taking advantage of header compression and multiplexing for more efficient data handling.
Implement appropriate server and client timeouts to avoid delays caused by slow network requests.
Improve Node.js performance by using its clustering capabilities to distribute load across multiple CPU cores effectively.
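
As an illustration of the timeout tip above, here is a minimal sketch of a client-side timeout built on Promise.race. The helper name `withTimeout` is hypothetical and not part of Node.js or of our codebase. Note that the sketch only stops waiting; it does not cancel the underlying request.

```typescript
// Hypothetical helper: rejects if `work` does not settle within `ms` milliseconds.
function withTimeout<T>(work: Promise<T>, ms: number): Promise<T> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<never>((_resolve, reject) => {
    timer = setTimeout(() => reject(new Error(`timed out after ${ms} ms`)), ms);
  });
  // Whichever promise settles first wins. The timer is cleared in both
  // outcomes so that it cannot keep the process alive.
  return Promise.race([work, timeout]).then(
    (value) => {
      clearTimeout(timer);
      return value;
    },
    (err) => {
      clearTimeout(timer);
      throw err;
    },
  );
}
```

A call such as withTimeout(someRequest(), 500), where someRequest is any function returning a promise, then fails fast instead of waiting indefinitely on a slow exchange.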

What about all that garbage: JavaScript optimization

JavaScript is a garbage-collected language. GC pauses are nasty, but again, with the right design, you can optimize a lot here. You may be curious what the heaviest producers of unnecessary garbage in such a system could be. For us, the two most relevant areas for JavaScript optimization were the following:

1. String concatenation

Did you know that V8 can very efficiently concatenate strings? Values such as

const value = 'nodejs' + 'surely' + 'can' + 'concatenate' +
  'strings' + 'quickly' + 'right?'

are not immediately materialized into a single contiguous string. Instead, V8 represents them as a tree of concatenation nodes (a rope, called ConsString in the V8 source). Every node is a separate heap object that keeps its parts alive, so this puts a lot of pressure on the garbage collector: many “hot” strings in our codebase were produced by a logging/metrics system that assembled its data using concatenation. We relieved some GC pressure by forcing V8 to flatten strings in our logging infrastructure. With 200GB/day of logs, this is not something to be taken lightly.
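
The text above does not say how the flattening was done, so the following is only a sketch of one commonly cited trick, not necessarily what we used. It assumes that converting a string to a number makes V8 read the whole string and thereby flatten it. This is an engine implementation detail that may change between V8 versions, so measure before relying on it. The names `flatten` and `buildLogLine` are hypothetical.

```typescript
// Assumption: Number(s) forces V8 to read the full string contents, which
// flattens a concatenation tree into one contiguous buffer. This is not a
// language guarantee, only observed engine behavior.
function flatten(s: string): string {
  Number(s); // result intentionally ignored
  return s;
}

// Hypothetical log-line builder: many small concatenations in a loop are
// exactly the pattern that produces deeply nested concatenation trees.
function buildLogLine(fields: Record<string, string | number>): string {
  let line = "";
  for (const key of Object.keys(fields)) {
    line += key + "=" + fields[key] + " ";
  }
  // Flatten once, before the string is handed to long-lived buffers or queues.
  return flatten(line);
}
```

The returned value is the same string as before, so the change is invisible to callers; only the internal representation is expected to differ.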

2. Data layout

For example, when representing things such as order books, you would naturally use a format such as:

type OrderBook = {
  asks: PriceLevel[],
  bids: PriceLevel[]
}

where:

type PriceLevel = {price: number, amount: number}

This design creates lots of tiny objects (one per price level), which means more work for the GC. The situation improves quickly if you turn these objects into parallel arrays:

type OrderBook = {asks: Side, bids: Side}

where:

type Side = {price: number[], amount: number[]}
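
To make the difference concrete, here is a small sketch that converts the object-per-level layout into the parallel-array layout and reads from it. The helpers `toSide` and `volumeUpTo` are hypothetical names used only for illustration, and the sketch assumes that both arrays of a Side always have the same length.

```typescript
type PriceLevel = { price: number; amount: number };
type Side = { price: number[]; amount: number[] };

// Convert an array of small objects into two parallel arrays.
// After conversion, a side holds two arrays instead of one object per level.
function toSide(levels: PriceLevel[]): Side {
  const side: Side = { price: [], amount: [] };
  for (const level of levels) {
    side.price.push(level.price);
    side.amount.push(level.amount);
  }
  return side;
}

// Sum the amount offered at or below `limit`; index i addresses the same
// price level in both arrays.
function volumeUpTo(asks: Side, limit: number): number {
  let total = 0;
  for (let i = 0; i < asks.price.length; i++) {
    if (asks.price[i] <= limit) total += asks.amount[i];
  }
  return total;
}
```

The trade-off is that a single price level is no longer a value you can pass around, so code that works with levels has to carry an index instead.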

TypeScript type safety is essential

I’m sorry, type-haters, it really is. Especially for a product like this one. We’re all well aware that TypeScript type safety doesn’t imply correctness, but if you can prevent a critical bug by writing proper TypeScript annotations, surely you want to do that.

In general, TypeScript is a great remedy for JavaScript’s nasty surprises, especially if you adopt a culture of strict typing, code reviews, and linter rules. You can always opt out into untyped code, for instance, if you are just prototyping.

Runtime validation of external data

Maintaining correct typings for third parties is hard. Many times, third parties will send you messages in a different format than the one advertised on their documentation page. You definitely don’t want to execute unintended trades and lose a lot of investors’ money because a certain exchange suddenly sends you “1.234” (a string) instead of 1.234 (a float). Such small-but-important differences are surprisingly frequent in exchanges’ APIs.

To deal with brittle APIs, we need robust runtime input validation. We already have good TypeScript type annotations, so the obvious design would be to generate a runtime type check out of these. Many people have already solved this same issue, so this should be a piece of cake, right? Well, no. We tried several libraries, but none of them had everything we needed: some need a schema but do not produce good TypeScript types, while others were too slow at validating inputs.

Finally, we decided to write a custom schema DSL and generator that produces both TypeScript types and a highly optimized validation routine. Part of the optimization was the realization that we do not need to validate arbitrary JavaScript objects. Instead, we always run validation on the result of JSON.parse(). This has a huge benefit: by definition, the objects being validated aren’t cyclic, and they cannot contain classes, functions, or other funky JS stuff.
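
Our DSL and generator are not shown here, so the following is only a hand-written sketch of the kind of routine such a generator could emit for a simple price level message. The names `Result` and `validatePriceLevel` are hypothetical. Because the input is assumed to come straight from JSON.parse(), the check can stay shallow: no cycle detection, no prototype inspection, and no NaN or Infinity handling.

```typescript
type PriceLevel = { price: number; amount: number };

type Result<T> = { ok: true; value: T } | { ok: false; error: string };

// Validates a value produced by JSON.parse() against the PriceLevel shape.
function validatePriceLevel(input: unknown): Result<PriceLevel> {
  if (typeof input !== "object" || input === null || Array.isArray(input)) {
    return { ok: false, error: "expected an object" };
  }
  const obj = input as Record<string, unknown>;
  const price = obj["price"];
  const amount = obj["amount"];
  // A string such as "1.234" is rejected here instead of being coerced silently.
  if (typeof price !== "number") {
    return { ok: false, error: "price must be a number" };
  }
  if (typeof amount !== "number") {
    return { ok: false, error: "amount must be a number" };
  }
  return { ok: true, value: { price, amount } };
}
```

For example, validatePriceLevel(JSON.parse('{"price":"1.234","amount":2}')) reports an error for price, while the same message with a numeric price passes and yields a typed PriceLevel.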

JSON (De)Serialization is surprisingly slow

This caused several headaches for the engineering team. There are several reasons why JSON is slow:

JSON is very chatty. It introduces a lot of redundancy (e.g. long field names) into the protocol.
JSON is UTF-8 based. This is a very nice property, except that UTF-8, being a variable-length encoding, is hard to parse quickly.
JSON in our use case requires a ton of CPU to constantly format floats as strings and then parse strings back into floats, an operation we do a lot in our architecture.

Of course, we cannot choose the data format when communicating with third-party APIs, but the majority of the traffic we send is between internal parts of the project. Therefore we had the freedom to replace JSON with something else. After some experiments, we went with Avro. For this, we rely heavily on our DSL schema generator, which can now produce TypeScript types, JSON validation routines, and Avro schema definitions. Think of all this as Google Protobuf on steroids. The result is lightning-fast (de)serialization of internal messages.
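
To illustrate the float formatting point, here is a sketch that writes numbers as fixed-width 8-byte little-endian doubles, which is how Avro's binary encoding represents the double type. This is not an Avro implementation: there is no schema, no record or array framing, and no Avro library involved. The names `encodeDoubles` and `decodeDoubles` are hypothetical.

```typescript
// Write each number as an 8-byte little-endian IEEE 754 double.
// No float-to-string formatting happens at any point.
function encodeDoubles(values: number[]): Uint8Array {
  const buffer = new ArrayBuffer(values.length * 8);
  const view = new DataView(buffer);
  values.forEach((v, i) => view.setFloat64(i * 8, v, true));
  return new Uint8Array(buffer);
}

// Read the doubles back; the result is bit-for-bit identical to the input.
function decodeDoubles(bytes: Uint8Array): number[] {
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  const out: number[] = [];
  for (let offset = 0; offset < bytes.byteLength; offset += 8) {
    out.push(view.getFloat64(offset, true));
  }
  return out;
}
```

Every value costs exactly 8 bytes regardless of how many digits its decimal form has, and decoding is a fixed-size memory read rather than a string parse.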

It’s just the beginning for high-frequency trading technology

Node.js allowed us to build a working product quickly and get to know the space we’re operating in. Armed with this knowledge, we are finally ready to move to the next stage: a slow rewrite to a language with high-performance guarantees. After weighing all the pros and cons, we’ve chosen Rust. While it has a steep learning curve and puts strict constraints on the design, it is very good at guaranteeing memory and thread safety. This is exactly what you need when you need to go fast but can’t risk losing your investors’ money on stupid mistakes.

Review

What is high-frequency trading?

High-frequency trading (HFT) is a form of algorithmic trading that involves executing thousands of orders at extremely high frequencies. Traders use sophisticated algorithms to capitalize on small price discrepancies and market inefficiencies, and they usually trade in milliseconds or microseconds. HFT is characterized by high turnover rates and order-to-trade ratios, seeking short-term investment horizons and infinitesimal profits per trade but executing many transactions to achieve substantial returns.

Why do you need optimized technology for high-frequency trading?

High-frequency trading relies heavily on highly optimized technology due to its dependency on speed. Even milliseconds can define the success of a trade, making advanced hardware and software essential for rapid data processing and trade execution. Efficient connectivity, such as direct data feeds and strategic server placements near exchanges, also plays a critical role. Precision in trade execution minimizes errors and slippage, ensuring trades match the traders’ exact specifications. The high stakes involved in HFT necessitate solid and secure systems to prevent costly crashes or breaches. Integrating technologies like Node.js helps manage these demands effectively, enhancing backend performance for high-traffic applications with its efficient handling of asynchronous operations and multiple connections.

Summary

Some may think that spending two years optimizing Node.js just to rewrite the project in Rust is a complete waste of time. We see it differently, though. Only in retrospect can we see how crazily complicated our problem is. Had we started with Rust, we’d be both nervous wrecks and probably bankrupt, too. As a good rule of thumb: properly understand the problem before investing a lot of effort into low-level optimizations.
If such an approach resonates with you, and you’re curious to work on similar projects… we’re hiring!


Lessons Learned From Writing a High-frequency Trading Engine in Node.js

Peter Peresini
