RIP Thich Nhat Hanh
Posted on 22nd of January 2022 | 0 words
Some time ago, I wrote a short post about my feelings towards web analytics which were sparked due to a spike in visitors on my site (mainly coming from Hacker News). Due to that surge, I decided to part ways completely from any sort of tracking since, for me, it was mainly an unnecessary dopamine fix rather than anything useful.
Today I stumbled upon big news on the front of the legitimacy of web analytics from the point of view of privacy. Turns out, as most suspected, it’s not so good, at least according to Austria’s data protection authority .
Basically, this case dates back to the invalidation of Privacy Shield data sharing system between the EU and the US because of overreaching US surveillance. Turns out that many companies in the US have largely ignored this invalidation happened in 2020, and despite this, they have still continued to transfer data from the EU to the US. The Austrian DPA held that the use of Google Analytics by an Austrian website provider led to transfers of personal data to Google LLC in the US in violation of Chapter V. of the GDPR .
In the long run, there will be two options: Either the US changes its surveillance laws to strengthen their tech businesses, or US providers will have to host the data of European users in Europe. This kind of transcontinental transfer is currently (as of the time of writing this) only illegal Austria, but Dutch’s DPA (data protection authority) has stated that Google Analytics “may soon no longer be allowed”.
In any case, this is a great thing for privacy in the EU, and hopefully, many more countries will join Austria in this effort. You can follow what countries have started to follow this at Is Google Analytics ILLEGAL in your country?
Lately, I have dedicated a large part of my free time to audio software. I have done this mainly out of interest in the subject due to my history in music. But at the same time, I also thought writing audio software could be a fun passion project or even a small business that I could work on alongside my day job. I don’t see myself replacing my current job with this, but maybe I could dedicate 20% of my work time to it.
The world of audio software is a pretty exciting place. It involves a lot of low-level systems like signals and real-time operations, complex math at times, and something you can feel or at least hear. And what’s great, I don’t have any background in this stuff!
Now I have programmed most of my life and played around with RTOS, but when it comes to writing algorithms for manipulating digital signals, that’s new stuff for me. However, I have experience with the topic from the user point of view since I have been making music for almost as long as I have programmed. This experience involves playing instruments, how effects affect the sound, how mixing and mastering works etc. But what do linear types have to do with any of this?
Like I said earlier, signal processing (not necessarily just audio) is very low-level stuff. So when working with signals in software, you often need to work with C or C++. This is mainly due to the performant and close-to-hardware nature of the languages required to handle and manipulate signals optimally and efficiently.
Digital signal processing is also full of algorithms. The standard workflow for people in this industry seems to be that these applications are prototyped on some high-level language before being produced. Often in languages/tools like MATLAB, Octave, Mathematica, and similar, very heavily math-oriented languages and tools. Julia has appeared to grow in popularity also in this world. These high-level languages are mainly used due to the speed of development.
It is also not uncommon to see FPGA being used in these applications. For several reasons, they are reconfigurable hardware, so you can tailor and deploy computation units and data buses specifically designed for your particular needs. So if you’re working with digital hardware, you can’t go wrong with FPGAs. In this world, VHDL or Verilog comes in handy.
As you can see, overall, the applications tend to involve a lot of different low-level concepts but, at the same time, high-level topics in terms of prototyping. But, as the post’s header might hint, I’m not interested in the prototyping aspects of signal processing since I think those are all well and good. Instead, I’m interested in having a small thought experiment on whether the low-level elements could be improved somehow.
I would consider myself a functional programmer first and foremost, even though I mainly write imperative and/or object-oriented code, at least professionally. Now in my free time and in non-trivial side projects (that are not signal processing related), I like to work with weird languages like Haskell or Common Lisp. Unfortunately, as I mentioned above, almost all the work in this signal-processing world is written in C or C++, emphasizing the latter. However, I completely understand why these languages are used since we talk about real-time programming, so latency needs to be minimized.
“Real-time” can be understood that the program has to produce the correct result but also on a certain amount of time (which varies between systems).
If we use audio processing as an example, typically, you would have some sort of processing function in your code that would work in the audio callback:
process :: BufferRef -> ()
This function would get its callback from a sound card or some input device, e.g. microphone. After it has received its callback, this block of code (whatever might be inside it) would write the corresponding audio data into the given buffer. Which would then be played by the speakers or vice-versa when recording.
This procedure is basically what should happen in real-time lots of times when we are doing audio processing. Audio software is often set up to send these audio callbacks from a high-priority “real-time” thread with a very short latency between the callbacks, ~1-10ms (varies between systems).
To achieve this minimal latency between callbacks, you often can’t rely on garbage collection since you can’t be sure when your program launches it. I dare to say that most of the software benefits from GC significantly, but in the audio, making GC right is very hard. What makes it hard is that if GC launches at the wrong time or the latency between callbacks gets too large, garbage data will leak into the buffer, causing unwanted sounds.
Most other software might only see a slight latency in their computations if they do profiling, so that might not be the end of the world, of course, depending on your context. But in audio, you cannot let that happen since you can literally hear that glitch, which is unforgivable.
When it comes to C or C++, I think everyone knows their foot guns that involve memory management. Thankfully in modern C++, it’s not that bad (as long as you follow core guidelines, ) but there is still a lot of unnecessary baggage when it comes to safe code in these languages.
Could there be any way we could use garbage-collected language while doing “real-time” operations and how that could be achieved?
GHC 9.0
introduced support for Linear Haskell, which can be enabled with
-XLinearTypes
. One of the significant use cases for linear types is
implementing latency-sensitive real-time services and analytics
jobs. As I mentioned earlier a major issue in this use case is GC
pauses, which can happen at arbitrary points for reasonably long
periods. The problem is exacerbated as the size of the working set
increases. The goal is to partially or entirely eliminate garbage
collection by controlling aliasing and making memory management
explicit but safe.
So what, then, are linear types. Henry Baker described linear types and their benefits in his paper Lively Linear Lisp — ‘Look Ma, No Garbage!’ and also on “Use-once” variables and linear objects: storage management, reflection and multi-threading . As you can see, we are not talking about a new topic. We are talking about types whose instances must be held in a linear variable. A variable is linear if it’s accessed exactly once in its scope. Same for a linear object, their reference count is always 1. When we have this safety guarantee on the type-level, we can avoid synchronization and GC and also, we could update linear objects in place since that would be referentially transparent.
So why can we avoid synchronization and GC with linear types? If we would consider the following function as an example:
linearFunc :: a %1-> b
On their own, linear types only gives a type to functions that consume their argument exactly once when their result is consumed precisely once. So, alone, they don’t make your programs any faster or safer for resources. Still, they allow you to write many optimizations and resource-safe abstractions that weren’t possible before.
First, since linear values can only be used once, these values cannot be shared. This means that, in principle, they shouldn’t be subject to GC. But this depends on the value consumer since that may very well do some sort of de-allocation on the spot. One way to mitigate this could be to store these values to heap outside of GC’s control.
While utilizing heap for these values alone would diminish the GC, it would introduce some overhead to your program, which could increase the total running time of your application. But if we continue using real-time systems as an example, this isn’t necessarily bad.
In real-time systems, optimizations often happen only to the worst-case scenarios. This is because you don’t really care about your latencies as long as they stay within the particular window. But you watch that those latencies should never go above your maximum limit, and this is primarily where optimisation utilizing linear types could come in handy.
Linear types are a blessing in GC languages if you intend to do anything safely in the low-level world. I would like to continue this post with some practical examples of how Haskell utilizes these types and how they can make low-level optimizations and resources safer in your Haskell code, but that deserves its own post.
This blog is a few years old, and during its time, it has had a couple of different style changes (and a couple of domain changes). Initially, this blog started in 2018 when I was in school, and I thought some blogging platform could improve my writing.
Most of the posts back then were some assignments for various school related works. The blog was a clever publishing platform for those and was accepted in many courses. Unfortunately, most of those have since been deleted and forgotten.
During those days, this was very tech-oriented, focusing more on writing various tutorials about different subjects, mainly around distributed computing. Then I landed my first job as a software engineer, where I could put the tutorials I had written to practice. Unfortunately, at about the same time, I started losing my motivation on regular writing, mainly due to time constraints.
Around late 2020, or early 2021, I started to rekindle my writing habit with different styles and topics. Then I began to write down my thoughts about the world around me. So many topics were about music, art, and some tech stuff here and there but not really “technical writing”. I was mainly sharing my opinions about the various topics that interested me. While I enjoyed this little bit more pondering writing style - some could call ranting - for me, it felt that it was missing something.
I missed the technical writing. But now, since I had established a particular style in my writing, it felt out of place to start writing about various geeky subjects revolving around computer science. That being said, all those writings in the tutorial and guidance way or these recent little bits of the more analyzing way of writing haven’t gone to waste. However, I feel that now I know what I want to write.
I think the combination of both will suit me the best. At the same time, I don’t necessarily see myself writing some basic how-to tutorials anymore. But I feel technical analysis with various concrete implementations would be interesting to research and write. My main interests generally lie in music and arts and how these intertwine with computer science. However, my professional life is linked to designing and implementing distributed systems, distributed computing in big data environments, systems engineering, performance, reliability, compilers and programming languages. I feel that those might also be regular visitors to my ramblings.
My professional life has also revolved around these topics. So I feel I can write a lot in conjunction with my work. Especially since working with large distributed systems, you’re bound to stumble upon some weird things, making this analysis-oriented technical writing an excellent tool for thought. I would want to have this semi-regular to regular writing habits that I once had, but that is something I can’t promise. Post at a time!
But since this is my blog, I feel that occasional rambles and shitposts about various topics are in place. Do people want to read those? Probably not, but it’s fun to vent every once in a while. So going forward, I think you can expect a significant portion of more technical commentary with a creative angle alongside a smaller portion of things that entertain and occasionally horrifies me.
I’m very prone to procrastination. While I wouldn’t say that I have focus issues, I have noticed that I can easily spend hours on non-essential sites that don’t bring anything to my life. Social media has been one of them. I have always had a pretty weird relationship with social media. I joined Facebook and Instagram a long time ago because many of my friends and family were already there. While I never did post stuff actively, I always noticed that I just ended up mindlessly surfing these, especially on Instagram.
A couple of years back, I became conscious of this and decided to delete my accounts on these platforms without giving too much thought to it. While leaving these platforms was pretty easy for me, I noticed that I had just replaced these with some other platform, YouTube in my case. After which, I started spending countless hours on that platform instead. Back then, I didn’t consider this habit as bad as mindlessly browsing Instagram or Facebook despite it being the same. I think I just rationalized it to myself as being educational or informative in a better way than other platforms.
A year or two passes without Instagram or Facebook completely fine, but I wanted to start using them again for some reason. Maybe I thought I had already been cured of this disease, so I could have a healthy relationship with them from now on. I also had professional reasons behind this since I thought these platforms offer a great way of marketing your art to others, which is true in some cases. However, I quickly noticed similar behaviour when I was last on these platforms. So after a couple of months of trying to get back in, I just felt repulsed by them and decided to leave them again. When it comes to marketing, that is not for me. I understand the benefits of being an artist in social media. Still, since I mainly enjoy that as a passionate hobby, I don’t see the need for being on social media.
So at the time of writing this, I think it has been about six months or so of living without these. Still, I’m conscious of my unnecessarily large usage of YouTube, News etc. While comparing my use with Instagram, I still wouldn’t consider watching YouTube or regularly checking news as bad as mindlessly scrolling through your feeds. I still noticed similar behaviour on those I struggled with, for example, with Instagram. I became conscious about randomly picking up my phone and scrolling through the news even though I had just read them or letting YouTube’s autoplay roll for long periods without giving it too much thought. So I wanted to tackle these habits.
I have noticed that the most extreme methods work the best when fixing some bad habit, at least in my case. So I didn’t want to ease when trying to have a healthy relationship with these applications but instead went cold turkey immediately.
So how has this worked for me? I think great! In the beginning, I noticed how much free time I have when I don’t spend on useless things. Also, initially, I occasionally picked up my phone by instinct. However, I quickly realized I had no applications to spend mindlessly surfing, so I quickly grew out of this habit. At first, I felt boredom slightly when I couldn’t spend time on these apps, but thankfully I realized that this spare time needed to be used elsewhere. Before this, I was already reading relatively a lot, about three to four books per month, but I have almost doubled that number nowadays. I also wrote about time management between multiple passions a while ago, where I pondered how I manage time between, for example, programming and music. After ditching distractive sites entirely, I have felt that the time management between these activities and my work life hasn’t been an issue. Finding time for various pet projects and serious work outside my life is straightforward since I don’t spend my time on useless stuff anymore.
Do I see myself using these applications in the future? Well, I want to read the news and continue to do so, not just constantly. I usually catch up with recent events in the morning, but I don’t desire to install any news apps on my telephone. When it comes to these streaming platforms, YouTube, Netflix, etc., I could live without them. There are lots of good information on these platforms, so if I need to watch some videos, I can allow myself to do so. However, I don’t want them to control my life in a way that I’m uncomfortable with.