<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Arnab Sen's Technical blogs]]></title><description><![CDATA[Software Developer @ Google • Tech blogger 

Read more about me: arnabsen.dev/about
Resume: resume.arnabsen.dev]]></description><link>https://arnabsen.dev</link><generator>RSS for Node</generator><lastBuildDate>Tue, 14 Apr 2026 18:27:02 GMT</lastBuildDate><atom:link href="https://arnabsen.dev/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Genesis of Google's Web Ranking Dynasty]]></title><description><![CDATA[Hi, glad you landed on this blog. I am Arnab Sen, a Software Engineer at Google. In our previous blog, we delved into the intriguing world of Graph Databases and their inner workings. Today, we'll take a detour from the technical details to explore a...]]></description><link>https://arnabsen.dev/pagerank</link><guid isPermaLink="true">https://arnabsen.dev/pagerank</guid><category><![CDATA[Tutorial]]></category><category><![CDATA[Google]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sat, 29 Jun 2024 17:39:07 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1719682493972/334d8445-6e87-4d62-b415-ba44deaac2e0.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hi, glad you landed on this blog. I am Arnab Sen, a Software Engineer at Google. In our previous blog, we delved into the intriguing world of Graph Databases and their inner workings. Today, we'll take a detour from the technical details to explore a practical application of Graphs – PageRank, the algorithm that powers Google Search!</p>
<h2 id="heading-introduction">Introduction</h2>
<p>Imagine the internet before Google. It was a bit like a sprawling, uncharted territory. When <strong>Sir Tim Berners-Lee</strong> first created the World Wide Web, he actually kept a list in the back of his notebook of the new websites that came online.</p>
<p><img src="https://cds.cern.ch/images/CERN-GE-9407011-31/file?size=large" alt="Tim Berners-Lee" /></p>
<p>Then search engines emerged to make it easy to find relevant websites. Early search engines like Yahoo! relied on humans to categorize websites, resulting in often inaccurate or incomplete results.</p>
<p><img src="https://www.researchgate.net/publication/225284426/figure/fig1/AS:302806805303299@1449206176271/Yahoo-Search-Engine.png" alt="Yahoo Search Engine | Download Scientific Diagram" /></p>
<p>Searching for information was a tedious and often frustrating process. It was clear that a better way was needed to navigate the ever-growing expanse of the web.</p>
<p><img src="https://media.wired.com/photos/5de6deca453df400086ddd80/master/pass/Biz-google-524309146.jpg" alt="Larry Page and Sergey Brin Hand Over Alphabet's Reins | WIRED" /></p>
<p>Enter Larry Page and Sergey Brin, two Stanford PhD students with a radical idea. They envisioned a search engine that would rank websites not just by keywords, but by their importance and relevance within the vast network of the web. This was the birth of PageRank, the trillion-dollar algorithm that would propel Google to become the dominant force it is today.</p>
<p><img src="https://cdn.statcdn.com/Infographic/images/normal/29267.jpeg" alt="Chart: Google's Search Dominance | Statista" /></p>
<p><a target="_blank" href="https://www.cis.upenn.edu/~mkearns/teaching/NetworkedLife/pagerank.pdf">Link to the paper: The PageRank Citation Ranking</a></p>
<h2 id="heading-mapping-the-web"><strong>Mapping the Web</strong></h2>
<p>Most webpages link to other <em>relevant</em> webpages. For example, this page links to the original paper. When the paper was written, the web had 150 million pages and 1.7 billion links. Each page can be seen as a node, and the links as edges connecting one node to another.</p>
<p>So, if webpages A and B link to webpage C, it should look something like this:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719677943108/dbb77578-26f9-49ae-926b-cdbb74c22bce.png" alt class="image--center mx-auto" /></p>
<p>Here A and B are called backlinks of C.</p>
<p><strong>Why are these backlinks important?</strong> The idea is that "<mark>highly linked pages are more important than pages with few links.</mark>" For example, many blogs and articles about PageRank will link to the original paper, supporting the hypothesis that the original paper is more important. This is similar to citations in academic research; usually, the more impactful a publication is, the more it gets cited.</p>
<p>Another observation the authors pointed out is that any <mark>outgoing links from these relatively important pages should also be given higher importance</mark>. PageRank tries to determine how well the importance of a page can be estimated from its link structure.</p>
<p>PageRank's brilliance lies in its smart use of a mathematical concept called the Markov Chain.</p>
<p>Mathematically, a <strong>Markov Chain</strong> is a model that describes a sequence of possible events where the probability of each event depends only on the state attained in the previous event. It's like a random walk where your next step is determined by a set of probabilities associated with your current location.</p>
<p>First, let's understand how the authors of the paper define <strong>PageRank</strong>.</p>
<h2 id="heading-defining-pagerank"><strong>Defining PageRank</strong></h2>
<p>If a webpage A has 4 outgoing links, the <code>Rank(A)</code> is divided equally among the 4 links. So, if webpage C has backlinks from 2 different webpages, then the <code>Rank(C)</code> will be the sum of the ranks from each link.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719678250800/362c8de3-de36-4b49-90b6-24c05ba85cc5.png" alt class="image--center mx-auto" /></p>
<p>More formally:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719678757791/7a5f7c0b-fc0e-44e7-b9e5-eae489050616.png" alt class="image--center mx-auto" /></p>
<blockquote>
<p>The paper includes a constant factor but let's keep things simple for now.</p>
</blockquote>
<p>But then how do we get the Ranks of <code>a.com</code> and <code>b.com</code>? Well, we do the same process, recursively.</p>
<p>What if there is an outgoing link from <code>c.com</code> to <code>a.com</code>? Then once we compute the rank of <code>c.com</code>, we have to recompute the rank of <code>a.com</code>, which in turn changes the rank of <code>c.com</code> again. This is a stochastic process, and we can use the mathematical model of Markov Chains to handle it.</p>
<p>And the PageRank formula described above can be represented using matrices as follows. At any point we can define A as a square matrix with the rows and columns corresponding to the web pages.</p>
<pre><code class="lang-c">if (page u links to page v):
    A[u][v] = 1 / N[u];  // N[u]: the number of outgoing links from u
else:
    A[u][v] = 0;
</code></pre>
<p>If R is a vector over all the webpages then we can describe it as:</p>
<p><strong>R = cAR</strong></p>
<p>Here c is a normalization factor, to ensure that the total rank of all webpages is constant. And we can keep computing this until we don’t see any change in the value of the vector R. That final value is called the <strong>Stationary Distribution of a Markov Chain</strong>. But will that always exist? To know we first have to understand a few more terms:</p>
<h3 id="heading-reducible-markov-chains"><strong>Reducible Markov Chains</strong></h3>
<p>A Markov chain is considered reducible if its states can be divided into two or more disjoint sets, where it's impossible to move from one set to another. In other words, there are "barriers" within the chain that prevent transitions between certain groups of states.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719679409655/5f9e7954-a96e-4e6e-95f7-7eba66c1196c.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-periodic-markov-chains"><strong>Periodic Markov Chains</strong></h3>
<p>A Markov chain is considered periodic if there is a fixed pattern or cycle in the way it revisits certain states.</p>
<p>In both these cases, finding a stationary distribution becomes tricky.</p>
<h3 id="heading-problem-of-rank-sink"><strong>Problem of Rank Sink</strong></h3>
<p>But we have another problem. Consider this case: <code>b.com</code> points to <code>a.com</code>, but <code>a.com</code> points only to <code>c.com</code>, and <code>c.com</code> points only to <code>a.com</code>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719679545298/83234534-7de0-42c3-8f2f-e3a7864bce1c.png" alt class="image--center mx-auto" /></p>
<p>What will eventually happen is that all the rank in the chain gets trapped in the cycle between <code>a.com</code> and <code>c.com</code>. The loop forms a trap, which is called the <strong>rank sink</strong>.</p>
<p>The authors tackled these ingeniously by a method called...</p>
<h2 id="heading-random-surfer-model"><strong>Random Surfer Model</strong></h2>
<p>Sounds fancy, but it's simple. Just think about what you would do in such a scenario: you visit a webpage that has just one link to another webpage, and that page links straight back to the original one. Will you keep clicking and going in loops? Obviously not; you will just move on to a different page. This is what the random surfer model captures: occasionally a random surfer gets bored and jumps to a random page chosen from some distribution.</p>
<p>This super simple solution is enough to tackle all the problems we described above. </p>
<p>Mathematically, we can assume that there is an outgoing link from every node to every other node. Then, to compensate for these extra links, we can tweak the factor <code>c</code> to normalize the ranks again.</p>
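<p>Putting the random-surfer idea together with the matrix formulation, here is a minimal Python sketch of the iteration. This is my own illustration, not the paper's implementation; it assumes dangling links have already been removed, so every page has at least one outgoing link.</p>

```python
def pagerank(links, d=0.85, tol=1e-9):
    """Power iteration with the random-surfer tweak (a sketch).
    links: dict mapping each page to the list of pages it links to."""
    pages = sorted(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}        # initial assignment: uniform
    while True:
        # Every page gets (1 - d) / n from the "bored surfer" random jump...
        nxt = {p: (1 - d) / n for p in pages}
        # ...plus d * Rank(u) / N(u) from every page u that links to it.
        for u in pages:
            share = d * rank[u] / len(links[u])
            for v in links[u]:
                nxt[v] += share
        if sum(abs(nxt[p] - rank[p]) for p in pages) < tol:
            return nxt
        rank = nxt

# The rank-sink example from above: b.com -> a.com, a.com <-> c.com.
ranks = pagerank({"a.com": ["c.com"], "b.com": ["a.com"], "c.com": ["a.com"]})
```

<p>With the random jump in place, the total rank stays constant and the <code>a.com</code>/<code>c.com</code> loop no longer swallows everything; <code>b.com</code> keeps its share from the random jumps.</p>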
<h2 id="heading-lets-dive-into-the-implementation"><strong>Let's dive into the Implementation</strong></h2>
<p>So, there are two parts to this search engine problem. First, we need a database of all the links to build the Markov Chain in the first place. This is done by a program called a <strong>Crawler</strong>.</p>
<p>Then comes the actual PageRank implementation. Here are the steps:</p>
<ol>
<li><p>Convert each URL to a unique integer ID and store the mapping in a database.</p>
</li>
<li><p>Sort the link structure by Parent ID.</p>
</li>
<li><p>Remove all the dangling links. (Dangling links are pages with no outgoing links. Removing them has no effect on the final distribution.)</p>
</li>
<li><p>Have an initial assignment of ranks.</p>
<ul>
<li>The initial assignment won’t really affect the final result, since the Ergodic Theorem guarantees a unique stationary distribution, but it will affect how quickly we reach that final state, i.e. the rate of convergence.</li>
</ul>
</li>
<li><p>The original implementation used single-precision floating-point values at 4 bytes each. So, for 75 million URLs this amounted to 300 MB.</p>
<ul>
<li>In terms of scalability, if a point comes where the available memory is insufficient to hold the weights, the previous weights can be stored on disk and accessed linearly.</li>
</ul>
</li>
<li><p>Finally once convergence is reached, the dangling links are attached back.</p>
</li>
</ol>
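<p>As an illustration, steps 1 and 3 above might look like this in Python. This is a toy sketch with invented helper names, not the original implementation:</p>

```python
def assign_ids(urls):
    # Step 1: convert each URL to a unique integer ID.
    return {url: i for i, url in enumerate(sorted(urls))}

def remove_dangling(links):
    # Step 3: drop dangling pages (no outgoing links) until none remain;
    # removing one dangling page can expose another.
    links = {u: set(vs) for u, vs in links.items()}
    while True:
        dangling = {u for u, vs in links.items() if not vs}
        if not dangling:
            return links
        links = {u: vs - dangling
                 for u, vs in links.items() if u not in dangling}

ids = assign_ids(["b.com", "a.com", "c.com"])
pruned = remove_dangling({"a.com": {"b.com", "c.com"},
                          "b.com": {"a.com"},
                          "c.com": set()})
```

<p>After convergence on the pruned graph, the removed dangling pages are attached back, as step 6 describes.</p>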
<p>When the paper was released, this whole process took around 5 hours.</p>
<p>On a large database of 322 million links, convergence took around 52 iterations. Convergence on half the data took around 45 iterations, which shows that the PageRank algorithm scales very well even for large collections, as the scaling factor is roughly linear in <em>log n</em>. The authors noted that the cost of computing PageRank is insignificant compared to the cost of building a full-text index.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719681737552/f2a0943e-1198-46b8-86ed-b0b57b9689ec.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-early-results"><strong>Early Results</strong></h2>
<p>To initially test the usefulness of PageRank, the authors implemented a search engine that used just 16 million web pages, performed title-based search only, and sorted the results by PageRank.</p>
<p>Here is an example of the search result for the term “University”. On the left we have the PageRank algorithm and on the right we have another popular search engine at that time, Altavista.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1719681806943/ba6af2fc-f45a-4086-95c7-4380858fa824.png" alt class="image--center mx-auto" /></p>
<p>PageRank returns the home page of Stanford as the top result (since the initial PageRank was built from Stanford, it is understandable that most webpages had outgoing links to the Stanford homepage). You can also see the PageRank value normalized to 100% as the red progress bar next to each link. In contrast, Altavista returned random-looking webpages that matched the query "University."</p>
<h2 id="heading-the-story-behind-the-name"><strong>The story behind the name</strong></h2>
<p><img src="https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcTDQsQYLiTdolE0YOkbMHfiVnGDHOyJgmsVeg&amp;s" alt="Before Facebook became Meta, Google was BackRub, Amazon was Cadabra..." class="image--center mx-auto" /></p>
<p><strong>BackRub</strong> was the initial name given to the search engine that would later become Google. The name "BackRub" actually reflects the core mechanism behind the search engine's algorithm. Can you guess?</p>
<p>Yes, "backlinks".</p>
<p>The name "BackRub" was eventually dropped in favor of "Google" in 1997. The new name was inspired by the mathematical term "googol," which represents the number 1 followed by 100 zeros, symbolizing the vast amount of information that the search engine aimed to organize and make accessible.</p>
<p>In one of the <a target="_blank" href="https://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html">blogs</a> published by Google in 2008, they mentioned that “by 2000 the Google index reached the one billion mark. Recently, even our search engineers stopped in awe about just how big the web is these days -- when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!”.</p>
<h2 id="heading-conclusion"><strong>Conclusion</strong></h2>
<p>The tale of PageRank is a fascinating journey through the early days of the web, where two Stanford PhD students laid the foundation for what would become the world's most powerful search engine.</p>
<p>I'm still constantly amazed by the sheer scale at which this algorithm operates, processing trillions of webpages and billions of searches every day. PageRank's legacy continues to shape how we access information, discover new ideas, and connect with others across the globe.</p>
<p>I hope this exploration into the inner workings of PageRank has shed light on the brilliance behind this groundbreaking algorithm. It's a story of how a simple idea, combined with rigorous mathematical principles, revolutionized the way we navigate the vast landscape of the World Wide Web which we kind of take for granted.</p>
<p>If you like my blogs, do follow me and subscribe to my newsletter for more such informative bits.</p>
]]></content:encoded></item><item><title><![CDATA[Graph Databases 101: Guide to Understanding Connected Data]]></title><description><![CDATA[Traditional databases often struggle to model and query complex interconnected data. This is where graph databases come in, offering a powerful way to represent and navigate relationships. In this blog series, we will dive into a high level idea of w...]]></description><link>https://arnabsen.dev/graph-databases-101-guide-to-understanding-connected-data</link><guid isPermaLink="true">https://arnabsen.dev/graph-databases-101-guide-to-understanding-connected-data</guid><category><![CDATA[Tutorial]]></category><category><![CDATA[System Design]]></category><category><![CDATA[Databases]]></category><category><![CDATA[backend]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 02 Jun 2024 16:32:26 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1717345772687/92fba269-2936-4180-b6a5-68e844656afa.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Traditional databases often struggle to model and query complex interconnected data. This is where graph databases come in, offering a powerful way to represent and navigate relationships. In this blog series, we will dive into a high level idea of what's all the fuss about graph databases.</p>
<p>A quick disclaimer: Don't worry if you're not a graph theory expert. You can still follow and discover the potential of this technology.</p>
<h2 id="heading-traditional-databases-a-world-of-records-and-transactions"><strong>Traditional databases: a world of Records and Transactions</strong></h2>
<p>The early days of computing were dominated by structured data like inventory records and financial transactions. Relational databases, with their tables and rows, were the perfect tool for this kind of information: the data was mostly discrete, and traditional relational databases handled it well.</p>
<p>However, the internet revolutionized the data landscape. Today, major players like Google, Meta, and X deal with vast amounts of interconnected data. </p>
<p>Take Facebook, for example. The entire platform is built on a social graph, a web of relationships where Alice is friends with Bob, Bob is friends with Charlie, etc. Similarly, the "follows" relationship on Instagram/X forms another kind of graph. Google's search engine relies on a knowledge graph to understand the connections between websites and the information they contain.</p>
<p><img src="https://www.kbhaskar.com/project/facebook/featured.png" alt class="image--center mx-auto" /></p>
<p>While relational databases are still incredibly useful, they struggle when it comes to modeling and querying these intricate relationships. We’ll take a closer look at these cases.</p>
<h2 id="heading-representing-relationships"><strong>Representing Relationships</strong></h2>
<p>Relationships within interconnected data are best described using graphs.</p>
<p>Let's illustrate this with a simple Twitter example: </p>
<ul>
<li><p>Alice follows Bob</p>
</li>
<li><p>Alice follows Charlie</p>
</li>
<li><p>Bob follows Charlie</p>
</li>
<li><p>Charlie follows Bob</p>
</li>
<li><p>Alice follows Dave</p>
</li>
<li><p>Dave follows Charlie</p>
</li>
<li><p>Bob follows Dave</p>
</li>
</ul>
<p>If we were to ask, <em>"How many people does Bob follow?"</em>, we'd have to scan through all the records to count them. </p>
<p>But if we visualize this data as a graph, the answer becomes immediately clear.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1717344771980/edd95881-a4b7-4572-b9f3-a89325de6779.png" alt class="image--center mx-auto" /></p>
<p>We can also see that Charlie is the most followed person in this dataset. Graphs provide a powerful way to visualize and analyze relationships, making them a natural fit for representing interconnected data.</p>
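<p>To make this concrete, the follow list above can be held as a simple adjacency structure. A sketch in Python:</p>

```python
from collections import Counter

# Adjacency structure for the Twitter example: who follows whom.
follows = {
    "Alice":   {"Bob", "Charlie", "Dave"},
    "Bob":     {"Charlie", "Dave"},
    "Charlie": {"Bob"},
    "Dave":    {"Charlie"},
}

# "How many people does Bob follow?" is one lookup, no record scanning.
bob_follows = len(follows["Bob"])

# Most-followed person: count incoming edges.
followers = Counter(v for vs in follows.values() for v in vs)
most_followed = followers.most_common(1)[0][0]
```

<p>Here <code>bob_follows</code> is 2 and <code>most_followed</code> is Charlie, matching what the picture shows at a glance.</p>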
<p>While graph theory has existed since the 18th century (with contributions from mathematicians like Euler), it wasn't until the early 2000s that graph databases started gaining traction in the commercial world. Companies like <strong>Neo4j</strong> were among the first to offer graph database solutions, paving the way for wider adoption. Today, many popular graph databases are available, including <strong>Amazon Neptune</strong> and <strong>ArangoDB</strong>. Beyond social networks, graph databases are used in various applications like fraud detection, recommendation engines, and knowledge graphs.</p>
<p><img src="https://images.datacamp.com/image/upload/v1696507403/image3_de75a57877.png" alt /></p>
<h2 id="heading-terminologies-for-graph-database"><strong>Terminologies for Graph Database</strong></h2>
<p>Graph databases are based on a model called Labeled Property Graphs.</p>
<p>Let's explore the key components of this model:</p>
<h3 id="heading-nodes"><strong>Nodes</strong></h3>
<p>Nodes represent individual entities in the data. They can be people, places, things, or any other object you want to model. Each node can have an optional set of properties stored as key-value pairs. For example, a node representing a user might have properties like:</p>
<pre><code class="lang-json">{
    "id": 1001,
    "name": "Alice",
    "handle": "alice123"
}
</code></pre>
<h3 id="heading-labels"><strong>Labels</strong></h3>
<p>Nodes can also have one or more labels, which categorize them based on their characteristics.  For example, in a social network graph, some nodes might be labeled as "user," while others might be labeled as "post." Labels are essential for querying and filtering data efficiently.</p>
<p><img src="https://lh7-us.googleusercontent.com/docsz/AD_4nXf0ANqYsLwESvUrKx5SuDK8GY-39pqO1FIMNdKdi5yZMlfZzV0SBRVGKDgU2b0hWKUQr7rfK0fA8WSJJvbvs_KhmzzON2VUOXsLuMYjktwogLrve4OzBCfz9a1pWJPQbYjSSzDWjZU16FCHOrm8_4BqdEQWeaIGfIQLDS6nudBrVJhL5PzGWX_hJXezFgZ2LAQ?key=Y6lA456BfUkVrrrGXLGCqg" alt /></p>
<h3 id="heading-relationships"><strong>Relationships</strong></h3>
<p>Relationships define how nodes are connected. They are inherently directional, meaning that "Alice follows Bob" is not the same as "Bob follows Alice." Relationships can also have optional properties. In our Twitter example, a "follows" relationship might have a property like <code>{ created_at: 2024-05-01 }</code> signifying that the person started following from 1st May 2024.</p>
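<p>One minimal way to picture the labeled property graph model in code is with plain Python dicts. This is purely illustrative (Bob's handle is made up), not how any particular engine stores data:</p>

```python
# Nodes: id -> (labels, properties)
nodes = {
    1001: ({"user"}, {"name": "Alice", "handle": "alice123"}),
    1002: ({"user"}, {"name": "Bob", "handle": "bob99"}),  # handle invented
}

# Relationships are directional: (source, TYPE, target) -> properties.
relationships = {
    (1001, "FOLLOWS", 1002): {"created_at": "2024-05-01"},
}

# "Alice follows Bob" exists; the reverse direction does not.
alice_follows_bob = (1001, "FOLLOWS", 1002) in relationships
bob_follows_alice = (1002, "FOLLOWS", 1001) in relationships
```

<p>The asymmetry of the two lookups is exactly why relationships carry a direction in this model.</p>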
<h2 id="heading-where-graph-databases-shine"><strong>Where Graph Databases Shine</strong></h2>
<p>Imagine trying to answer the following query using our Twitter example in a relational database: "How many people does Alice follow who have at least 100 followers and have all posted at least 3 tweets with more than 10 likes?"  The SQL query would be a nightmare of multiple joins and subqueries, likely taking a significant amount of time to execute.</p>
<p>In contrast, a graph database could answer this question with a simple traversal, following the relationships from Alice's node to the people she follows, then to their followers, and finally to their tweets. This traversal could be orders of magnitude faster than the equivalent relational query.</p>
<p><strong><mark>Graph databases also excel at uncovering unexpected insights</mark></strong>. While relational databases are optimized for structured queries based on a predefined schema, graph databases allow for more exploratory analysis. They're not just about answering known questions; they're about discovering relationships and patterns you didn't even know existed.</p>
<p>Furthermore, <strong><mark>graph databases offer flexibility</mark></strong>. Making changes to the schema or adding new types of relationships is often much easier than in a relational database, where complex migrations might be required.</p>
<p>In essence, graph databases are the tool of choice when relationships between data points are as crucial as the data itself. If your data is highly interconnected and you need to perform <strong><mark>complex relationship-centric queries</mark></strong> or explore your data in new ways, graph databases might be the perfect fit. If that’s not the case, relational databases should be your go-to choice.</p>
<h2 id="heading-native-graph-processing"><strong>Native Graph Processing</strong></h2>
<p>This way of modeling data is consistent across various graph database implementations, but there are different ways to encode and represent graphs in a database engine’s main memory. A graph database can have native processing capabilities if it exhibits a property called <mark>index-free adjacency</mark>.</p>
<p>How do we represent which nodes are adjacent to each other? In a graph, two nodes are adjacent if they are directly connected by a relationship. In non-native graph processing, an index might be used to look up adjacent nodes, similar to how a relational database uses indexes for faster query performance. However, this introduces overhead and can limit the speed of graph traversals.</p>
<p>So, even though on the surface it is still a graph-based model, a global index is used to fetch the adjacent nodes.</p>
<p><img src="https://lh7-us.googleusercontent.com/docsz/AD_4nXdQaBORwfY3rAVy9ANdx0nU9ciTQOHUp08TzUkLMqfc06csJEFtaw1VhahUrjiOu2jnijct8eg4pKWZ8Gaf1N58GdGrwfjJD1q5mhMZQPx16NTaa_GIxXhySAHeH9R6_-LNYwgqN2ywOakyF7csGeYLNv5W8E1p1881EUusvz3Xx_I9jvT9_RJ169_XvOrQiA?key=Y6lA456BfUkVrrrGXLGCqg" alt /></p>
<p>In contrast, native graph processing engines use index-free adjacency. Each node directly stores references (like pointers) to its neighboring nodes. Think of it like a linked list where each element points to the next.  This allows for extremely fast traversal, as there's no need to consult a global index.</p>
<p><img src="https://lh7-us.googleusercontent.com/docsz/AD_4nXetonksNt6FA_JmHfosZ8my99nxgEkU6yFqGps61YXKtVuIL4lFexIheNKU6ZPeivWgwAQgG6TMOvYRjIaVGaD3AA2Ii7iO08UO8sYyrsyrBzNpMxQ2iqsn-gM7El3qzMQNdntBpEg9onWY9uP29Pp7evw_vVqLJoixOX3i_ImWvYsADWP9_2b869opyWDn1A?key=Y6lA456BfUkVrrrGXLGCqg" alt /></p>
<p>There are multiple advantages of having index-free adjacency</p>
<h3 id="heading-faster-lookups">Faster Lookups</h3>
<p>Depending on the implementation, index lookups could be <code>O(log n)</code> in algorithmic complexity versus <code>O(1)</code> for looking up immediate relationships. To traverse a network of <em>m</em> steps, the cost of the indexed approach, at <code>O(m log n)</code>, dwarfs the <code>O(m)</code> cost of an implementation that uses index-free adjacency.</p>
<h3 id="heading-simpler-queries">Simpler Queries</h3>
<p>Furthermore, if we were to ask “how many followers does node B have?”, in non-native graph processing we would have to scan the entire index to get the count; to avoid that, we would have to maintain a reverse index as well.<br />With native graph processing, we can just look at the incoming edges to node B and we have the answer.</p>
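<p>The contrast can be sketched in a few lines of Python: a global index over an edge list versus nodes that hold direct references to their neighbours. Illustrative only, not any engine's actual storage layout:</p>

```python
import bisect

# Index-based: edges live in one big sorted list; every adjacency query
# starts with a binary search over the whole index, O(log E) per lookup.
edges = sorted([("A", "B"), ("A", "C"), ("B", "C"), ("C", "B")])

def neighbours_indexed(u):
    i = bisect.bisect_left(edges, (u, ""))
    out = []
    while i < len(edges) and edges[i][0] == u:
        out.append(edges[i][1])
        i += 1
    return out

# Index-free: each node directly stores its outgoing and incoming
# neighbours, so adjacency (and follower counts) are O(1) to reach.
class Node:
    def __init__(self, name):
        self.name, self.outgoing, self.incoming = name, [], []

def link(u, v):
    u.outgoing.append(v)
    v.incoming.append(u)

a, b, c = Node("A"), Node("B"), Node("C")
link(a, b); link(a, c); link(b, c); link(c, b)
```

<p>With the index-free layout, “how many followers does B have?” is just <code>len(b.incoming)</code>, no reverse index required.</p>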
<p>While index-free adjacency offers significant advantages, it's worth noting that it can lead to higher memory usage for dense graphs where nodes have many connections.  However, for many real-world use cases, the performance benefits far outweigh this potential drawback.</p>
<h2 id="heading-conclusion"><strong>Conclusion</strong></h2>
<p>In this introduction to graph databases, we've explored how they differ from traditional relational databases, their core terminology (nodes, relationships, properties), and the concept of native graph processing. In the upcoming posts, we'll delve deeper into the inner workings of graph databases, exploring how Neo4j and other systems store and manage graph data on disk.  </p>
<hr />
<p>If you found this post helpful, please like and follow for more insights into the exciting world of graph databases.</p>
]]></content:encoded></item><item><title><![CDATA[Leaping into Lempel-Ziv Compression Schemes]]></title><description><![CDATA[Hey everyone! In this blog, we're going to continue our discussion on data compression, but this time, we're diving into some real-world algorithms that make it all happen.
Here was my previous blog which builds the intuition behind compression:
http...]]></description><link>https://arnabsen.dev/lz-compression-algorithm</link><guid isPermaLink="true">https://arnabsen.dev/lz-compression-algorithm</guid><category><![CDATA[algorithms]]></category><category><![CDATA[Computer Science]]></category><category><![CDATA[Tutorial]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 14 Apr 2024 15:58:10 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1713109505626/d230dbf9-1470-4869-b098-4baadca22b75.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hey everyone! In this blog, we're going to continue our discussion on data compression, but this time, we're diving into some real-world algorithms that make it all happen.</p>
<p>Here was my previous blog which builds the intuition behind compression:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.dev/compression-conundrum">https://arnabsen.dev/compression-conundrum</a></div>
<p> </p>
<p><em>A quick recap…</em></p>
<p>In the previous blog, we dove into the world of compression algorithms! We explored the fascinating idea that compressing data can, in some cases, actually make the file size bigger. We talked about how every compressed data set has a counterpart that expands when compressed. This is because compression relies on finding patterns within the data.</p>
<h1 id="heading-idea-behind-a-practical-compression-scheme">Idea behind a practical Compression Scheme</h1>
<p>One way to deterministically achieve compression would be to have symbols for repeated chars or bits. Let’s take the example of the phrase <code>to be or not to be</code>. If we use a particular symbol <code>X</code> as a replacement for the <code>to be</code> part it will be encoded as: <code>X or not X</code>. This has reduced the number of symbols, hence we can say compression is achieved.</p>
<p>But wait, hold on a sec!! This raises a few questions:</p>
<ol>
<li><p>What if we had the char <code>X</code> in the stream itself? Or a better question would be how to effectively differentiate between actual characters and symbols used as replacements?</p>
</li>
<li><p>How will the decompressor know that <code>X</code> refers to <code>to be</code>?</p>
</li>
</ol>
<p><strong>Solution #1</strong>: Let’s say our input is made up of 8-bit ASCII codes. Then, in our compression scheme, we can agree upon a contract that the compressed data will be made up of 9-bit codes instead. The decompressor has to keep this information in mind. With 8-bit ASCII we have values from 0-255; with an additional bit we get extra values from 256-511 that can be used for our symbols. This way we can very easily differentiate between symbols and input chars.</p>
<p><em>But wait, what kind of compression scheme is this, where the input is made up of 8-bit codes but the compressed data is made up of 9-bit codes? Won't that just increase the size of the data? Will compression be achieved at all?</em></p>
<p>That’s a really good question. Let’s tackle the 2nd question first and come back to this.</p>
<p><strong>Solution #2</strong>: How will the decompressor know that <code>X</code> refers to <code>to be</code>? One approach would be to maintain a table/mapping for every symbol and what characters they represent. And the decompressor should also have this table/mapping information to refer to while performing the decompression.</p>
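<p>Both solutions above can be sketched together in a few lines of Python. This is a toy illustration of the idea (it abuses a Python string as the carrier for the 9-bit codes), not a real compressor:</p>

```python
# Codes 0-255 are literal 8-bit chars; codes 256+ index the symbol table,
# which must be shipped to the decompressor alongside the compressed data.
table = {256: "to be"}

def compress(text, table):
    for code, phrase in table.items():
        text = text.replace(phrase, chr(code))   # substitute the symbol
    return [ord(ch) for ch in text]              # every value fits in 9 bits

def decompress(codes, table):
    return "".join(table.get(c, chr(c)) for c in codes)

codes = compress("to be or not to be", table)
roundtrip = decompress(codes, table)
```

<p>The 18-char input becomes 10 nine-bit codes, and because symbol values start at 256 they can never collide with a literal input character.</p>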
<p>Here, let me call out an important consideration when developing compression schemes. When comparing compression algorithms, we can’t just compare the compressed text with the original text. We need to take into account all the data the decompressor needs in order to accurately decompress the compressed data back to the original text (mind you, we are still talking about lossless compression, as mentioned in my earlier blog). So, in our case we have to consider the compressed text plus the metadata (the table/map we were talking about) when comparing the performance of compression algorithms.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1713105850671/8f5fa683-bea0-4ae2-a3c6-8a2d3b81c47d.png" alt class="image--center mx-auto" /></p>
<p>Now, let's tackle that tricky question.</p>
<p>From my previous blog we got an intuition that compression algorithms should compress commonly occurring inputs effectively, at the expense of maybe expanding inputs that resemble random noise. So, when we use 9 bits or more in our compression scheme, we are hoping that with the extra 256 symbols we will be able to reduce the size so much that, even at 9 bits per symbol, we get an effective compression. Let’s do the math with an example string: <code>to be or not to be</code>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1713106226904/c91aa116-b077-49f3-8d75-9018bb575582.png" alt class="image--center mx-auto" /></p>
<p>The original string has 18 chars, each char is of 8-bits i.e <code>18*8</code> = <code>144</code> bits in total.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1713106250211/adc3507f-b29d-4417-9fe5-30719668d6b7.png" alt class="image--center mx-auto" /></p>
<p>If we replace the <code>to be</code> with the symbol 256 (<code>0b100000000</code>) then the string itself gets compressed to just 10 symbols, with each symbol taking 9-bits according to our agreement i.e <code>10*9</code> = <code>90</code> bits.</p>
<p>But we also need to take into account the additional mapping for the decompressor to know that symbol 256 refers to <code>to be</code>, which will take another <code>48 bits</code>. So, effectively we have <code>90 + 48</code> bits = <code>138</code> bits and we achieve a compression ratio of <code>1.043</code>. Although this compression is trivial, it is a good example that supports our hypothesis.</p>
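<p>The arithmetic above can be sanity-checked with a tiny Python sketch. The variable names are mine, just for illustration, and I'm assuming the 48-bit figure comes from storing the 5 characters of <code>to be</code> plus one extra byte of bookkeeping (e.g. a length or terminator byte):</p>

```python
original = "to be or not to be"
original_bits = len(original) * 8             # 18 chars x 8 bits = 144 bits

# After substitution the stream is "X or not X": 10 symbols, 9 bits each.
compressed_symbols = 10
compressed_bits = compressed_symbols * 9      # 90 bits

# Metadata: the decompressor needs to know symbol 256 means "to be".
# Assumed layout: 5 chars + a 1-byte length/terminator = 48 bits.
mapping_bits = (len("to be") + 1) * 8

total_bits = compressed_bits + mapping_bits   # 138 bits
ratio = original_bits / total_bits

print(original_bits, total_bits, round(ratio, 3))  # 144 138 1.043
```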
<blockquote>
<p><strong>Note</strong>: Judging compression schemes on small inputs is not ideal. Compression is all about saving space on large data, so we should see whether the compression scheme has the potential to scale with larger inputs. If the data to be compressed is itself very small, there is not much need to compress it anyway. But here, we will still stick to smaller inputs for easier understanding.</p>
</blockquote>
<p>To draw a conclusion: a good compression algorithm needs to be able to <strong><mark>find repeated patterns in the string and substitute them</mark></strong>. The tradeoff is that we have to somehow convey this information to the decompressor.</p>
<p><em>But is there any way we can make this even better? What if we can save the extra overhead of sending the metadata to the decompressor?</em></p>
<h1 id="heading-idea">Idea</h1>
<p>One approach might be to traverse the input stream and build a dictionary simultaneously with all the information that we have so far. In other words, if the table/mapping can be built deterministically we don’t have to send that information to the decompressor and that will save us some extra overhead. The decompressor can use the same logic to build the table itself.</p>
<p>This is actually the hallmark of the <strong>LZ Scheme of Compression Algorithms</strong>. The LZ family of data compression algorithms trace back to the work of <strong>Abraham Lempel</strong> and <strong>Jacob Ziv</strong>, who published two influential papers in 1977 and 1978. These algorithms, LZ77 and LZ78, laid the groundwork for many variations that are still used today, including LZW, LZSS, and LZMA.</p>
<p>This article will focus on the <strong><mark>LZW scheme</mark></strong>. <strong>Terry Welch</strong> published it in 1984 (the W in LZW) as a significant improvement over LZ78. The LZW algorithm quickly gained widespread use due to its efficiency and adaptability, and it became the compression method of choice in the popular GIF image format. In 1985, Sperry Corporation (which had acquired Lempel and Ziv's employer) obtained a patent on LZW. Later, Unisys acquired Sperry and began enforcing licensing fees on the use of LZW in software. Developers weren’t happy with this, and that resentment fueled the creation and adoption of other compression algorithms like DEFLATE (which is used in the gzip tool). The classic Unix command <code>compress</code>, however, still utilizes a modified version of the LZW algorithm.</p>
<p>It’s a little hard to explain the algorithm in a blog, but I have given it my best shot:</p>
<h1 id="heading-getting-hands-dirty">Getting hands dirty</h1>
<h2 id="heading-setup">Setup</h2>
<p>The compressor and decompressor maintain a symbol table. The symbol table is initially populated with the first 256 ASCII characters. As we process the input string we will create new symbols and they will take the index 256 onwards. So, we can say symbol 256 is for <code>to be</code> and so on.</p>
<p>In the LZW compression algorithm, whenever we add something to the compressed data result, there must exist a corresponding symbol in the symbol table.</p>
<p>We also have to maintain something called the <strong>Working String</strong>, which stores a candidate for a new symbol. At every step we either try to make the working string longer or add the current working string to the symbol table.</p>
<p>And how do we build the working string? We use the <strong>Current Character</strong> while iterating over the string. Once we add the current working string to the symbol table, we reset the working string to just the current character. So, effectively, the working string is never empty except at the beginning of the compression.</p>
<h2 id="heading-compression">Compression</h2>
<p>Let’s consider an example string <code>banana</code>.</p>
<p>When we check if the working string is in the symbol table we append the current character at the end of it. Let’s define it as <strong>Augmented Working String</strong> (AWS) for easier understanding.</p>
<p>Let’s start with our first character: <code>b</code>.</p>
<pre><code class="lang-plaintext">Working String: "&lt;empty&gt;"
Current Character: "b"
Augmented Working String: “b”
</code></pre>
<p>Is AWS in the Symbol table? Yes, as we mentioned before, all the single character ASCII values are already in the table.</p>
<p>Now, the new working string becomes this augmented working string. And we don’t output anything in this step.</p>
<hr />
<p>Let’s go to our next character: <code>a</code>.</p>
<pre><code class="lang-plaintext">
Working String: “b”
Current Character: “a”
Augmented Working String: “ba”
</code></pre>
<p>Is AWS in the Symbol Table? No, currently we don’t have any extra symbols except the first 256 ASCII characters in the symbol table.</p>
<p>So, we will add this augmented working string in the symbol table with the next available index, which in this case is <code>256</code>. So our symbol table now has this extra symbol:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”
}
</code></pre>
<p>At this point we will output the existing working string i.e “b”. So we got our first character in the compressed data.</p>
<hr />
<p>Let’s go to our next character: <code>n</code>.</p>
<pre><code class="lang-json">Working String: “a”
Current Character: “n”
Augmented Working String: “an”
</code></pre>
<p>Is AWS in the Symbol Table? No. So, we add it with the next available index i.e 257. Now our symbol table becomes:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”,
    257: “an”
}
</code></pre>
<p>And we output our current working string i.e “a”. So our compressed data is now <code>ba</code>.</p>
<hr />
<p>Let’s go to our next character: <code>a</code>.</p>
<pre><code class="lang-json">Working String: “n”
Current Character: “a”
Augmented Working String: “na”
</code></pre>
<p>Is AWS in the Symbol Table? No. So, we add it with the next available index i.e 258. Now our symbol table becomes:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”,
    257: “an”,
    258: “na”
}
</code></pre>
<p>And we output our current working string i.e “n”.  So our compressed data is now <code>ban</code>.</p>
<hr />
<p>Our next character: <code>n</code>.</p>
<pre><code class="lang-json">Working String: “a”
Current Character: “n”
Augmented Working String: “an”
</code></pre>
<p>Is this in the Symbol Table? Well, yes!! Its index is 257. So, we move to our next step.</p>
<hr />
<p>Our next character: <code>a</code>.</p>
<pre><code class="lang-json">Working String: “an”
Current Character: “a”
Augmented Working String: “ana”
</code></pre>
<p>Is this in the Symbol Table? No. So, we add it with the next available index i.e 259. Now our symbol table becomes:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”,
    257: “an”,
    258: “na”,
    259: “ana”
}
</code></pre>
<p>And we output our current working string “an”, or rather the index for this symbol: 257. So our compressed data is now <code>ban&lt;257&gt;</code>.</p>
<p>And our working string for the next iteration is “a”.</p>
<p>It is interesting to see that with this algorithm we keep building on top of the symbols we already saw previously, in the hope that if a pattern occurs again we can use the new symbol. This shows how compression schemes like this benefit from repeated patterns in long inputs.</p>
<p>Now we are done with the input characters, but we still have some data left in the working string. So, we need to flush it to the compressed data, which becomes: <code>ban&lt;257&gt;a</code>.</p>
<p>As you can see, we were building the symbol table as we traversed the input. Another good observation is that we solved the problem of what a symbol represents: in the compressed string, where we used <code>257</code> as a replacement for <code>an</code>, the substring <code>an</code> already appears in the string before the symbol is used.</p>
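<p>Putting the whole walkthrough together, here is a minimal Python sketch of the compressor (the function name and the list-of-codes output format are my own choices for illustration, not part of any standard):</p>

```python
def lzw_compress(text):
    # Symbol table pre-populated with the first 256 single-char ASCII symbols.
    table = {chr(i): i for i in range(256)}
    next_code = 256
    working = ""                       # the Working String
    output = []
    for current in text:               # the Current Character
        aws = working + current        # the Augmented Working String
        if aws in table:
            working = aws              # grow the working string
        else:
            output.append(table[working])  # emit the working string's code
            table[aws] = next_code         # register the new symbol
            next_code += 1
            working = current              # reset to the current character
    if working:
        output.append(table[working])      # flush what's left at the end
    return output

# "banana" compresses to b, a, n, <257>, a -- exactly as in the walkthrough.
print(lzw_compress("banana"))  # [98, 97, 110, 257, 97]
```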
<h2 id="heading-decompression">Decompression</h2>
<p>For decompression, the important objective is to build the symbol table by the time we reach a symbol, so that we can use the substitution.</p>
<p>So, if the decompressor follows the same logic, by the time it reaches the symbol 257 it should be able to learn that 257 represents the character “an”. Let’s go through the decoding process too.</p>
<p>Now, we have the compressed data <code>ban&lt;257&gt;a</code>.</p>
<p>The logic to fill the symbol table remains the same.</p>
<p>Let’s go to the first symbol: <code>b</code>.</p>
<pre><code class="lang-json">Working String: “&lt;empty&gt;”
Current character: “b”
Augmented working string: “b”
</code></pre>
<p>Since we already have the symbol in the table, we will continue and we can simply output the decompressed result <code>b</code>.</p>
<hr />
<p>Next, our symbol is <code>a</code>.</p>
<pre><code class="lang-json">Working String: “b”
Current Character: “a”
Augmented Working String: “ba”
</code></pre>
<p>We don’t have it in the symbol table so we will insert it:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”
}
</code></pre>
<p>And we will append “a” to the decompressed result: <code>ba</code>.</p>
<hr />
<p>Our next symbol is <code>n</code>.</p>
<pre><code class="lang-json">Working String: “a”
Current Character: “n”
Augmented Working String “an”
</code></pre>
<p>We don’t have it in the symbol table so we will insert it:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”,
    257: “an”
}
</code></pre>
<p>And we will append “n”  to the decompressed result and it will be <code>ban</code>.</p>
<hr />
<p>Next our symbol is <code>257</code> .</p>
<p>As soon as we come across a symbol we check if there is any substitution for it, and there is: we have “an” for 257. So, we replace 257 with “an” and then treat each char individually, like we were doing so far.</p>
<hr />
<p>So, the next symbol is <code>a</code> (from the "an").</p>
<pre><code class="lang-json">Working String: “n”
Current Character: “a”
Augmented Working String “na”
</code></pre>
<p>We don’t have it in the symbol table so we will insert it:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”,
    257: “an”,
    258: “na”
}
</code></pre>
<p>And we will append “a”  to the decompressed result and it will be <code>bana</code>.</p>
<hr />
<p>Then our next symbol will be <code>n</code> (we still have an “n” from the substitution we performed earlier).</p>
<pre><code class="lang-json">Working String: “a”
Current Character: “n”
Augmented Working String “an”
</code></pre>
<p>We have the symbol in the table, so we will continue with that augmented working string.</p>
<p>And we will append <code>n</code> to the decompressed result and it will be <code>banan</code>.</p>
<hr />
<p>Now our final symbol is <code>a</code>.</p>
<pre><code class="lang-json">Working String: “an”
Current Character: “a”
Augmented Working String “ana”
</code></pre>
<p>We don’t have it in the symbol table so we will insert it:</p>
<pre><code class="lang-json">symbol_table: {
    256: “ba”,
    257: “an”,
    258: “na”,
    259: “ana”
}
</code></pre>
<p>Finally we will have our decompressed result: <code>banana</code>.</p>
<p>Also notice how we arrived at the same symbol table that we saw during the compression process. This is the beauty of the LZW compression algorithm.</p>
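<p>The decompression walkthrough can be sketched in Python the same way (again, the names are my own). Note that this naive version assumes every code it reads is already in the table, which is exactly the assumption the next section pokes a hole in:</p>

```python
def lzw_decompress(codes):
    table = {i: chr(i) for i in range(256)}   # code -> string
    known = {chr(i) for i in range(256)}      # strings already in the table
    next_code = 256
    working = ""
    output = []
    for code in codes:
        # Substitute the code, then treat each character individually.
        for current in table[code]:           # assumes `code` is already known
            output.append(current)
            aws = working + current
            if aws in known:
                working = aws                 # grow the working string
            else:
                known.add(aws)                # register the new symbol,
                table[next_code] = aws        # mirroring the compressor
                next_code += 1
                working = current
    return "".join(output)

print(lzw_decompress([98, 97, 110, 257, 97]))  # banana
```

<p>Running it rebuilds exactly the symbol table we saw during compression (256: “ba”, 257: “an”, 258: “na”, 259: “ana”).</p>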
<h3 id="heading-edge-case">Edge Case :)</h3>
<p>But there is a small caveat!! Try to follow this same algorithm to compress the string <code>another_banana</code> and then decompress it. During compression you won’t find any issue, and if you do it correctly you should get <code>another_b&lt;256&gt;&lt;265&gt;</code> as the compressed data. But there is a small edge case when you perform the decompression.</p>
<p>But it can be resolved with a little trick.</p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>"The art of compression is the art of finding patterns." </p>
<p>That was all for this blog. While LZW marked a significant milestone in compression algorithms, it's important to note that data compression continues to evolve. Modern techniques often combine LZW with other algorithms or use more sophisticated dictionary-building mechanisms to achieve even higher compression rates. Just like <a target="_blank" href="https://deepmind.google/discover/blog/alphadev-discovers-faster-sorting-algorithms/">Google’s DeepMind AI AlphaDev</a> found “faster” sorting algorithms, maybe someday they will come up with better and more efficient compression algorithms, until then let’s add more creativity to this field.</p>
]]></content:encoded></item><item><title><![CDATA[The Compression Conundrum]]></title><description><![CDATA[Couple of days back I was reading about compression and how different compression algorithms work and realized that I had a flawed understanding of the fundamentals of compression algorithms. So here I am noting down what I learnt.
I will not cover a...]]></description><link>https://arnabsen.dev/compression-conundrum</link><guid isPermaLink="true">https://arnabsen.dev/compression-conundrum</guid><category><![CDATA[algorithms]]></category><category><![CDATA[Computer Science]]></category><category><![CDATA[Blogging]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 24 Mar 2024 14:40:27 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1711291036781/312f413a-3c58-4d04-8cc9-85e874488a9d.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Couple of days back I was reading about compression and how different compression algorithms work and realized that I had a flawed understanding of the fundamentals of compression algorithms. So here I am noting down what I learnt.</p>
<p>I will not cover any compression algorithm in this blog; I will keep that for future blogs. But what I will cover is: if you want to create your own compression algorithm, what should the starting point look like?</p>
<p>Let's step back a little</p>
<h2 id="heading-what-hath-god-wrought"><strong>“What hath God wrought”</strong></h2>
<p><img src="https://sheridanvoice.com/blog/wp-content/uploads/2022/05/Morse-What-Hath-God-Wrought-2000x559.jpg" alt="Morse Code: What Hath God Wrought! – By Daniel Sheridan – Sheridan Voice" /></p>
<p>So our protagonist is <strong>Samuel F. Morse</strong>, a successful portrait artist who got hooked on the idea of an electric telegraph way back in 1832. The problem at the time was that other telegraph systems needed multiple wires – a mess to manage over long distances. So Morse came up with the idea of sending electrical pulses through a single wire with an electromagnet attached at the end of it. A springy metal bar (the armature) is attached to the electromagnet. When the magnet pulls, the armature makes a satisfying "click" as it hits a contact.</p>
<p><img src="https://cdn.britannica.com/18/146618-050-B398A5DB/telegraph-transmitter-Morse-Code.jpg" alt="Morse Code original telegraph key" /></p>
<p>After years of tinkering and getting partners on board (shoutout to Alfred Vail!), Morse strung a telegraph line between Washington D.C. and Baltimore. In 1844, he tapped out the now-famous message, "What hath God wrought!" It wasn't just a test; imagine the hype, like sending the first ever tweet!</p>
<p><img src="https://scoutlife.org/wp-content/uploads/2007/02/morsecode-1.jpg" alt="Morse code for English Alphabets" /></p>
<p>Morse's real genius wasn't just the tech; it was the code itself. Instead of a whole alphabet, he figured you could represent everything with just dots and dashes. He assigned the most common letters the shortest codes.</p>
<p><img src="https://pi.math.cornell.edu/~mec/2003-2004/cryptography/subs/frequency.jpg" alt="English alphabet frequency distribution" /></p>
<p>The most frequently occurring letter in the English alphabet is the letter "e," and its corresponding Morse code representation is a single dot. Similarly, the second most common letter, "t," is represented by a single dash in Morse code.</p>
<p>But letters like "z" are represented by two dashes followed by two dots. So if you had to say something like "Jazz" in Morse it would become</p>
<p><code>dot dash dash dash  dot dash  dash dash dot dot  dash dash dot dot</code></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1711286769007/6f9fc4ee-fffc-4252-9b8e-728848c3469c.png" alt class="image--center mx-auto" /></p>
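<p>You can reproduce that encoding with a few lines of Python. I'm only including the handful of letters we need here, taken from the standard International Morse table:</p>

```python
# A tiny slice of the International Morse table -- just the letters in "jazz".
MORSE = {"j": ".---", "a": ".-", "z": "--.."}

def to_morse(word):
    # Dots and dashes within a letter, one space between letters.
    return " ".join(MORSE[ch] for ch in word.lower())

print(to_morse("Jazz"))  # .--- .- --.. --..
```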
<p>But here is my question, can we call this a “compression” algorithm? Like in the last case is it performing any compression?</p>
<p>So, let’s understand what qualifies as a compression algorithm.</p>
<h2 id="heading-what-is-a-compression-algorithm">What is a compression algorithm?</h2>
<p>Let’s say <code>s</code> is a string of bits of length <code>n</code>, and then <code>C(s)</code> is a function that returns a stream of bits which is a "compressed" version of <code>s</code>.</p>
<p>For a compression algorithm to be useful, we should be able to decompress its output. It would be stupid to have an algorithm that just compresses stuff with no way to get back the original.</p>
<p>Hence, there should also exist an inverse of <code>C(s)</code>, let’s call it <code>D(s)</code> such that <code>D(C(s)) = s</code>.</p>
<p>One small thing to note here: if after decompression we get back the exact original data, it’s called <strong>lossless compression</strong>.</p>
<p>There is another kind of compression where some information of less importance – less noticeable to the human eye or ear – is selectively discarded to achieve much smaller file sizes. This is called <strong>lossy compression</strong> (e.g. JPEG). Let’s just focus on lossless compression for now.</p>
<h2 id="heading-what-makes-a-compression-algorithm-good">What makes a compression algorithm good?</h2>
<p>So, how do we know if it's a good compression algorithm? Let’s define a formula for that.</p>
<p>Let’s call it the compression ratio: the length of the original data divided by the length of the compressed data.</p>
<p>$$\text{Compression Ratio} = \frac{\text{Length of Original String}}{\text{Length of Compressed String}}$$</p><p>$$\text{i.e } \text{CR} = \frac{|s|}{|C(s)|}$$</p><p>So, if our original data is <code>100</code> bits long and after compression it becomes 50 bits long then the compression ratio is <code>100/50</code> i.e <code>2</code>.</p>
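<p>As a quick sanity check, here is the same arithmetic as a one-line Python function (the function name is mine):</p>

```python
def compression_ratio(original_len, compressed_len):
    # CR = |s| / |C(s)|
    return original_len / compressed_len

print(compression_ratio(100, 50))  # 2.0, matching the example above
```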
<h3 id="heading-case-1-when-cr-1">Case #1: When CR = 1</h3>
<p>If the compression ratio is 1, the length of the data before compression is the same as the length of the data after compression. Well, then what’s the point of calling it compression?</p>
<h3 id="heading-case-2-when-cr-gt-1">Case #2: When CR &gt; 1</h3>
<p>This is only possible when the compressed data is shorter than the original data. Which is what we want, right? Well, here is the catch.</p>
<p>Let’s say we have a compression algorithm <code>C(s)</code> which has a compression ratio greater than 1.</p>
<p>In the worst case:</p>
<p>If <code>|s| = n</code>, then <code>|C(s)|</code> will be <code>n-1</code>, i.e. we are saying that the compression algorithm is only able to save 1 bit of space. That sounds very inefficient, but let’s hold on to that.</p>
<p>What if we apply the same compression algorithm again? Then</p>
<p>$$|C(C(s))| = n-2$$</p><p>If we continue doing that:</p>
<p>$$|C(C(C(s)))| = n-3$$</p><p>Now you see where this is leading? By applying the same compression algorithm again and again, we would be able to compress any data down to 0 bits. That is impossible, and here lies the fallacy.</p>
<p><strong>Example:</strong></p>
<p>Let’s take an example to understand this deeper. Let’s say we want to compress 2 bits of data. And we want to achieve at least some compression so the compressed data should be of 1 bit.</p>
<p>So, one possible input is <code>10</code> and we want to compress it to <code>1</code>, and let’s say we want to compress <code>01</code> to <code>0</code>. But what about <code>11</code> or <code>00</code>? We cannot compress them to <code>1</code> because we are using that for <code>10</code>, nor can we use <code>0</code> because we are using that for <code>01</code>.</p>
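<p>This counting argument is easy to verify by brute force with a throwaway Python sketch:</p>

```python
from itertools import product

n = 2
# All 2^n bit strings of length exactly n.
inputs = ["".join(bits) for bits in product("01", repeat=n)]
# All strictly shorter bit strings: the empty string, "0", and "1".
shorter = [""] + ["".join(bits) for bits in product("01", repeat=1)]

# Four 2-bit inputs but only three strictly shorter strings exist, so no
# lossless scheme can map every 2-bit input to something shorter.
print(len(inputs), len(shorter))  # 4 3
```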
<p>We can actually prove this with one of my personal favorite results: the <strong>Pigeonhole Principle</strong>.</p>
<p>Take some integer <code>n &gt;= 1</code> and consider the set <code>S</code> of <code>2^n</code> strings of length exactly <code>n</code>. Let's define</p>
<p>$$S_c = \{C(s):s \in S\}$$</p><p>$$\text{Note that, } |S_c| = |S| = 2^n$$</p><p>So, if every string in <code>S_c</code> has length at most <code>n-1</code> then size of <code>S_c</code> can be at most:</p>
<p>$$|S_c| \leq 2^0 + 2^1 + ... + 2^{n - 1} = \sum_{i=0}^{n-1} 2^i = 2^n - 1 \text{ (contradiction)}$$</p><p>$$\text{Therefore, there exists } s \in S \text{ such that } |C(s)| &gt; n - 1.$$</p><h2 id="heading-conclusion">Conclusion</h2>
<p>The conclusion from this proof is that <strong><em><mark>"Compression is a Zero-Sum game"</mark></em></strong>. For some input to be compressed to something smaller there will be some input that will be compressed to something larger.</p>
<p>Don’t believe me? Try this online tool (<a target="_blank" href="https://beautifycode.net/deflate-compression">link</a>) to compress the following text using DEFLATE, the algorithm behind gzip which kind of runs the entire internet as of now.</p>
<pre><code class="lang-plaintext">ykjNyclXKMnILFYAosSivMQkheLUPIWMUXFyxAEAAAD
</code></pre>
<p>You will see the compressed string is larger than the original string.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1711288506404/b099950f-0a69-44ba-bea3-0baee8349843.png" alt class="image--center mx-auto" /></p>
<p>Whereas if you give it an input like the text shown in the image below, we get a really good compressed result. Most of the compression algorithms used nowadays are very good at getting rid of redundancy, which occurs a lot in our day-to-day lives – which is also why some characters in the English alphabet have a higher frequency than others.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1711288528728/61071cf7-190a-4851-af04-b6024bca5e68.png" alt class="image--center mx-auto" /></p>
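<p>You can run the same experiment locally with Python's <code>zlib</code> module, which implements DEFLATE (the exact sizes depend on the zlib version, but the direction of the result should hold):</p>

```python
import zlib

# High-entropy, random-looking input: DEFLATE makes it *bigger*.
noisy = b"ykjNyclXKMnILFYAosSivMQkheLUPIWMUXFyxAEAAAD"
# Redundant, "natural" input: DEFLATE shrinks it dramatically.
redundant = b"to be or not to be " * 50

print(len(noisy), len(zlib.compress(noisy)))          # compressed ends up larger
print(len(redundant), len(zlib.compress(redundant)))  # compressed is far smaller
```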
<p>And this is the truth: most compression algorithms focus on compressing inputs that occur commonly in practice rather than random streams of noise.</p>
<p>Therefore, if you are planning to create a new compression algorithm, you should ensure that it achieves good compression of actual data at the expense of bad compression of random noisy data, because we rarely care about the latter.</p>
<h2 id="heading-getting-back-to-where-we-started">Getting back to where we started...</h2>
<p>So, getting back to the question we started with, can something like Morse code be considered a compression algorithm? Well, in my opinion, it sure can. We can represent letters that show up a lot with less effort, but we have to make do with longer codes for the letters that don't show up as often, like Z, Q, and so on. And that's basically what compression is all about, as we just saw.</p>
<p>That spirit of squeezing the most meaning into the fewest bits? That's the heart of compression, even with today's fancy algorithms.</p>
]]></content:encoded></item><item><title><![CDATA[One Instruction to Rule Them All: Exploring OISC]]></title><description><![CDATA[Background
As software developers, our daily routine revolves around crafting programs in familiar languages such as JavaScript, C++, Python, Rust, Java, and more. We immerse ourselves in mastering the syntax of these languages and then put the keywo...]]></description><link>https://arnabsen.dev/exploring-oisc</link><guid isPermaLink="true">https://arnabsen.dev/exploring-oisc</guid><category><![CDATA[Tutorial]]></category><category><![CDATA[Computer Science]]></category><category><![CDATA[Developer]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Mon, 13 Nov 2023 03:27:01 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1699845930166/9b0c021c-814c-415b-bc88-c432b303e589.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1 id="heading-background">Background</h1>
<p>As software developers, our daily routine revolves around crafting programs in familiar languages such as JavaScript, C++, Python, Rust, Java, and more. We immerse ourselves in mastering the syntax of these languages and then arrange the keywords so that they achieve our business logic. But the question is: <strong>How does the computer comprehend this diverse array of languages?</strong></p>
<p>Most of the languages that we use come under the category of High-Level Languages. They are high-level in the sense that they provide a higher level of abstraction and allow us to write more "human-readable" code.</p>
<p>Let's take a look at this super fancy C Code, which most of you have written at some point in your career as a developer:</p>
<pre><code class="lang-c"><span class="hljs-meta">#<span class="hljs-meta-keyword">include</span> <span class="hljs-meta-string">&lt;stdio.h&gt;</span></span>

<span class="hljs-function"><span class="hljs-keyword">int</span> <span class="hljs-title">main</span><span class="hljs-params">()</span> </span>{
    <span class="hljs-built_in">printf</span>(<span class="hljs-string">"hello world"</span>);
}
</code></pre>
<p>A computer doesn't understand these words like "printf", "void", "main". They are meant for us humans to write code faster and more efficiently. Once the compiler compiles the code into Machine Language it looks like this:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1699840572578/c0147791-9bf3-4590-bb95-75e3011da7e2.png" alt class="image--center mx-auto" /></p>
<blockquote>
<p>Note: This assembly code is for Linux using the x86 architecture. The actual machine code can vary based on the target system.</p>
</blockquote>
<p>You can notice the words like <code>push</code>, <code>mov</code>, <code>call</code>, <code>nop</code>, <code>pop</code> etc. These are commands that the CPU can understand and they all belong to something called an <strong>instruction set</strong>.</p>
<p><img src="https://miro.medium.com/v2/resize:fit:1400/1*8j2PmhExz4q87OoddaH7ag.png" alt class="image--center mx-auto" /></p>
<p>It serves as an interface between the hardware and the software, allowing us to write software that can run on a specific architecture.</p>
<p>The instruction set of a processor typically includes a variety of instructions that can perform operations such as arithmetic, logic, data movement, control flow, and more. Here are some common types of instructions found in an instruction set:</p>
<ol>
<li><p><strong>Arithmetic Instructions:</strong></p>
<ul>
<li>Addition (<code>ADD</code>), subtraction (<code>SUB</code>), multiplication (<code>MUL</code>), division (<code>DIV</code>), etc.</li>
</ul>
</li>
<li><p><strong>Logic Instructions:</strong></p>
<ul>
<li>Logical AND (<code>AND</code>), logical OR (<code>OR</code>), logical XOR (<code>XOR</code>), bitwise operations, etc.</li>
</ul>
</li>
<li><p><strong>Data Movement Instructions:</strong></p>
<ul>
<li>Load (<code>LOAD</code>), store (<code>STORE</code>), move (<code>MOV</code>), etc.</li>
</ul>
</li>
<li><p><strong>Control Flow Instructions:</strong></p>
<ul>
<li>Jump (<code>JMP</code>), conditional jump (<code>JZ</code> for zero, <code>JNZ</code> for not zero, etc.), subroutine call and return, etc.</li>
</ul>
</li>
<li><p><strong>Comparison Instructions:</strong></p>
<ul>
<li>Compare (<code>CMP</code>), test for equality (<code>EQ</code>), test for greater than (<code>GT</code>), etc.</li>
</ul>
</li>
<li><p><strong>Input/Output Instructions:</strong></p>
<ul>
<li>Input (<code>IN</code>), output (<code>OUT</code>), system calls, etc.</li>
</ul>
</li>
</ol>
<p>The specific instructions available in an instruction set can vary between different CPU architectures. For example, x86 and ARM are two different instruction set architectures commonly used in today's processors, and they have distinct sets of instructions.</p>
<p>In today's world, we have it easy, we just have to write the high-level code and the compiler translates the high-level code into the machine code instructions that the processor understands, based on the instruction set architecture.</p>
<p>Although, understanding the instruction set is essential for low-level programming, optimization, and when working with assembly language or writing programs that directly interact with hardware.</p>
<p>In this blog, I will talk about one specific kind of Instruction Set which I found really fascinating.</p>
<h1 id="heading-oisc">OISC</h1>
<p>OISC stands for One-Instruction Set Computer. It is also called a Single Instruction Programming Language or an Ultimate Reduced Instruction Set Computer (URISC).</p>
<p>As the name suggests, this instruction set has just one instruction.</p>
<p><em>Okay, but you might think, what's so special about that? Anyone can create an instruction set that has just one instruction and call it an OISC.</em></p>
<p>Well, not really. Most mainstream programming languages are able to solve such a wide variety of problems and computations because they are all <strong>Turing Complete.</strong> If a programming language is Turing complete, it means that it can theoretically compute anything that is computable, given enough time and resources. (I'll do a separate blog on Turing Completeness, but for the time being, you can understand it like a test for programming languages)</p>
<blockquote>
<p>Here is a movie recommendation: "The Imitation Game" based on the life of Alan Turing.</p>
</blockquote>
<p><img src="https://tfipost.com/wp-content/uploads/2022/12/The-Imitation-Game.jpg" alt="Review: If you haven't watched 'The Imitation Game' Watch now" /></p>
<p>The OISC that I am going to talk about is also Turing complete, but the best part is that, unlike the Turing machine, it doesn't need an infinite memory model. Hence, it is closer to a real computer than to a Turing machine. And the instruction is <code>subleq</code> (<strong>Subtract and branch if less than or equal to zero</strong>).</p>
<p>For an instruction set to be Turing complete (or, in simple words, to be able to express arbitrarily complex computations) it needs to have some sort of conditional branching. You can imagine this to be something like</p>
<pre><code class="lang-bash"><span class="hljs-keyword">if</span> &lt;condition&gt;
goto &lt;branch&gt;
</code></pre>
<p>Now, these machines can be categorized into 3 types:</p>
<ol>
<li><p>Bit manipulating machines</p>
</li>
<li><p>Transport triggered machines</p>
</li>
<li><p>Arithmetic-based machines</p>
</li>
</ol>
<p><code>mov</code> is another such instruction. The <code>mov</code> instruction copies the data item referred to by its second operand (i.e. register contents, memory contents, or a constant value) into the location referred to by its first operand (i.e. a register or memory).</p>
<pre><code class="lang-c">mov a, b
</code></pre>
<pre><code class="lang-c"><span class="hljs-comment">//it is equivalent to</span>
*a = *b;
</code></pre>
<p>This <code>mov</code> instruction is Turing complete and comes under <strong>2. Transport Triggered Machine</strong>. There is an amazing repo that showcases the fact that any C program can be compiled into a program written only using <code>mov</code> instructions. You can check it out <a target="_blank" href="https://github.com/xoreaxeaxeax/movfuscator">here</a>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1699843933062/d64abd2a-370b-4439-b3dc-233424a9ee47.png" alt class="image--center mx-auto" /></p>
<p>On the left, you have the code in Assembly, and on the right, you have the same code which does the exact same thing but it only contains the instruction <code>mov</code>.</p>
<p>Arithmetic-based Turing-complete machines use an arithmetic operation and a conditional jump. This is what <code>subleq</code> does.</p>
<p>Let's simplify the <code>subleq</code> operation. If we do <code>subleq a, b, c</code> it comes down to:</p>
<pre><code class="lang-c">Mem[b] = Mem[b] - Mem[a];
<span class="hljs-keyword">if</span> (Mem[b] &lt;= <span class="hljs-number">0</span>) <span class="hljs-keyword">goto</span> c;
</code></pre>
<p>So, you can see there is a basic arithmetic operation and one conditional branch (which, as we already mentioned, is required for Turing completeness). There is syntactic sugar for the same, which goes like this:</p>
<pre><code class="lang-c"> subleq a, b   ; Mem[b] = Mem[b] - Mem[a]
               ; <span class="hljs-keyword">goto</span> next instruction
</code></pre>
<p>Here we basically drop the <code>c</code>, which means we aren't branching anywhere but rather moving on to the next instruction. Now the big question: <strong>is this one instruction enough?</strong> Well, as it turns out, yes, it is.</p>
<p>Let's look at some programs using this instruction:</p>
<h2 id="heading-example-1">Example 1</h2>
<pre><code class="lang-c">; initially *z = <span class="hljs-number">0</span>
subleq a, z
subleq z, b
subleq z, z
</code></pre>
<p>let's break this down</p>
<pre><code class="lang-c"><span class="hljs-comment">//initally</span>
*z = <span class="hljs-number">0</span>;

<span class="hljs-comment">//line 1</span>
*z = *z - *a <span class="hljs-comment">// since *z=0, this line boils down to</span>
*z = -*a

<span class="hljs-comment">//line 2</span>
*b = *b - *z <span class="hljs-comment">// *z = -*a, this line boils down to</span>
*b = *b - (- *a)
*b = *b + *a

<span class="hljs-comment">//line 3</span>
*z = *z - *z
*z = <span class="hljs-number">0</span> <span class="hljs-comment">//we started with *z=0 and ended with *z=0</span>
</code></pre>
<p>What we see is <code>*b = *b + *a</code> i.e. this 3-line instruction performs addition.</p>
<blockquote>
<p>The reason we use (*) before a, b, z is that they represent memory addresses, and *a represents the value stored in that particular memory block.</p>
<p>The last line is important because we have to ensure we do not end up changing values that we are not dealing with; here we were just dealing with a and b. z was initially 0 and should finally be 0 as well. In other words, there shouldn't be any side effects.</p>
</blockquote>
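<p>A quick way to convince yourself is to simulate it. Below is a minimal <code>subleq</code> interpreter in Python (an illustrative sketch, not any canonical implementation; the <code>(a, b, c)</code> tuple format is my own), running the three instructions of Example 1:</p>

```python
def run_subleq(mem, program):
    """Tiny subleq machine. Each instruction is a triple (a, b, c):
    mem[b] -= mem[a]; jump to c if mem[b] <= 0.
    c = None means "fall through to the next instruction"."""
    pc = 0
    while 0 <= pc < len(program):
        a, b, c = program[pc]
        mem[b] -= mem[a]
        if mem[b] <= 0 and c is not None:
            pc = c
        else:
            pc += 1
    return mem

# Example 1: addition. mem[0] = *a, mem[1] = *b, mem[2] = *z = 0
mem = run_subleq([7, 5, 0], [
    (0, 2, None),  # subleq a, z : *z = -*a
    (2, 1, None),  # subleq z, b : *b = *b + *a
    (2, 2, None),  # subleq z, z : *z = 0
])
print(mem)  # [7, 12, 0]
```

<p>Changing the initial memory shows the same three instructions adding any pair of values, with <code>*z</code> always restored to 0 at the end.</p>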
<p>Addition is not the only operation; other instructions can be implemented with the <code>subleq</code> instruction as well.</p>
<h2 id="heading-example-2">Example 2</h2>
<pre><code class="lang-c">; initially *z=<span class="hljs-number">0</span>
subleq b, b
subleq a, z
subleq z, b
subleq z, z
</code></pre>
<p>You can work these out and you will come to the conclusion that it ends up assigning the value of a to b.</p>
<p>Or you can apply <em>200IQ</em> and notice that this is exactly the same as the previous example, except that a new instruction <code>subleq b, b</code> was added at the beginning. We know the other 3 lines implement <code>*b = *b + *a</code>, and the first line is basically <code>*b = 0</code>; hence what we finally get is <code>*b = 0 + *a</code>, i.e. <code>*b = *a</code>.</p>
<h2 id="heading-example-3">Example 3</h2>
<pre><code class="lang-c">subleq z, z, c
</code></pre>
<p>This is very basic, it will just jump to the branch <code>c</code>.</p>
<p>The interesting thing is that this programming language, with its one instruction, is <strong>just as powerful</strong> as any other programming language like C++, Java, etc.</p>
<p>It may not be very efficient or easy to write, but in the end all of these fall under the class of Turing-complete languages. Yet there are some interesting benefits to employing a one-instruction computer. For example, hardware-level functionality is simplified when implemented around a single instruction: the same functional element is repeatedly used to form the processor core. Another advantage is that since all the instructions are the same, the instruction decoder circuitry and its complexity can be eliminated.</p>
<h1 id="heading-why-bother-about-oisc">Why bother about <strong>OISC?</strong></h1>
<p>OISC architectures provide an excellent paradigm for implementing traditional von Neumann computers using non-traditional materials. Simply put, a practical computer can be built by massive scaling of simple, single instruction elements. Embracing OISC challenges us to strip away the layers of abstraction, inviting a deeper exploration of the foundational principles that govern all programming.</p>
<p><img src="https://www.researchgate.net/publication/329842259/figure/fig1/AS:706304234491904@1545407460536/The-von-Neumann-Architecture.png" alt="The von Neumann Architecture | Download Scientific Diagram" class="image--center mx-auto" /></p>
<hr />
<p>I hope you found my blog informative. If you have any feedback share it in the comments. You can sign up for the Hashnode newsletter to get notified every time I post a blog. Learn more about me at <a target="_blank" href="http://arnabsen.dev/about"><strong>arnabsen.dev/about</strong></a>. Have a nice day.</p>
]]></content:encoded></item><item><title><![CDATA[Row, Row, Raft Your Nodes: A Guide to Consensus in Distributed Systems]]></title><description><![CDATA[Introduction
I have been recently diving into the world of Distributed systems and I came across a rather interesting paper: "In Search of an Understandable Consensus Algorithm". What caught my attention was the paper's primary objective of creating ...]]></description><link>https://arnabsen.dev/raft-algorithm</link><guid isPermaLink="true">https://arnabsen.dev/raft-algorithm</guid><category><![CDATA[distributed system]]></category><category><![CDATA[System Design]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 22 Oct 2023 17:40:50 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1697996382544/3c32ce45-c344-46c2-9933-16bd309e00f5.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1 id="heading-introduction">Introduction</h1>
<p>I have been recently diving into the world of Distributed systems and I came across a rather interesting paper: <a target="_blank" href="https://raft.github.io/raft.pdf"><em>"In Search of an Understandable Consensus Algorithm"</em></a><em>.</em> What caught my attention was the paper's primary objective of creating a consensus algorithm that is <strong>easy to understand</strong>. Typically, in the realm of new research, we tend to emphasize factors like efficiency, correctness, and conciseness while overlooking how accessible a concept is for others to grasp and apply in practical scenarios.</p>
<p>We often neglect how easy it is for someone to grasp a concept and use it to build practical stuff or solve real problems. Sometimes we do the opposite: if something works but is very hard to understand, we consider it to be very "clever" and end up rewarding complexity. That's why I appreciate the approach taken by the authors, Diego Ongaro and John Ousterhout, in prioritizing the algorithm's comprehensibility.</p>
<p>With many major corporations transitioning to distributed systems, having an algorithm that is both easy to understand and intuitive can empower developers to create systems more seamlessly and devise more efficient implementations.</p>
<h1 id="heading-so-what-is-consensus">So, what is Consensus?</h1>
<p>Let's start with a simple analogy:</p>
<p>You have been invited to a party and you have dressed up all nice and good. Then you ask your 4 friends if you are looking good (if you don't have so many friends just imagine you do, at this point, you must be good at it). Two of them say that you are looking good, and the rest say you are not. What are you supposed to do?</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697975592643/8d8e522a-d77b-4b86-9fda-04b0ccb1b633.png" alt class="image--center mx-auto" /></p>
<p>But if 3 of them said that you do look good (quite unusual right?) then you can go with the majority decision.</p>
<p>This is consensus !!</p>
<p>In distributed systems, we know that we have multiple nodes, and consensus refers to coming to an agreement.</p>
<blockquote>
<p>Consensus simply means to get all the nodes in distributed computing to agree on a common value or decision, despite the presence of faults, delays, or unreliable communication. This agreement is crucial for ensuring that the distributed system functions correctly and consistently.</p>
</blockquote>
<p>Even though it sounds simple, many broken systems have been built in the mistaken belief that this problem is easy to solve.</p>
<p>There is a pattern in most of the fault-tolerant systems. For example, in MapReduce, the computation is replicated but the whole thing is controlled by a single master. Similarly, we have GFS which replicates the data and has a single master to determine who the primary is for the piece of data. The benefit of having such a single master system is that there will be no disagreement. But at the same time, it makes it a single point of failure. So we need some sort of consensus to avoid such failures.</p>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text">It's fine if you don't have an understanding of MapReduce or GFS, they were just examples. You can read my blog on MapReduce</div>
</div>

<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.dev/understanding-mapreduce">https://arnabsen.dev/understanding-mapreduce</a></div>
<p> </p>
<h1 id="heading-history-of-consensus-protocols">History of Consensus Protocols</h1>
<p>Well, now that I think about it, "history" is an overstatement, because there is just one very popular protocol called "Paxos" which has been the gold standard for a long, long time. First submitted in 1989 by <strong>the Leslie Lamport</strong>, the Paxos protocol is named after a fictional legislative consensus system on the island of Paxos in Greece.</p>
<p><img src="https://charlesxu.io/assets/images/sys-design/cover.jpg" alt="System Design Interview: Scaling Single Server" /></p>
<blockquote>
<p>FYI Leslie Lamport was the creator of LaTeX. He also won the Turing Award which is like the Nobel Prize for Computer Science.</p>
</blockquote>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/rkZzg7Vowao">https://youtu.be/rkZzg7Vowao</a></div>
<p> </p>
<p>The problem with Paxos was that it was really hard to wrap our minds around. It also only solved a portion of the problem, which made it difficult to build systems around Paxos.</p>
<h1 id="heading-raft-algorithm">Raft Algorithm</h1>
<p>As mentioned before the authors' primary design goal when building Raft was "understandability".</p>
<div data-node-type="callout">
<div data-node-type="callout-emoji">🤓</div>
<div data-node-type="callout-text">Even the author of Raft found it difficult to understand Paxos and he once said that it took him building a new consensus algorithm to really understand how Paxos worked and its correctness.</div>
</div>

<p>Now this doesn't mean that Raft is very very easy to understand, there are still a lot of intricacies and edge cases to cover, but I will try my best to give a brief overview of the entire thing. So let's start.</p>
<h2 id="heading-what-are-we-dealing-with">What are we dealing with?</h2>
<p>We have a client, which can be a user or a service etc. And it tries to fetch some data from the server we have which is nothing but a cluster of nodes.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697979457327/2dee5b50-4dc0-4198-b872-d0703b805528.png" alt class="image--center mx-auto" /></p>
<p>Now, to ensure consensus we need the nodes to agree on a particular value and then send that to the client.</p>
<p><strong>How do we ensure that all the nodes have the same value?</strong> Raft achieves that through something called a <strong>"Replicated Log".</strong></p>
<p>It ensures that all the commands (or operations) that need to be performed are replicated across the nodes in the same order. So, let's say you have a variable <code>X</code> initially, the value is <code>0</code> and then you perform some operations like:</p>
<ul>
<li><p><code>add 5</code></p>
</li>
<li><p><code>mul 10</code></p>
</li>
<li><p><code>sub 15</code></p>
</li>
</ul>
<p>If these commands were logged in the same order, then the final value of <code>X</code> will be the same on all the nodes, assuming that the operations in the nodes (which are also referred to as state machines) are deterministic.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697981500138/7195a039-5805-4547-a346-4a590fb4781c.png" alt class="image--center mx-auto" /></p>
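<p>The idea is easy to see in code. Here is a tiny Python sketch (the command names and tuple format are made up for illustration) that replays the log above on a state machine:</p>

```python
def apply_log(x, log):
    """Replay a replicated log of (op, operand) commands on state x.
    Because each operation is deterministic, every node that applies
    the same log in the same order ends up in the same state."""
    ops = {"add": lambda v, n: v + n,
           "mul": lambda v, n: v * n,
           "sub": lambda v, n: v - n}
    for op, n in log:
        x = ops[op](x, n)
    return x

log = [("add", 5), ("mul", 10), ("sub", 15)]
# Every node computes ((0 + 5) * 10) - 15:
print(apply_log(0, log))  # 35
```

<p>Note that the order matters: replaying the same commands in a different order generally yields a different state, which is exactly why Raft works so hard to keep the logs identical.</p>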
<p>So, when a new command gets proposed, it first gets logged on all the nodes (at least most of the nodes) at the end of their logs. Only when the log gets properly replicated, does the node perform its computation and send the result to the original client.</p>
<p>The idea behind this is if the logs get replicated properly the cluster can continue functioning as long as the majority of the nodes are up (Yes, it's more like democracy, the majority wins).</p>
<h2 id="heading-who-will-instruct-the-nodes-on-what-to-do">Who will instruct the nodes on what to do?</h2>
<p>Again similar to democracy, here also we have a leader node elected by other nodes. The leader node is responsible for talking to the client, giving commands to the other nodes, etc.</p>
<p>Before diving deep into the functioning of the leader let's see how the nodes become a leader.</p>
<h3 id="heading-server-states">Server States</h3>
<p>The nodes/server have 3 states:</p>
<ul>
<li><p>Follower</p>
</li>
<li><p>Candidate</p>
</li>
<li><p>Leader</p>
</li>
</ul>
<p>In normal service, there will be just one Leader and all the other servers will be Followers.</p>
<p>Followers are passive, they issue no requests and can only respond to requests from Leaders and Candidates.</p>
<p>The Leader handles all the client requests. It also sends regular heartbeats (empty requests) to the Followers to maintain authority. <em>More like it's saying "Don't get too excited, I am still your leader".</em></p>
<p>If a Follower receives no communication over a time period (called <strong>election timeout</strong>) it assumes that no Leader is available and starts the process of <strong>Leader Election</strong>.</p>
<h3 id="heading-leader-election">Leader Election</h3>
<p>Let's talk about <strong>terms</strong> before diving into Leader Election. To have consensus, we need a mechanism to detect obsolete information. Say a particular server was the leader, then stopped responding, and a new leader was appointed. We need to ensure that the other servers don't take instructions from the old leader anymore. Raft achieves this by dividing time into intervals of arbitrary length called <strong>terms</strong>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697989611948/05a6e6a7-ed55-498b-8a50-7081d31c6c71.png" alt class="image--center mx-auto" /></p>
<p>Each term starts with an election of the leader and then if the election is successful the leader rules for some period of time which is represented in green in the above diagram.</p>
<blockquote>
<p>You can see how this is similar to the administrative process of a country where each leader has a term like 5 years and then post that election happens to elect a new leader.</p>
</blockquote>
<p>Now, the interesting thing in Raft is that there is no global concept of a <strong>term</strong> shared among the servers. Each server keeps track of its own current term.</p>
<p>When the election process starts, the follower increments its current term index and becomes a candidate. It votes for itself and then sends a specific request to the other nodes asking for their vote, in other words requesting to be the leader. This happens via a special kind of request called a <code>RequestVote RPC</code>.</p>
<p>Whenever a server communicates with another, it includes what it thinks the current term index is. The other server then responds by including its own term index.</p>
<p>Now, if there is a mismatch, the server with the lower term starts having an identity crisis and falls back to being a follower. And the other server just ignores it.</p>
<p>A server is entitled to vote for only one candidate in a given term, and that happens on a first-come, first-served basis. So by the end of the process, the candidate that receives votes from a majority of the servers becomes the leader, and it then sends heartbeat messages to all of the other servers to establish its authority and prevent new elections.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697991199037/c1093f81-f805-45d0-b9d3-027985d3dd1b.png" alt class="image--center mx-auto" /></p>
<p>Here I had a doubt: what if all the servers realize at the same time that the leader no longer exists (or that the current term is over), vote for themselves, and request votes simultaneously? They would just end up waiting indefinitely to get majority votes. Or what if two candidates receive the same number of votes (this is called a split vote)?</p>
<p>Raft solves this by using randomized election timeouts, which means election timeouts are chosen randomly from a fixed interval (e.g., 150-300 ms). This way in most cases only one server will timeout at a time, become a candidate, and will request a vote before the other server times out. Even in case of a split vote, each candidate restarts its randomized election timeout at the start of an election and it waits for that timeout to elapse before starting the next election. This randomness drastically reduces the chances of another split vote.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697991568136/26dbcb95-8cd0-4355-8644-cae2ed4a7ace.png" alt class="image--center mx-auto" /></p>
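<p>A sketch of what the randomized timeout looks like (the 150-300 ms interval comes from the paper; the function name and everything else here is illustrative):</p>

```python
import random

def election_timeout_ms(low=150, high=300):
    """Each follower independently draws its election timeout from a
    fixed interval. The node with the smallest draw times out first,
    becomes a candidate, and usually wins the election before any
    other node even notices the leader is gone."""
    return random.uniform(low, high)

random.seed(42)  # only for a reproducible illustration
timeouts = sorted(election_timeout_ms() for _ in range(5))
# The gap between the two smallest timeouts is the head start the
# first candidate gets to collect votes:
head_start = timeouts[1] - timeouts[0]
```
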
<h2 id="heading-how-does-the-normal-operation-happen">How does the normal operation happen?</h2>
<p>Let's go through how a normal operation will take place with consensus in Raft.</p>
<p>First, the client sends a command to the leader. Wait, how will the client know who the leader is? Well, the client can send it to any server and if the server is a follower it will simply redirect it to the leader.</p>
<p>The Leader appends the command to its log and sends another request (called an <code>AppendEntries RPC</code>) to all its followers. The followers, on receiving this request, just append the new entry to their existing log records. Note that at this point they only append it and don't actually perform any computation. It's the leader who decides when it is safe to apply the log entry, i.e. when the entry is to be committed. This is similar in spirit to a two-phase commit (2PC).</p>
<p>The append request contains two identifiers: the term index, which I already mentioned before, and the log index, which is the position of the new entry in the log. The leader uses these two values to determine whether the majority of the servers have their logs up to date, and then issues a commit.</p>
<p>In case the logs become inconsistent, the leader forces the followers to duplicate the leader's log. To bring a follower’s log into consistency with its own, the leader finds the latest log entry where the two logs agree, deletes any entries in the follower’s log after that point, and sends the follower all of the leader’s entries after that point.</p>
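<p>The repair step can be sketched in a few lines of Python (the entry format and function name here are hypothetical, purely for illustration):</p>

```python
def repair_follower_log(leader_log, follower_log):
    """Find the latest index where the two logs agree, delete the
    follower's entries after that point, and append the leader's
    entries after that point. Entries are (term, command) pairs."""
    i = 0
    while (i < len(leader_log) and i < len(follower_log)
           and leader_log[i] == follower_log[i]):
        i += 1
    return follower_log[:i] + leader_log[i:]

leader   = [(1, "add 5"), (1, "mul 10"), (2, "sub 15")]
follower = [(1, "add 5"), (1, "div 3")]  # diverged after index 0
print(repair_follower_log(leader, follower))
# [(1, 'add 5'), (1, 'mul 10'), (2, 'sub 15')]
```
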
<p>This is how the Leader ensures that every server (or at least the majority) has the same order of the logs i.e. consensus is reached, performs the computation, and then responds to the client.</p>
<p>This is how we can build a system with high data consistency using Raft, and many strongly consistent distributed databases, like CockroachDB, use Raft.</p>
<hr />
<p>This was a very brief overview of Raft. There are many more aspects to this that couldn't be covered in a blog and I urge you to go through the original paper. I hope you found my blog useful. If you have any feedback share it in the comments. You can sign up for the Hashnode newsletter to get notified every time I post a blog. Learn more about me at <a target="_blank" href="https://arnabsen.dev/about">arnabsen.dev/about</a>. Have a nice day.</p>
]]></content:encoded></item><item><title><![CDATA[Understanding MapReduce]]></title><description><![CDATA[MapReduce is a programming paradigm that helps us perform large-scale computation across computing clusters.

DISCLAIMER: In this blog, I will take a very simple overview of this topic. If this blog actually spikes your interest and you want to delve...]]></description><link>https://arnabsen.dev/understanding-mapreduce</link><guid isPermaLink="true">https://arnabsen.dev/understanding-mapreduce</guid><category><![CDATA[Tutorial]]></category><category><![CDATA[System Design]]></category><category><![CDATA[mapreduce]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sat, 30 Sep 2023 07:25:48 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1700986739771/03687b9c-8ec8-45c0-9bfd-05b45fd9d2e5.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote>
<p>MapReduce is a programming paradigm that helps us perform large-scale computation across computing clusters.</p>
</blockquote>
<p><strong>DISCLAIMER:</strong> In this blog, I will take a very simple overview of this topic. If this blog actually piques your interest and you want to delve into the technical nitty-gritty then I would urge you to read the original paper: <a target="_blank" href="https://static.googleusercontent.com/media/research.google.com/en//archive/mapreduce-osdi04.pdf"><strong>"MapReduce: Simplified Data Processing on Large Clusters"</strong></a> (It was written by the Top-Gs in the programming world: <a target="_blank" href="https://www.informatika.bg/jeffdean">Jeff Dean</a> and Sanjay Ghemawat).</p>
<h1 id="heading-background">Background</h1>
<p>Let's start with a little bit of background.</p>
<p>Currently at Google, I am working on a project that requires me to process a large amount of logs, perform some computations, and then write the computed results in a database. This has to happen on a daily basis and the output is then consumed by downstream teams for further operation.</p>
<p>One might think, what's the big deal in that? Just take the log file as input, implement the logic, and write the results to an output file. That's what we have always been doing.</p>
<p>Well, one thing we should take into consideration is the size of the file that we are dealing with: it is actually multiple terabytes. Taking our traditional path for problems like this would cause a lot of issues, such as:</p>
<ol>
<li><p><strong>Performance Issue:</strong> the entire thing would probably take a whole lot of time and it can have a lot of potential bottlenecks.</p>
</li>
<li><p><strong>Memory Issue:</strong> How are we even going to load the entire TBs of data into memory? We have to implement some partitioning ourselves to do it effectively.</p>
</li>
<li><p><strong>Scalability Issue:</strong> Very difficult to actually scale it to multiple machines.</p>
</li>
<li><p><strong>Fault Tolerance:</strong> Let's say you somehow managed to overcome all the problems above, what if at the last moment, something crashes, how are we going to recover?</p>
</li>
</ol>
<p>This is why Google came up with the MapReduce paradigm. There is also an open-source implementation of the same called <strong>"Apache Hadoop"</strong>.</p>
<p><img src="https://hadoop.apache.org/hadoop-logo.jpg" alt="hadoop-logo" class="image--center mx-auto" /></p>
<p>Link: <a target="_blank" href="https://github.com/apache/hadoop">https://github.com/apache/hadoop</a></p>
<h1 id="heading-whats-mapreduce">What's MapReduce?</h1>
<p>You must have already noticed that the name <strong>MapReduce</strong> is made up of two parts: <strong>Map</strong> and <strong>Reduce</strong>.</p>
<p>You might have often used these functions in Python, JavaScript, etc. Something like this:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1695550417835/76197584-24c1-45bc-a9fd-f84545c9ed39.png" alt="Map and Reduce function in JS over an array" class="image--center mx-auto" /></p>
<h2 id="heading-map-function">Map Function</h2>
<p>The <strong>Map</strong> function applies a given function to each element in a collection (such as a list or array) and produces a new collection of transformed values. It is often used to perform element-wise operations on data.</p>
<h2 id="heading-reduce-function">Reduce Function</h2>
<p>The <strong>Reduce</strong> function takes a collection and combines its elements into a single result by repeatedly applying a binary operation. It reduces a collection to a single value.</p>
<p>Here is a visualization to help you understand it better:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1695550846214/69628440-7a1d-4c20-ab9b-710cc8f8eaf6.png" alt class="image--center mx-auto" /></p>
<p>These two methods are fundamental to the Functional Programming Paradigm. I came across these when I started learning functional programming languages like OCaml, Lisp, and Rescript. The main idea is to process the data in a functional, declarative manner.</p>
<p>Inside the map function, we provide the transformational logic and in the reduce function we provide the aggregator logic, and no need to specify any logic for the looping, etc.</p>
<p>A large number of problems can actually be split into two broad logic: <strong>Map phase</strong> and <strong>Reduce phase</strong>. Some examples mentioned in the paper are:</p>
<ul>
<li><p>Distributed Grep</p>
</li>
<li><p>Counting URL access frequency</p>
</li>
<li><p>Reverse Web-Link Graph</p>
</li>
<li><p>Term-Vector per host</p>
</li>
</ul>
<p>Let me give a simple example, let's say you are working on a Music Recommendation Algorithm. Before you can actually use the data for the recommendation algorithm you will have to do some pre-processing. Usually, logs contain everything, and we may not need all of them, so we have to modify the logs to a form that is relevant to us. To further simplify things, let's say we just need the Genre of the music from the log data. We can use a map function for that.</p>
<p>Then let's say our recommendation algorithm also needs the number of songs the user has listened to in each genre. A reduce function can do this. Here is some sample Python code to help you understand better:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> functools <span class="hljs-keyword">import</span> reduce

logs = [
  {
    <span class="hljs-string">'name'</span>: <span class="hljs-string">'Shape of You'</span>,
    <span class="hljs-string">'singer'</span>: <span class="hljs-string">'Ed Sheeran'</span>,
    <span class="hljs-string">'genre'</span>: <span class="hljs-string">'pop'</span>
  },
  {
    <span class="hljs-string">"name"</span>: <span class="hljs-string">"Uptown Funk"</span>,
    <span class="hljs-string">"singer"</span>: <span class="hljs-string">"Mark Ronson ft. Bruno Mars"</span>,
    <span class="hljs-string">"genre"</span>: <span class="hljs-string">"pop"</span>
  },
  {
    <span class="hljs-string">"name"</span>: <span class="hljs-string">"Bohemian Rhapsody"</span>,
    <span class="hljs-string">"singer"</span>: <span class="hljs-string">"Queen"</span>,
    <span class="hljs-string">"genre"</span>: <span class="hljs-string">"rock"</span>
  },
  {
    <span class="hljs-string">"name"</span>: <span class="hljs-string">"Friends in Low Places"</span>,
    <span class="hljs-string">"singer"</span>: <span class="hljs-string">"Garth Brooks"</span>,
    <span class="hljs-string">"genre"</span>: <span class="hljs-string">"country"</span>
  }
]

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">update_frequency_count</span>(<span class="hljs-params">freq_dict, num</span>):</span>
    <span class="hljs-keyword">if</span> num <span class="hljs-keyword">in</span> freq_dict:
        freq_dict[num] += <span class="hljs-number">1</span>
    <span class="hljs-keyword">else</span>:
        freq_dict[num] = <span class="hljs-number">1</span>
    <span class="hljs-keyword">return</span> freq_dict

<span class="hljs-comment"># Map operation</span>
<span class="hljs-comment"># {name, singer, genre} -&gt; genre</span>
genres = list(map(<span class="hljs-keyword">lambda</span> log: log[<span class="hljs-string">'genre'</span>], logs))
<span class="hljs-comment"># genres: ['pop', 'pop', 'rock', 'country']</span>

<span class="hljs-comment"># Reduce operation</span>
frequency_count = reduce(update_frequency_count, genres, {})

print(frequency_count)
<span class="hljs-comment"># {'pop': 2, 'rock': 1, 'country': 1}</span>
</code></pre>
<p>Both of these functions are deterministic. Also, the map operation on one element doesn't depend on the result of another operation. In other words, they are independent of each other as well.</p>
<p>So we can <strong>parallelize the operations</strong> of Map and Reduce. We can further extend the same thought process by breaking the input data into several chunks and having one map function for each chunk of data. This also addresses our <strong>Fault Tolerance issue</strong>: if something breaks in one operation, it won't affect the others, because they are all running independently in parallel. We can just re-run the broken process with its small chunk of data. You can see the big picture now as to why the Map and Reduce functions were chosen specifically to solve this kind of problem.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1695564417789/694494d8-be8b-4775-ad30-b29d93468d57.png" alt class="image--center mx-auto" /></p>
<p>To quote the paper:</p>
<blockquote>
<p>"Our use of a functional model with user-specified map and reduce operations allows us to parallelize large computations easily and to use re-execution as the primary mechanism for fault tolerance."</p>
</blockquote>
<p>Let's dive deeper into the programming model, and then we will take another very popular example to understand it.</p>
<p>With MapReduce, we usually deal with key/value pairs because they're generic enough to model most data. The <strong>Map</strong> function takes an input pair and produces a set of intermediate key/value pairs. The MapReduce library groups all the intermediate values associated with the same intermediate key <code>I</code> and passes them to the <strong>Reduce</strong> function.</p>
<p>The <strong>Reduce</strong> function then accepts the intermediate key <code>I</code> and a set of values for that key, and merges those values together to form a possibly smaller set of values (it can be just one).</p>
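<p>The whole model can be sketched in a few lines of Python. This is a toy, single-process simulation (names like <code>run_mapreduce</code> are my own invention, not a real API): map every input pair, group the intermediate values by key, then reduce each group.</p>

```python
from collections import defaultdict

def run_mapreduce(inputs, map_fn, reduce_fn):
    """Toy single-process MapReduce: map, group by intermediate key, reduce."""
    # Map: each input pair produces a list of intermediate (key, value) pairs.
    intermediate = []
    for key, value in inputs:
        intermediate.extend(map_fn(key, value))
    # Shuffle/group: collect all values for each intermediate key I.
    grouped = defaultdict(list)
    for k, v in intermediate:
        grouped[k].append(v)
    # Reduce: merge the values for each key into a (usually smaller) set.
    return {k: reduce_fn(k, vs) for k, vs in grouped.items()}

# Word count expressed as a map_fn / reduce_fn pair:
result = run_mapreduce(
    [("doc1", "a b a"), ("doc2", "b c")],
    map_fn=lambda key, value: [(w, 1) for w in value.split()],
    reduce_fn=lambda key, values: sum(values),
)
print(result)  # {'a': 2, 'b': 2, 'c': 1}
```

<p>The real system distributes the three phases across machines, but the data flow is exactly this.</p>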
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1695570829503/da6f97fa-8bb0-43a4-9a0e-6ffb57e2e849.png" alt class="image--center mx-auto" /></p>
<h1 id="heading-stages-of-mapreduce">Stages of MapReduce</h1>
<p>Now let's dive into the actual working of MapReduce.</p>
<h2 id="heading-splitting-the-input">Splitting the Input</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1695740855203/59d6634a-c930-420e-aa06-bbaa478a853b.png" alt class="image--center mx-auto" /></p>
<p>The input data is actually partitioned into smaller chunks, typically 16 MB or 64 MB. Let's say the number of chunks/splits is <code>M</code>. Then there will be <code>M</code> map operations in total.</p>
<p><strong><em>Why 64MB?</em></strong> That could be a whole separate discussion, but the short version is that these large datasets are stored on the local disks of the machines that make up the cluster. <strong>Google File System (GFS)</strong> divides each file into 64MB blocks, so making the input splits the same size ensures that a map function only has to read data from one block, reducing I/O latency.</p>
<p>The Reduce invocations are also distributed by partitioning the intermediate key space into <code>R</code> pieces using a partitioning function (we will discuss this later in this blog). The number <code>R</code> is also set by the user.</p>
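<p>The splitting step itself is easy to sketch. Here is a hypothetical helper (my own, not from the paper), with a tiny chunk size just for illustration; real splits are 16&ndash;64 MB:</p>

```python
def split_input(data: bytes, chunk_size: int = 64 * 1024 * 1024):
    """Partition the input into fixed-size splits; one map task per split."""
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]

# A tiny chunk size just to illustrate the idea.
splits = split_input(b"abcdefghij", chunk_size=4)
print(len(splits))  # 3 map tasks (M = 3)
print(splits)       # [b'abcd', b'efgh', b'ij']
```
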
<h2 id="heading-building-the-army">Building the army</h2>
<p>When we start the MapReduce job, it first starts a special process called the <strong>"master"</strong>. The master, as you might have already guessed, does a lot of management-related work: assigning Map and Reduce operations, handling failures, etc.</p>
<p>The program also starts up many copies of itself on the other machines via the <code>fork</code> system call. All the processes except the master are called "worker nodes", or simply "workers". Each worker is assigned one of the <code>M</code> map tasks or <code>R</code> reduce tasks by the master.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1696057820302/4229a66d-b7cd-4641-901b-f82a398a39ff.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-map-stage">Map Stage</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1695742705418/3d64be6a-e0d9-4f51-b2a3-8bf9dd5b34ff.png" alt class="image--center mx-auto" /></p>
<p>A worker who is assigned a map task reads the contents of the corresponding input split. It parses key/value pairs out of the input data and passes each pair to the user-defined Map function. The intermediate key/value pairs produced by the Map function are buffered in memory. Since a map operation doesn't depend on any other worker, all the map workers can run in parallel, letting us scale the data-extraction step linearly.</p>
<p>Before the Reduce workers can start, these buffered pairs are written to local disk, split into <code>R</code> partitions by the partitioning function. Once the partitioning is complete, the master is informed, and it signals the partition locations to the Reduce workers so they can finish the job.</p>
<h2 id="heading-reduce-phase">Reduce Phase</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1696049156179/44877af8-7aaa-4346-9f4d-af1cb1f77023.png" alt class="image--center mx-auto" /></p>
<p>The Reduce workers then read the intermediate data using RPC calls. Once a worker has read all of its intermediate data, it sorts it by key. Why?</p>
<blockquote>
<p>So that all the occurrences of the same key are grouped together. The sorting is needed because typically many different keys map to the same reduce task.</p>
</blockquote>
<p>Let me be frank, it took me some time to wrap my head around this, and I am still not sure if I got it correctly. So if you think my understanding is wrong please don't hesitate to correct me.</p>
<p>Previously we saw that we have <code>M</code> map operations and <code>R</code> reduce operations. Sometimes the same worker is assigned multiple Reduce jobs. Sorting the data makes it easier for the worker to distinguish between the new intermediate data and the old intermediate data. Once the worker finds that the partition has some new data it starts the reduce job again.</p>
<p>In the reduce step it also performs a merge of all the values corresponding to a particular key. So, sorting the keys makes it easier to merge. This will get clear once we go through the steps again taking the example of a word counter.</p>
<p>Finally, the reduce worker goes through the sorted intermediate data and for each unique intermediate key, it passes the key and the set of intermediate values to the user's Reduce function. The output is then appended to the final output file.</p>
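<p>The sort-then-group step is easy to picture in code. Here is a small sketch (my own, not from the paper) using Python's <code>itertools.groupby</code>, which yields consecutive runs of equal keys and therefore requires sorted input:</p>

```python
from itertools import groupby
from operator import itemgetter

# Intermediate pairs as a reduce worker might fetch them from several map workers.
pairs = [("rock", 1), ("pop", 1), ("rock", 1), ("country", 1), ("pop", 1)]

# Sort by key so all occurrences of the same key sit next to each other...
pairs.sort(key=itemgetter(0))

# ...then one linear pass (groupby) hands each unique key and its values to Reduce.
totals = {key: sum(v for _, v in group)
          for key, group in groupby(pairs, key=itemgetter(0))}
print(totals)  # {'country': 1, 'pop': 2, 'rock': 2}
```

<p>Without the sort, <code>groupby</code> would see <code>rock</code> twice as two separate runs; sorting is what makes a single sequential pass sufficient.</p>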
<h2 id="heading-last-stage">Last stage</h2>
<p>When all the Reduce workers have completed execution, the master passes control back to the user program. The output of MapReduce is stored in the <code>R</code> output files that the <code>R</code> reduce workers created.</p>
<h1 id="heading-partitioning-function">Partitioning Function</h1>
<p>Let's understand what the partitioning function is all about. As a user, you can define the number of output files you want your data to split between. That number is denoted as <code>R</code> and MapReduce will assign the same number of Reduce workers. So if <code>R = 10</code>, it means that MapReduce will partition the data from the map workers into 10 splits. Each split will be assigned to one Reduce worker which means that there will be 10 Reduce workers. And then each Reduce worker will write the output to a file so the user will have their processed data split into 10 files.</p>
<p>By default, the partitioning algorithm that is used is <code>hash(key) mod R</code>. This usually gives a fairly balanced partition. In some situations, you might want to use a custom function as well. Taking the example from the paper, if your keys are a bunch of URLs and you want them to be partitioned based on the hostname, then you can use a function that extracts the hostname and then applies the hash and mod. So something like: <code>hash(getHostName(key)) mod R</code>.</p>
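<p>Here is a sketch of both partitioners. Python's built-in <code>hash()</code> is randomized per process, so this sketch uses a stable hash instead; the hostname extraction is deliberately crude and just for illustration:</p>

```python
import hashlib

def default_partition(key: str, R: int) -> int:
    """hash(key) mod R, using a stable hash so every worker agrees."""
    h = int(hashlib.md5(key.encode()).hexdigest(), 16)
    return h % R

def hostname_partition(url: str, R: int) -> int:
    """Custom partitioner: all URLs from one host land in the same partition."""
    hostname = url.split("/")[2]  # crude hostname extraction for the sketch
    return default_partition(hostname, R)

R = 10
# Same hostname -> same reduce partition, regardless of the path.
p1 = hostname_partition("https://example.com/a", R)
p2 = hostname_partition("https://example.com/b", R)
print(p1 == p2)  # True
```
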
<h1 id="heading-example">Example</h1>
<p>Let's take the example of the Character Count. We have an input text file with a bunch of words and we have to calculate the frequency of all the characters appearing in the input.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1696054925752/e9ad81cd-4648-4843-beff-ef692f96a83b.png" alt class="image--center mx-auto" /></p>
<p>So, first, we have the input data being split into M partitions. In our case M = 5. The master will assign 5 map workers to map each of these input partitions using the user-defined map function, which can look something like this:</p>
<pre><code class="lang-python">map(String key, String value):
    // key: document name
    // value: document content
    <span class="hljs-keyword">for</span> each char c <span class="hljs-keyword">in</span> value:
        EmitIntermediate(c, <span class="hljs-string">"1"</span>);
</code></pre>
<p>So, for the text "apple", our map function emits a <code>1</code> for every character it sees; once the duplicate emissions for each key are combined locally (the paper calls this an optional <em>Combiner</em> function), the map stage produces:</p>
<pre><code class="lang-python">a: <span class="hljs-number">1</span>
p: <span class="hljs-number">2</span>
l: <span class="hljs-number">1</span>
e: <span class="hljs-number">1</span>
</code></pre>
<p>Now, the intermediate output will be "shuffled" by our partitioning algorithm. Since we are only considering the lowercase English alphabet, we can set R = 26. That way each output file will contain the count of one letter. Simply put, our partitioning function will group all the <code>a</code>'s together and pass them to one reducer.</p>
<p>So, in our case, the reducer function will get the following input:</p>
<pre><code class="lang-python">a: [<span class="hljs-number">1</span>, <span class="hljs-number">3</span>, <span class="hljs-number">1</span>, <span class="hljs-number">1</span>]
</code></pre>
<p>Since we want the total count, we can have a reducer function like this:</p>
<pre><code class="lang-python">reduce(String key, Iterator values):
    // key: a character
    // values: a list of counts
    int result = <span class="hljs-number">0</span>;
    <span class="hljs-keyword">for</span> each v <span class="hljs-keyword">in</span> values:
        result += v;
    Emit(AsString(result));
</code></pre>
<p>This will give us the final result: <code>a: 6</code>.</p>
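<p>For completeness, here is the whole character-count pipeline simulated in one process (the document names and contents are made up for the illustration):</p>

```python
from collections import defaultdict

def char_map(key, value):
    # key: document name, value: document contents
    return [(c, 1) for c in value if c.isalpha()]

def char_reduce(key, values):
    # Merge all the 1s emitted for one character into a total count.
    return sum(values)

docs = [("d1", "apple"), ("d2", "banana"), ("d3", "avocado")]

# Map stage: one map call per split, then group by character (the shuffle).
grouped = defaultdict(list)
for name, text in docs:
    for c, one in char_map(name, text):
        grouped[c].append(one)

# Reduce stage: one reduce call per intermediate key.
counts = {c: char_reduce(c, vs) for c, vs in sorted(grouped.items())}
print(counts['a'])  # 6  (1 in "apple", 3 in "banana", 2 in "avocado")
```
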
<p>So this was a very basic overview of the MapReduce workflow. Keep in mind that this is not a fully generic parallel-programming paradigm; it only applies to problems that fit the map-reduce shape. But from my work experience, a lot of data-analysis work really does fall into this category. Otherwise, why would Google come up with it in the first place xD.</p>
<p>There are many more technical concepts of MapReduce that I didn't cover in this blog. Do go through the research paper for an even more in-depth analysis. In case you have any doubts, feel free to ping me or put them in the comments.</p>
<hr />
<p>I hope you found my blog useful. If you have any feedback share it in the comments. You can sign up for the Hashnode newsletter to get notified every time I post a blog. Learn more about me at <a target="_blank" href="https://arnabsen.dev/about">arnabsen.dev/about</a>. Have a nice day.</p>
]]></content:encoded></item><item><title><![CDATA[My university experience]]></title><description><![CDATA[Intro

Heads up: I initially wrote this blog at the end of my 3rd year, now that I have successfully graduated I figured why not take a stroll down memory lane and spill the beans on my last year too. Unlike my usual techy blogs, this one's more like...]]></description><link>https://arnabsen.dev/my-university-experience</link><guid isPermaLink="true">https://arnabsen.dev/my-university-experience</guid><category><![CDATA[#universityexperience]]></category><category><![CDATA[BlogsWithCC]]></category><category><![CDATA[#BlogsWithCC on Hashnode]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Mon, 21 Aug 2023 18:30:00 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1700986712386/f4abdf4e-7067-4b4c-a1cd-cbf7e4165b84.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-intro">Intro</h2>
<blockquote>
<p><strong>Heads up:</strong> I initially wrote this blog at the end of my 3rd year, now that I have successfully graduated I figured why not take a stroll down memory lane and spill the beans on my last year too. Unlike my usual techy blogs, this one's more like the adventure of a super curious nerd trying to do it all! 😅</p>
</blockquote>
<p>Hi there 👋</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1697923640250/e4488a4b-d92e-4add-921c-72f468e9dea7.jpeg?auto=compress,format&amp;format=webp" alt /></p>
<p>Let me start with a brief introduction. I am <strong>Arnab Sen</strong>, <s>currently, a </s> <strong><s>final year undergraduate</s></strong> <s>at </s> <strong><s>IIEST Shibpur</s></strong> <s>pursuing my</s> a graduate with a <strong>B.Tech</strong> in Computer Science. In this article, I will share my 4 years of experience as an undergrad exploring various technologies. It includes:</p>
<ol>
<li><p><em>What</em> technologies have I learned? <em>When? Why? How?</em> resources that I referred to.</p>
</li>
<li><p>My journey from <strong>CP</strong> → <strong>CTF</strong> → <strong>Open Source</strong> → <strong>Google Intern</strong> → Multiple Startups → Google (presently).</p>
</li>
<li><p>The various communities I am currently a part of. And many more...</p>
</li>
</ol>
<h2 id="heading-before-joining-college">Before Joining College</h2>
<p>As a kid, I have always been into computers. I would always tinker with the PC that I got in my 5th standard. And when I say <em>"tinker"</em>, I mean breaking stuff by installing pirated games that had a lot of viruses and you know what I mean 🤷‍♂️.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1659713618972/LwE3mNzB2.png" alt="image.png" class="image--center mx-auto" /></p>
<p>So, every month or so my dad had to call this Tech guy who would then "format" my PC and install a pirated Windows 7 and charge my dad 500 Rs. At one point, I got pretty frustrated because my dad would restrict my PC usage and at the same time felt a bit sad that he had to pay 500 Rs every time for my misdeeds (to a kid 500 was a lot of money). So I started looking for a free alternative. That's when I came across Linux for the first time.</p>
<p>I remember installing <strong>Ubuntu 14</strong> by myself, and I felt like I was going to become the next Bill Gates. To be honest, on the first try I ended up wiping the entire hard disk, so it was actually after a bunch of tries that I was able to install it successfully. This sparked my interest in computers, and I ended up taking CS as my stream in college.</p>
<p>I even wrote another blog sharing my very initial days of programming here (but don't jump there directly, first complete this one and maybe then you can decide if it's worth it).</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.dev/my-tech-journey#heading-where-it-all-started">https://arnabsen.dev/my-tech-journey#heading-where-it-all-started</a></div>
<p> </p>
<h2 id="heading-1st-year">1st Year</h2>
<p>By this time I had very basic knowledge about <strong>C++</strong> since we had it on our CBSE Boards. And the first thing that I was exposed to was <strong>Competitive Coding</strong>. I practised my <strong>Data Structures and Algorithms</strong> from:</p>
<ul>
<li><p><a target="_blank" href="https://www.youtube.com/watch?v=HtSuA80QTyo&amp;list=PLUl4u3cNGP61Oq3tWYp6V_F-5jb5L2iHb&amp;ab_channel=MITOpenCourseWare">MIT 6.006 Introduction To Algorithms</a>.</p>
</li>
<li><p><a target="_blank" href="http://www.cs.ucf.edu/~sharma/Algorithms_notes.pdf">CLRS (The Bible of Algorithms and Data Structures)</a> and its <a target="_blank" href="https://sites.math.rutgers.edu/~ajl213/CLRS/CLRS.html">solutions</a>.</p>
</li>
</ul>
<p>For problem-solving, I used to refer to:</p>
<ul>
<li><p><a target="_blank" href="https://www.hackerrank.com/">Hackerrank</a></p>
</li>
<li><p><a target="_blank" href="https://codeforces.com/">Codeforces</a></p>
</li>
<li><p><a target="_blank" href="https://www.codechef.com/">Codechef</a></p>
</li>
<li><p><a target="_blank" href="https://atcoder.jp/">Atcoder</a></p>
</li>
<li><p><a target="_blank" href="https://cses.fi/">CSES</a></p>
</li>
<li><p><a target="_blank" href="https://projecteuler.net/">Project Euler</a></p>
</li>
</ul>
<p>My main motive at that time was to prepare for <a target="_blank" href="https://icpc.global/">ICPC</a> and improve my general coding skills and my knowledge of C++.</p>
<p>By the end of the 1st year, I became pretty good at CP and even reached <strong>Expert</strong> on <a target="_blank" href="https://codeforces.com/profile/arnab1729">Codeforces</a>. But I started losing interest in this field, because it was mainly about solving abstract problems. I wanted to tackle some real-world problems, and came across Cybersecurity.</p>
<h2 id="heading-2nd-year">2nd Year</h2>
<p>This is when COVID happened, I had a lot of free time to explore. With my interest in CP declining, I wanted to know more about computer internals and its fundamental workings. So I started solving CTF Challenges.</p>
<p>Since not many people may know about this, let me explain CTFs. CTF stands for <strong>Capture The Flag</strong>. You will be given a security challenge, it can be an encrypted text or a vulnerable website. You have to bypass the security or decrypt the ciphertext to get a token which is called the <strong>flag</strong>. Finding the flag wins some points and finally, the team/individual with the most points wins. This category of CTF is called <strong>Jeopardy style</strong>.</p>
<p>There is another style of CTF called <strong>Attack-N-Defense</strong>. Here we have two teams: a <em>Red team</em> and a <em>Blue team</em>. The Red team's job is to break into a machine/system by finding loopholes and security flaws, while the Blue team's job is to fix those loopholes and patch those vulnerabilities before the Red team is able to enter the system.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.youtube.com/watch?v=Lus7aNf2xDg&amp;ab_channel=LiveOverflow">https://www.youtube.com/watch?v=Lus7aNf2xDg&amp;ab_channel=LiveOverflow</a></div>
<p> </p>
<p>The reason these CTF challenges intrigued me was that they are based on more real-life scenarios. To solve CTF challenges I had to learn:</p>
<ul>
<li><p>Assembly Language</p>
</li>
<li><p>Reversing</p>
</li>
<li><p>Binary Exploitation</p>
</li>
<li><p>Web Vulnerabilities like CSRF, XSS, SQL Injection</p>
</li>
<li><p>Cryptography like RSA, Symmetric and Asymmetric key encryption</p>
</li>
<li><p>Computer Networking dealing with network packet capture</p>
</li>
<li><p>Steganography, learning how PNGs work, how zip compression works, etc</p>
</li>
<li><p>Writing scripts in bash, Python, C, etc.</p>
</li>
</ul>
<p>My knowledge of computer fundamentals grew and I learned how to use tools like gdb, tcpdump, burpsuite, etc.</p>
<p>With rigorous practice and participating in a bunch of CTF challenges my team (which included me and my friend) ranked <strong>67th in CTFTime</strong> in India.</p>
<p>There is no single one-stop resource for CTFs because it's a huge domain. To even understand the vulnerabilities of a particular technology, you have to be proficient in that technology first. For example, to understand the weaknesses of RSA, one needs to know how RSA works and the significance of the exponent (e) and the primes (p and q).</p>
<p>But these are some of the YouTube channels I followed:</p>
<ul>
<li><p><a target="_blank" href="https://www.youtube.com/c/LiveOverflow">LiveOverflow</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/c/JohnHammond010">John Hammond</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/c/ippsec">ippsec</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/c/GynvaelEN">GynvaelEN</a></p>
</li>
</ul>
<p>However, the best way to learn is to participate in the CTF challenges and then read the writeups or have discussions with other teams after the CTF is over.</p>
<p>By the end of 2nd year, I got into a fellowship program called <a target="_blank" href="https://fullstack.pupilfirst.org/">Pupilfirst Coronasafe Fellowship</a>. I was in the top <strong>24 out of 50k</strong> applicants. So, for the rest of the 2nd year, I was learning technologies like <em>Ruby on Rails</em>, and <strong>Rescript React</strong> and started my open source journey. As a Pupilfirst Coronasafe Fellow, I contributed to two open source health care projects called <a target="_blank" href="https://github.com/coronasafe/arike/commits?author=arnabsen1729">Arike</a> and <a target="_blank" href="https://github.com/coronasafe/life/commits?author=arnabsen1729">Life</a>. Through this experience, I learned about:</p>
<ul>
<li><p>Building and <strong>designing</strong> software, the workflow of the applications, and understanding the user experience</p>
</li>
<li><p><strong>Pair programming</strong>, a cool way to learn and work. <a target="_blank" href="https://martinfowler.com/articles/on-pair-programming.html">Read more</a>.</p>
</li>
<li><p>The importance of <strong>networking</strong> and having 1:1s with industry experts</p>
</li>
<li><p><strong>Open-source</strong> best practices</p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1659719318433/x8kv4ZQyo.png" alt="image.png" class="image--center mx-auto" /></p>
<h2 id="heading-3rd-year">3rd Year</h2>
<p>This is when my internship prep started. I got rejected by Microsoft for both the Engage program and the on-campus drive. It was a bit disheartening.</p>
<p>But then I came across <a target="_blank" href="https://www.summerofbitcoin.org/">Summer of Bitcoin</a>. It was an open-source opportunity, and since I knew open source wouldn't dishearten me, I applied. And guess what? I got selected. I was among the <strong>51 mentees</strong> for the first-ever batch of Summer of Bitcoin. My project was to improve support for <strong>User Statically Defined Tracing</strong> (USDT) in <a target="_blank" href="https://github.com/bitcoin/bitcoin">Bitcoin core</a> by adding new tracepoints and scripts, leveraging <strong>eBPF (extended Berkeley Packet Filter)</strong> technology. But what's more interesting is that during this mentorship, I had to work with:</p>
<ul>
<li><p>C++ (which I did in my <em>1st year for CP</em>)</p>
</li>
<li><p>gdb for debugging (which I did during my <em>2nd year for CTFs</em>)</p>
</li>
<li><p>Github and Open Source practices (which I gained from the <em>coronasafe fellowship experience</em>)</p>
</li>
</ul>
<p>My point is you might not see a direct advantage of what you are learning right now, but knowledge never goes to waste.</p>
<p>There is this very popular story of <strong>Steve Jobs</strong>. Jobs took a calligraphy class at Reed College based on campus posters he saw after dropping out. He knew the class would earn him no credit towards a degree but he still did it anyway. This knowledge of calligraphy, later on, inspired Jobs during his Apple days to include multiple typefaces in Macintosh.</p>
<blockquote>
<p>"If I had never dropped in on that single course in college, the Mac would have never had multiple typefaces or proportionally spaced fonts. And since Windows just copied the Mac, it’s likely that no personal computer would have them.” ~ Steve Jobs</p>
</blockquote>
<p>The takeaway is <strong>to be curious and keep learning</strong>.</p>
<p>I have shared my Summer of Bitcoin journey in detail in my other blog.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.hashnode.dev/my-experience-of-summer-of-bitcoin21">https://arnabsen.hashnode.dev/my-experience-of-summer-of-bitcoin21</a></div>
<p> </p>
<p>A couple of days later, I saw there was an opening at Google for SWE Internship. I asked my senior to refer me and I applied. After a week I got a mail saying that I have 10 days before my interviews. So, I just got back into interview prep. I mostly referred to these resources:</p>
<ul>
<li><p>Leetcode and Leetcode discuss</p>
</li>
<li><p>CLRS</p>
</li>
<li><p>MIT Lectures</p>
</li>
<li><p>Gave a bunch of mock interviews at <a target="_blank" href="https://www.pramp.com/#/">pramp</a>.</p>
</li>
</ul>
<blockquote>
<p>I am not a big fan of practicing from different resources and platforms. I like to keep things simple 😇</p>
</blockquote>
<p>Both of my Google interviews went really well, and after a long wait of 2 months, I heard back that I got the internship. I was on cloud nine because at that point I was starting to get a bit depressed about having no internships even after working really hard.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://media.giphy.com/media/1ThndC5odGuUU/giphy.gif">https://media.giphy.com/media/1ThndC5odGuUU/giphy.gif</a></div>
<p> </p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.dev/landing-a-google-swe-internship">https://arnabsen.dev/landing-a-google-swe-internship</a></div>
<p> </p>
<h2 id="heading-my-first-corporate-internship">My first corporate internship</h2>
<p>Luckily, I got a really interesting and impactful project. It had to be built from scratch. So, a lot of time was spent writing design docs, making sequence diagrams, brainstorming the component workflows, and designing models. I worked for Google Ads.</p>
<p>The main codebase was in <strong>Java</strong> and at Google, we have a lot of private frameworks and technologies for internal usage. But some of my key learnings were:</p>
<ul>
<li><p><strong>Communication</strong> is the key. It doesn't matter if you have a solution in your head if you can't explain it to the other person.</p>
</li>
<li><p><strong>Reach out</strong> for doubts. No one knows everything, so don't shy away from asking doubts. But while asking doubts make sure you have <strong>researched</strong> properly, provide enough <strong>information</strong> regarding the doubts, your thought process, your <strong>approaches</strong>, etc.</p>
</li>
<li><p>Always keep your manager/host <strong>updated</strong> about your work.</p>
</li>
<li><p><strong>Meet new people</strong>, ask them about their project, their work, what they like the most, etc. Getting insights from people who are more experienced than you helps a lot in gaining a new perspective, which is really important as a fresher. The more you network, the more you grow. At Google, we have this Lunch Ninja program where we are paired with a random Googler for a 1:1 discussion. I met so many amazing people that way, from different roles like DevRel, SRE, UI/UX Designer, etc.</p>
</li>
<li><p>Writing <strong>high-quality code</strong>. In most of our personal projects, we don't care much about code quality, but in industry, it's readable, working code that gets merged. So writing code that follows the readability standards and the style guides was a big takeaway.</p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1659720329863/4FAqBGeBy.png" alt="image.png" class="image--center mx-auto" /></p>
<p>I also wanted to add that after the successful completion of my 10-week internship, I even got a pre-placement offer from Google and will be joining them next year 😊.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://twitter.com/ArnabSen1729/status/1551660735440179201">https://twitter.com/ArnabSen1729/status/1551660735440179201</a></div>
<p> </p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.dev/important-tips-on-bagging-ppo-during-internship-and-securing-a-full-time-role">https://arnabsen.dev/important-tips-on-bagging-ppo-during-internship-and-securing-a-full-time-role</a></div>
<p> </p>
<h2 id="heading-4th-year">4th Year</h2>
<p>Having the PPO at hand gave me some confidence to try out even more stuff. Even during my internship my entire team at Google praised me for my blogs. So, my first thought was to take up blogging.</p>
<h3 id="heading-codedamn">Codedamn</h3>
<p>I got the opportunity to write technical content for <a target="_blank" href="https://www.youtube.com/@codedamn">Codedamn</a>. I worked with them for like 2 months and got a lot of articles published. You can find them here: <a target="_blank" href="https://codedamn.com/news/author/arnabsen">https://codedamn.com/news/author/arnabsen</a>.</p>
<p>The issue I had with blogging for someone else was that I had very limited freedom to decide on the topics to research and blog. It went against the very essence of why I started blogging in the first place. So, even after leaving Codedamn, I continued to blog about random stuff on my personal website.</p>
<h3 id="heading-cypherock">Cypherock</h3>
<p>But I still felt the need to do something more challenging, so I started applying for various startups and interviewed a bunch of them. Finally, I decided to start another internship with <a target="_blank" href="https://www.cypherock.com/">Cypherock</a>.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://twitter.com/ArnabSen1729/status/1583496781131763712">https://twitter.com/ArnabSen1729/status/1583496781131763712</a></div>
<p> </p>
<p>In the next 6 months, I learned a lot about firmware and embedded systems development, various cryptographic, hashing, and encryption algorithms and how fast-paced startups work.</p>
<h3 id="heading-xros-fellowship">XROS Fellowship</h3>
<p>Again the curiosity in me sparked and I found another perfect opportunity. Meta, Digital India and FICCI came up with a fellowship opportunity to train young devs in the field of Augmented Reality/Virtual Reality. It was called XROS Fellowship and I applied and was among the 100 devs who were selected for this opportunity. I worked for an org called GMetriXR.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.dev/xros-fellowship">https://arnabsen.dev/xros-fellowship</a></div>
<p> </p>
<p>While doing this I was also working as a Teaching Assistant at Pupilfirst, reviewing the submissions of the students in the cohort, conducting sessions and solving doubts.</p>
<h2 id="heading-outro">Outro</h2>
<p>As you might have seen, I have tried out different stuff throughout my college days and still do. I have also participated in hackathons, held sessions at our GDSC, and facilitated campaigns. I love sharing my knowledge through blogs.</p>
<p>My approach remains the same for everything, <strong>"no matter what you are doing, give your best and try to go the extra mile".</strong></p>
<p>Feel free to connect with me on any of my social handles.</p>
]]></content:encoded></item><item><title><![CDATA[My XROS Fellowship Experience]]></title><description><![CDATA[What does XROS even stand for? It stands for "XR Open Source".
Ok, but what is XR? XR refers to Extended Reality.
Umm, and what's that?....
Understanding "Extended Reality"
Extended Reality (XR) is a term that encompasses the entire spectrum of compu...]]></description><link>https://arnabsen.dev/xros-fellowship</link><guid isPermaLink="true">https://arnabsen.dev/xros-fellowship</guid><category><![CDATA[internships]]></category><category><![CDATA[mentorship]]></category><category><![CDATA[Augmented Reality]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 20 Aug 2023 06:58:24 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1700988047509/9b3457f4-883f-445d-8a1a-e39f29957ae7.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>What does XROS even stand for?</strong> It stands for "XR Open Source".</p>
<p><strong>Ok, but what is XR?</strong> XR refers to Extended Reality.</p>
<p><strong>Umm, and what's that?....</strong></p>
<h2 id="heading-understanding-extended-reality">Understanding "Extended Reality"</h2>
<p>Extended Reality (XR) is a term that encompasses the entire spectrum of computer-generated environments, including both <strong>virtual reality (VR)</strong> and <strong>augmented reality (AR)</strong>, as well as <strong>mixed reality (MR)</strong> and other related technologies. XR combines real and virtual elements to create immersive experiences that go beyond the physical world.</p>
<p><strong>Virtual Reality (VR)</strong> refers to a simulated environment that is completely computer-generated, allowing users to be fully immersed in a digital world. Users typically wear a VR headset that tracks their head movements and provides a visual and auditory experience that feels like being in a different reality.</p>
<p><img src="https://media-cldnry.s-nbcnews.com/image/upload/newscms/2018_11/2362571/180314-virtual-reality-headset-ew-1243p.jpg" alt="This is what Virtual Reality looks like" /></p>
<p><strong>Augmented Reality (AR)</strong> overlays virtual elements onto the real world. AR technology superimposes computer-generated images, videos, or information onto the user's view of the physical environment. This can be experienced through devices like smartphones, tablets, or smart glasses.</p>
<p>You might have heard about AR from the game that went viral (for a lot of reasons 🥲) in 2016. Remember the game? Yes, it is <strong>Pokemon Go</strong>.</p>
<p><img src="https://images.theconversation.com/files/245627/original/file-20181114-194494-1p82jkx.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=1200&amp;h=1200.0&amp;fit=crop" alt="Augmented Reality being implemented in Pokemon Go." /></p>
<p><strong>Mixed Reality (MR)</strong> is a blend of AR and VR. In mixed reality, virtual objects and the real world coexist and interact in real time, enabling users to engage with both simultaneously.</p>
<p>Now that you know what XR means, let's get back to XROS Fellowship.</p>
<h2 id="heading-xros-fellowship">XROS Fellowship</h2>
<p>Here's a snippet from their website, which pretty much covers everything about what XROS Fellowship is.</p>
<blockquote>
<p>XROS Fellowship Program is a uniquely curated initiative aimed at supporting Indian developers working on XR technologies by providing fellowships which will include a stipend and mentoring by industry experts. The Program will support developers to make contributions to open-source projects related to XR technology.</p>
<p>The Program aims to give developers a platform to work with the best resources to create digital public goods and further support their careers by facilitating fellowships in organisations working in the domain of AR, VR, MR, 3D Modelling, etc.</p>
<p>XROS Fellowship will provide a learning cohort for 100 developers and selected developers will work on projects with partner organisations and industry mentors.</p>
</blockquote>
<p>The initiative was taken by <a target="_blank" href="https://ficci.in/"><strong><em>FICCI (Federation of Indian Chambers of Commerce &amp; Industry)</em></strong></a> which is the largest and oldest apex business organization in India and was established in 1927. And the fellowship is further supported by <a target="_blank" href="https://www.meta.com/"><strong><em>Meta</em></strong></a> and implemented by <a target="_blank" href="https://reskilll.com/"><strong><em>Reskilll</em></strong></a><strong><em>.</em></strong></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684659222002/9d258eb8-2891-4356-a481-9d9287068152.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-perks-of-this-fellowship">Perks of this fellowship</h3>
<ul>
<li><p>💼 3 months of practical experience on live industry projects.</p>
</li>
<li><p>🤝 Networking with industry experts.</p>
</li>
<li><p>💰 Fellowship stipends of <strong>₹ 4 Lakhs</strong>.</p>
</li>
</ul>
<h3 id="heading-eligibility-for-this-fellowship">Eligibility for this fellowship</h3>
<ol>
<li><p>The applicant should be <strong>18+ years of age</strong>.</p>
</li>
<li><p>The applicant should be in the pre-final or final year of graduation.</p>
</li>
<li><p>The applicant should be a resident of India.</p>
</li>
</ol>
<h2 id="heading-my-journey">My Journey</h2>
<h3 id="heading-initial-application">Initial Application</h3>
<p>The initial step of the fellowship involved completing a form that requested basic details. This stage aimed to assess whether applicants met the eligibility criteria for the fellowship.</p>
<p>Following this round, I received an email notifying me that my profile had been approved. I was also provided with a link to the Slack channel, where project mentors and other approved candidates could have discussions.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684659676115/0a19f6f4-f2f8-4e79-8ca7-953b4d269fe0.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-proposal-round">Proposal Round</h3>
<p>There were approximately 31 organizations listed. Each organization had its own projects, and candidates had to submit a proposal for the project they wished to work on during the fellowship. A single candidate could submit a maximum of 4 proposals.</p>
<p>I intended to submit 4 proposals, as that would enhance my chances of being selected. So I thoroughly reviewed each organization, did some research on them, and then examined their respective projects.</p>
<p>My approach involved submitting proposals in the following categories:</p>
<ol>
<li><p>One proposal would focus on a project that genuinely piques my interest, regardless of my familiarity with the associated technology stack.</p>
</li>
<li><p>Another proposal would target a project aligned with a technology stack in which I possess considerable proficiency.</p>
</li>
<li><p>An additional proposal would be directed towards a project that showed minimal candidate activity, indicating reduced competition.</p>
</li>
<li><p>Lastly, I would submit a proposal for a project considered highly challenging. This choice would ensure limited competition and thus increase my chances of success.</p>
</li>
</ol>
<p>Subsequently, I prepared a Notion document accordingly and commenced working on my proposals.</p>
<p>Here is a glimpse of the doc:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684660908341/f852178c-6d6a-4aaf-89d6-0c73a6332f59.png" alt class="image--center mx-auto" /></p>
<p>The Reskilll team also organized AMA sessions with the project mentors, which I attended. During these sessions, the mentors addressed all our questions, and they gave me a more comprehensive understanding of the projects and the organizations.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684661127649/af60e66d-28a9-4374-980d-ec0d794c301f.png" alt class="image--center mx-auto" /></p>
<p>So, I finally sat down and wrote 4 proposals. We were given a very basic template to start from; these were the different sections of my proposal:</p>
<ol>
<li><p><strong>Basic Personal Details.</strong></p>
</li>
<li><p><strong>What excited me to work at &lt;org-name&gt;?</strong></p>
</li>
<li><p><strong>Understanding &lt;project-name&gt;.</strong> <em>(I tried to deep dive as much as possible and covered possible approaches. This particular section will be the determining factor)</em></p>
</li>
<li><p><strong>Proposed Timeline.</strong></p>
</li>
<li><p><strong>Past Work Experience.</strong></p>
</li>
<li><p><strong>Previous Open Source Contribution.</strong></p>
</li>
<li><p><strong>Accomplishments.</strong></p>
</li>
<li><p><strong>Projects.</strong></p>
</li>
</ol>
<h3 id="heading-the-interviews">The Interviews</h3>
<p>After the shortlisting of proposals, we needed to choose a time slot for the interviews. I received interview invitations from 2 out of the 4 organizations I applied to. In preparation for the interviews, I thoroughly reviewed the research I conducted to understand the project. Additionally, I dedicated time to practicing technical questions related to the technology stack.</p>
<p>In my view, these preparations are sufficient for the interviews. Both of my interviews went exceptionally well, and I was really excited about the final results.</p>
<h3 id="heading-the-results">The Results</h3>
<p>After a long wait (it wasn't very long, but I was so excited it felt long) I finally got a mail saying that I got into the XROS Fellowship Program for the organization <a target="_blank" href="https://www.gmetri.com/"><strong>GMetriXR</strong></a><strong>.</strong></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684663965635/1f131ae7-d653-489a-a4e6-9984ee4b1d31.png" alt class="image--center mx-auto" /></p>
<p>After the AMA, I found myself genuinely intrigued by the GMetri project and its vision. I devoted significantly more time to reviewing the GMetri proposal than to any other. They say diligent effort yields fruitful results, and in my case, it did.</p>
<blockquote>
<p>I am really proud that I was one of the 100 developers selected out of 10,000 applications.</p>
</blockquote>
<h3 id="heading-the-work-at-gmetri">The work at GMetri</h3>
<p><img src="https://s.vrgmetri.com/gb-web/webflow/editor_resized.webp" alt="Create in minutes, not months" /></p>
<p>GMetri, in simple words, is a toolkit for no-code construction of the metaverse. Recently, we've seen the remarkable influence that Artificial Intelligence (AI) - particularly Large Language Models like GPT - has had on everyday life. Its capacity to compose poems, craft music, and generate images is just phenomenal. My project at GMetri aimed to leverage this prowess of AI to simplify the process of metaverse creation for users.</p>
<p>To generate metaverse scenes effectively, the AI model must have an adequate understanding of the context. We called this knowledge base the "AI Brain". My primary responsibility was designing the frontend experience for creating these AI Brains. Users could upload information in the form of URLs, documents, and even raw text, which would then be ingested into the AI Brain. Once the AI Brain was trained, users could query it and use the results in metaverse scene creation.</p>
<h3 id="heading-final-thoughts">Final Thoughts</h3>
<p>This was my last stint during college, because I graduated after the fellowship period. I learned a lot about AR, VR, and XR through this experience, and seeing how heavily big tech companies like Apple and Meta are investing in this field, we can expect some amazing innovations soon.</p>
<p>Do let me know if this article helped you and follow me for more technical content. Have a nice day.</p>
<hr />
<h2 id="heading-xros-summit">XROS Summit</h2>
<p>Recently, all the selected XROS devs were invited to attend the XROS Summit in New Delhi. It was truly an amazing experience meeting the folks I worked with and networking with others.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1700988487864/ddf85ebb-8828-4994-9323-f60af3b47ef0.png" alt class="image--center mx-auto" /></p>
]]></content:encoded></item><item><title><![CDATA[Breaking Captchas with Golang: Leveraging the Power of 2captcha]]></title><description><![CDATA[Understanding the Role of Captchas in Digital Security
In today's digital landscape, where cyber threats lurk around every virtual corner, safeguarding sensitive information is of paramount importance. One crucial tool that plays a pivotal role in en...]]></description><link>https://arnabsen.dev/breaking-captchas-with-golang-leveraging-the-power-of-2captcha</link><guid isPermaLink="true">https://arnabsen.dev/breaking-captchas-with-golang-leveraging-the-power-of-2captcha</guid><category><![CDATA[golang]]></category><category><![CDATA[technology]]></category><category><![CDATA[Tutorial]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Mon, 29 May 2023 09:30:39 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1684833817754/d4616400-d235-434d-8e17-da289c04f6e0.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-understanding-the-role-of-captchas-in-digital-security">Understanding the Role of Captchas in Digital Security</h2>
<p>In today's digital landscape, where cyber threats lurk around every virtual corner, safeguarding sensitive information is of paramount importance. One crucial tool that plays a pivotal role in enhancing digital security is <strong>captchas</strong>. Captchas, short for "Completely Automated Public Turing tests to tell Computers and Humans Apart," act as a gatekeeper, effectively distinguishing human users from malicious bots. They were invented in 2000 by Luis von Ahn, Manuel Blum, Nicholas Hopper, and John Langford at Carnegie Mellon University. The original CAPTCHAs were text-based and required users to identify distorted words or letters.</p>
<p>These ingenious puzzles serve a lot of purposes. They help prevent a variety of attacks like:</p>
<ul>
<li><p><strong>Spam:</strong> Spam is unsolicited electronic messages, typically sent in bulk. CAPTCHAs can be used to prevent spam bots from creating accounts or sending messages by requiring users to solve a CAPTCHA before they can create an account or send a message.</p>
</li>
<li><p><strong>DDoS attacks:</strong> A DDoS attack is a distributed denial-of-service attack. This type of attack is designed to overwhelm a website or online service with so much traffic that it becomes unavailable to legitimate users. CAPTCHAs can be used to prevent DDoS attacks by making it more difficult for attackers to generate large numbers of requests.</p>
</li>
<li><p><strong>Credential stuffing:</strong> Credential stuffing is a type of attack where attackers try to use stolen login credentials to access multiple accounts. CAPTCHAs can be used to prevent credential stuffing attacks by making it more difficult for attackers to try login credentials on multiple websites.</p>
</li>
<li><p><strong>Phishing:</strong> Phishing is a type of attack where attackers send fraudulent emails that appear to be from a legitimate source. The goal of phishing is to trick the recipient into clicking on a malicious link or providing personal information. CAPTCHAs can be used to prevent phishing attacks by making it more difficult for attackers to create realistic-looking phishing emails.</p>
</li>
<li><p><strong>Click fraud:</strong> Click fraud is a type of attack where automated programs click on ads to generate illegitimate revenue for the attacker. CAPTCHAs can be used to prevent click fraud by making it more difficult for bots to click on ads at scale.</p>
</li>
</ul>
<p>By incorporating captchas, website owners can fortify their defense against various cyber threats, such as spam, fraud, and unauthorized access. The utilization of visual or audio-based challenges, often involving distorted characters or logical tasks, impedes automated programs' ability to infiltrate online platforms.</p>
<h2 id="heading-how-did-captchas-even-come-into-the-picture">How did Captchas even come into the picture?</h2>
<p>Well, this dates back to a paper titled <a target="_blank" href="https://en.wikipedia.org/wiki/Computing_Machinery_and_Intelligence">"Computing Machinery and Intelligence"</a> that was published in 1950 by the famous and legendary computer scientist, mathematician, logician, cryptanalyst, philosopher, and theoretical biologist 🤯.</p>
<p>Are you able to guess who I am talking about?</p>
<p>Well here is a hint 💡: <em>He is famously regarded as the father of modern computer science</em>.</p>
<p><img src="https://upload.wikimedia.org/wikipedia/commons/thumb/a/a1/Alan_Turing_Aged_16.jpg/220px-Alan_Turing_Aged_16.jpg" alt="Alan Turing" class="image--center mx-auto" /></p>
<p>Yes, none other than <strong>Alan Turing</strong>.</p>
<p>He asked a question "Can a computer talk like a human?". This question led to an idea for measuring artificial intelligence that would famously come to be known as the Turing test. Turing proposed the following game. A human judge has a text conversation with unseen players and has to evaluate their responses. To pass the test, a computer must be able to replace one of the players without substantially changing the results. In other words, a computer would be considered intelligent if its conversation couldn't be easily distinguished from a human's.</p>
<p>Believe it or not, this same principle is used today to differentiate between a real human and a bot. You can think of captchas as a simplified, automated Turing Test: every time you solve one, you are essentially passing a mini Turing Test.</p>
<blockquote>
<p>P.S: If you are also a movie buff I would highly encourage you to watch the movie The Imitation Game where Benedict Cumberbatch played the role of Alan Turing.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684776262621/be312263-73c8-4114-89d6-fada37acf8f3.png" alt /></p>
</blockquote>
<p>Now, that you know a brief about how captchas came into the picture, let's learn the different types of captchas.</p>
<h2 id="heading-what-are-the-different-types-of-captchas">What are the different types of captchas?</h2>
<p>You might have come across different types of captchas while surfing the internet. Here are the broad categories:</p>
<ol>
<li><p><strong>Text CAPTCHA:</strong> The oldest and most common type of CAPTCHA, this involves inputting text from a distorted or obscured image. The logic here is that humans can interpret obscured text while bots cannot.</p>
</li>
<li><p><strong>Image CAPTCHA:</strong> This type of CAPTCHA involves identifying specific types of objects or patterns in an image. For instance, a user might be asked to select all squares of a grid containing traffic lights or buses.</p>
</li>
<li><p><strong>Audio CAPTCHA:</strong> For users with visual impairments, audio CAPTCHAs are used. In this case, a short audio clip is played and the user is asked to type the words or numbers that they hear.</p>
</li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684777052166/fa154d16-6121-46a3-9342-acd7f80333e9.png" alt class="image--center mx-auto" /></p>
<p>There are a bunch of service providers available who allow you to use their captcha services on your websites. Some of the most popular ones include:</p>
<ul>
<li><p><a target="_blank" href="https://www.google.com/recaptcha/about/">Google reCAPTCHA</a></p>
</li>
<li><p><a target="_blank" href="https://www.hcaptcha.com/">hCaptcha</a></p>
</li>
<li><p><a target="_blank" href="https://www.geetest.com/en/">GeeTest</a></p>
</li>
<li><p><a target="_blank" href="https://www.keycaptcha.com/">KeyCaptcha</a></p>
</li>
<li><p><a target="_blank" href="https://www.leminnow.com/">Lemin Captcha</a></p>
</li>
</ul>
<p>Now, from a personal point of view, I kind of hate solving these captchas and hunting for the traffic lights.</p>
<p><img src="https://static.boredpanda.com/blog/wp-content/uploads/2020/09/5f6349d7ca79f-png__700.jpg" alt="16 Times People Struggled With These Captchas So Much, They Shared It  Online | Bored Panda" class="image--center mx-auto" /></p>
<p>So, I started looking for ways to bypass these irritating captchas, and I landed upon a goldmine: <a target="_blank" href="https://2captcha.com/">2captcha</a>. Turns out, these captchas weren't as secure as I used to believe. Today we will walk through how I ended up writing a script to bypass them.</p>
<p>But, before that, let's learn about 2Captcha.</p>
<h2 id="heading-what-is-2captcha">What is 2Captcha?</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684779056073/89c0c0ec-4019-4f15-be63-8e43d7e94880.svg" alt class="image--center mx-auto" /></p>
<p>2Captcha operates as a real-time CAPTCHA decoding service, proficient in distinguishing and interpreting captchas with accuracy. It leverages human intellect for image recognition tasks, ensuring high levels of precision. Offering compatibility with a wide range of programming languages through its API, 2Captcha can identify and solve a diverse variety of CAPTCHA types. In fact, 2Captcha can actually solve all the captcha types I mentioned above.</p>
<p>So, without further ado, let's use the Golang module of 2Captcha and build ourselves a CLI tool. You will be amazed to see how easy it is to use this service.</p>
<h2 id="heading-building-the-golang-cli-tool">Building the Golang CLI tool</h2>
<p>So, the first step would be to create an account in 2Captcha and get our API key.</p>
<h3 id="heading-1-creating-our-2captcha-account">1. Creating our 2Captcha account</h3>
<p>Visit their website <a target="_blank" href="https://2captcha.com/">https://2captcha.com/</a> and then sign in with your method of choice. After that, you will land on a page like this:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684782650167/fcb286ab-f53a-46f1-a800-c55d92c2b939.png" alt class="image--center mx-auto" /></p>
<p>Select the left option of "I'm a customer" and then you will be redirected to the dashboard where you can see your API key.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1684782897650/e2ac15f2-f318-426a-bd6a-c947687d74c2.png" alt class="image--center mx-auto" /></p>
<p>Now, before moving ahead with our project, make sure that you have Golang installed on your machine. To check, simply run <code>go version</code> in the terminal; it should print the installed Golang version. If it gives an error, you will need to install Golang first.</p>
<p>Once you are done with all these we can finally initialize our Golang Project.</p>
<h3 id="heading-2-initialize-our-golang-project">2. Initialize our Golang Project</h3>
<p>To get started, let's initialize a new Golang project. Open your terminal or command prompt and follow the steps below:</p>
<ol>
<li><p>Create a new directory for your project:</p>
<pre><code class="lang-bash"> mkdir captcha-solver
 <span class="hljs-built_in">cd</span> captcha-solver
</code></pre>
</li>
<li><p>Initialize the Go module:</p>
<pre><code class="lang-bash"> go mod init github.com/your-username/captcha-solver
</code></pre>
<p> Replace <code>your-username</code> with your actual GitHub username or any other relevant identifier.</p>
</li>
<li><p>Create a new Go source file named <code>main.go</code>:</p>
<pre><code class="lang-bash"> touch main.go
</code></pre>
</li>
</ol>
<p>You have now set up the basic structure for your Golang project.</p>
<h3 id="heading-3-installing-required-modules">3. <strong>Installing Required Modules</strong></h3>
<p>To interact with the 2Captcha service, we need to install the 2captcha-go library. Run the following command to install the required module:</p>
<pre><code class="lang-bash">go get github.com/2captcha/2captcha-go
</code></pre>
<p>The module will be downloaded and added to your project's <code>go.mod</code> file.</p>
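<p>For reference, after this step your <code>go.mod</code> should look roughly like the following. The module path matches what we passed to <code>go mod init</code>; the Go and library version numbers are illustrative and will vary on your machine:</p>

```go
module github.com/your-username/captcha-solver

go 1.20

require github.com/2captcha/2captcha-go v1.1.0 // version is illustrative
```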
<p>Finally, we will write the code for our captcha solver.</p>
<h3 id="heading-4-building-the-captcha-solver">4. <strong>Building the Captcha Solver</strong></h3>
<p>Now let's proceed with building the captcha solver. Open the <code>main.go</code> file in a text editor and add the following code:</p>
<pre><code class="lang-go"><span class="hljs-keyword">package</span> main

<span class="hljs-keyword">import</span> (
    <span class="hljs-string">"flag"</span>
    <span class="hljs-string">"fmt"</span>
    <span class="hljs-string">"io/ioutil"</span>
    <span class="hljs-string">"log"</span>
    <span class="hljs-string">"os"</span>

    <span class="hljs-string">"github.com/2captcha/2captcha-go"</span>
)

<span class="hljs-keyword">const</span> apiKey = <span class="hljs-string">"&lt;API-KEY&gt;"</span> <span class="hljs-comment">// Replace with your actual API key</span>

<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">main</span><span class="hljs-params">()</span></span> {
    <span class="hljs-comment">// Parse input arguments</span>
    captchaImagePath := flag.String(<span class="hljs-string">"image"</span>, <span class="hljs-string">""</span>, <span class="hljs-string">"Path to the captcha image file"</span>)
    flag.Parse()

    <span class="hljs-comment">// Verify that the captcha image path is provided</span>
    <span class="hljs-keyword">if</span> *captchaImagePath == <span class="hljs-string">""</span> {
        fmt.Println(<span class="hljs-string">"Please provide the captcha image path."</span>)
        flag.PrintDefaults()
        os.Exit(<span class="hljs-number">1</span>)
    }

    <span class="hljs-comment">// Check that the captcha image file is readable</span>
    _, err := ioutil.ReadFile(*captchaImagePath)
    <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
        fmt.Printf(<span class="hljs-string">"Failed to read captcha image: %v\n"</span>, err)
        os.Exit(<span class="hljs-number">1</span>)
    }

    <span class="hljs-comment">// Initialize the 2Captcha client</span>
    client := api2captcha.NewClient(apiKey)

    <span class="hljs-comment">// Use the 2Captcha client to solve the captcha</span>
    <span class="hljs-built_in">cap</span> := api2captcha.Normal{
        File: *captchaImagePath,
    }

    captchaText, err := client.Solve(<span class="hljs-built_in">cap</span>.ToRequest())
    <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
        <span class="hljs-keyword">if</span> err == api2captcha.ErrTimeout {
            log.Fatal(<span class="hljs-string">"Timeout"</span>)
        } <span class="hljs-keyword">else</span> <span class="hljs-keyword">if</span> err == api2captcha.ErrApi {
            log.Fatal(<span class="hljs-string">"API error"</span>)
        } <span class="hljs-keyword">else</span> <span class="hljs-keyword">if</span> err == api2captcha.ErrNetwork {
            log.Fatal(<span class="hljs-string">"Network error"</span>)
        } <span class="hljs-keyword">else</span> {
            log.Fatal(err)
        }
    }

    <span class="hljs-comment">// Output the solved captcha text</span>
    fmt.Printf(<span class="hljs-string">"Solved captcha: %s\n"</span>, captchaText)
}
</code></pre>
<p>Now, let's break down the code and understand each section's functionality.</p>
<h4 id="heading-parsing-input-arguments">Parsing Input Arguments</h4>
<pre><code class="lang-go">captchaImagePath := flag.String(<span class="hljs-string">"image"</span>, <span class="hljs-string">""</span>, <span class="hljs-string">"Path to the captcha image file"</span>)
flag.Parse()
</code></pre>
<p>This code uses the <code>flag</code> package to parse input arguments provided when running the program. We define a flag named <code>image</code> that represents the path to the captcha image file. The <code>flag.Parse()</code> function is then called to parse the input arguments.</p>
<h4 id="heading-loading-the-captcha-image">Loading the Captcha Image</h4>
<pre><code class="lang-go">_, err := ioutil.ReadFile(*captchaImagePath)
<span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
    fmt.Printf(<span class="hljs-string">"Failed to read captcha image: %v\n"</span>, err)
    os.Exit(<span class="hljs-number">1</span>)
}
</code></pre>
<p>In this section, we use the <code>ioutil.ReadFile()</code> function to read the captcha image file specified by the user. The contents themselves are discarded; the read simply verifies that the file exists and is readable before we hand the path to the 2Captcha client. If there is an error while reading the file, an error message is printed, and the program exits.</p>
<h4 id="heading-initializing-the-2captcha-client">Initializing the 2Captcha Client</h4>
<pre><code class="lang-go">client := api2captcha.NewClient(apiKey)
</code></pre>
<p>Here, we initialize the 2Captcha client by creating a new instance of <code>api2captcha.Client</code> with the provided API key. Remember to replace the <code>&lt;API-KEY&gt;</code> placeholder with the key shown in your dashboard.</p>
<h4 id="heading-solving-the-captcha">Solving the Captcha</h4>
<pre><code class="lang-go"><span class="hljs-built_in">cap</span> := api2captcha.Normal{
    File: *captchaImagePath,
}

captchaText, err := client.Solve(<span class="hljs-built_in">cap</span>.ToRequest())
</code></pre>
<p>To solve the captcha, we create a <code>Normal</code> struct instance from the <code>api2captcha</code> package and pass the captcha image file path to it. Then, we call the <code>Solve()</code> method of the 2Captcha client, passing the <code>ToRequest()</code> method's result from the <code>cap</code> struct as an argument. This method sends the captcha image to the 2Captcha service for solving and returns the solved captcha text.</p>
<h4 id="heading-outputting-the-solved-captcha-text">Outputting the Solved Captcha Text</h4>
<pre><code class="lang-go">fmt.Printf(<span class="hljs-string">"Solved captcha: %s\n"</span>, captchaText)
</code></pre>
<p>Finally, we output the solved captcha text to the console using <code>fmt.Printf()</code>.</p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Congratulations! You have successfully built a captcha solver using the 2Captcha service in Golang. You can now run the program by executing the following command:</p>
<pre><code class="lang-bash">go run main.go --image /path/to/captcha.png
</code></pre>
<p>Replace <code>/path/to/captcha.png</code> with the actual path to your captcha image file. The program will send the image to the 2Captcha service, solve the captcha, and display the solved text.</p>
<p>Feel free to explore the 2Captcha Go client library documentation (<a target="_blank" href="https://pkg.go.dev/github.com/2captcha/2captcha-go"><strong>https://pkg.go.dev/github.com/2captcha/2captcha-go</strong></a>) for more advanced usage and options.</p>
<p>That's it! You can now integrate this captcha solver into your own projects or applications to automate captcha solving using the 2Captcha service.</p>
]]></content:encoded></item><item><title><![CDATA[A beginner-friendly introduction to Docker and Containers]]></title><description><![CDATA[What is a Container?

You might be familiar with these containers.
These are used for storing stuff and shipping from one place to another. In DevOps, containers have similar applications as well.
A Container is a way to package an application with a...]]></description><link>https://arnabsen.dev/introduction-to-docker-and-containers</link><guid isPermaLink="true">https://arnabsen.dev/introduction-to-docker-and-containers</guid><category><![CDATA[Docker]]></category><category><![CDATA[Devops]]></category><category><![CDATA[Tutorial]]></category><category><![CDATA[Programming Blogs]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sat, 27 May 2023 06:54:26 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1685170401979/979c86af-a961-445a-9c83-d84f1e29a150.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-what-is-a-container">What is a Container?</h2>
<p><img src="https://arnabsen.dev/images/docker-and-containers/cover.jpg" alt="img" /></p>
<p>You might be familiar with these containers.</p>
<p>These are used for storing stuff and shipping it from one place to another. In DevOps, containers serve a similar purpose.</p>
<p><strong>A Container is a way to package an application with all the necessary dependencies and configuration.</strong></p>
<p>That package is easily portable, which makes development and the developer's job easier.</p>
<p><strong>What happens in these containers?</strong> Nothing much: just a set of processes running in isolation on a shared kernel.</p>
<p><strong>How is it isolated?</strong> The isolation of containers is provided by a Linux feature called <em>namespaces</em>. Namespaces partition kernel resources so that one set of processes sees one set of resources while another set of processes sees a different set. They give a group of running processes an isolated view of the kernel. For example:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td>Namespaces</td><td>Description</td></tr>
</thead>
<tbody>
<tr>
<td><code>PID</code></td><td>process IDs</td></tr>
<tr>
<td><code>USER</code></td><td>user and group IDs</td></tr>
<tr>
<td><code>UTS</code></td><td>hostname and domain name</td></tr>
<tr>
<td><code>NS</code></td><td>mount points</td></tr>
<tr>
<td><code>NET</code></td><td>Network devices, stacks, ports</td></tr>
</tbody>
</table>
</div><p>You can use the command <code>lsns</code> to list all the currently accessible namespaces.</p>
<pre><code class="lang-bash">$ lsns
        NS TYPE   NPROCS   PID USER    COMMAND
4026531835 cgroup     85  1571 seth /usr/lib/systemd/systemd --user
4026531836 pid        85  1571 seth /usr/lib/systemd/systemd --user
4026531837 user       80  1571 seth /usr/lib/systemd/systemd --user
4026532601 user        1  6266 seth /usr/lib64/firefox/firefox [...]
4026532928 net         1  7164 seth /usr/lib64/firefox/firefox [...]
</code></pre>
<p>Another feature of the <strong>kernel</strong> is <em>control groups</em> (a.k.a. <code>cgroups</code>), which monitor, limit, account for, and isolate the resource usage of a collection of processes, such as the processes that make up a container.</p>
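<p>As a quick illustration (a sketch, assuming Docker is installed), the resource limits that cgroups enforce can be set per container with flags on <code>docker run</code>:</p>

```bash
# Limit the container to 256 MB of RAM and half a CPU core.
# Docker translates these flags into cgroup settings for the
# container's group of processes.
docker run -d --name limited --memory=256m --cpus=0.5 ubuntu sleep 300

# Verify the memory limit (in bytes) Docker recorded for the container.
docker inspect --format '{{.HostConfig.Memory}}' limited
```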
<h2 id="heading-virtual-machine-vs-containers">Virtual Machine VS Containers</h2>
<p>An operating system has two main layers:</p>
<ol>
<li><p><strong>OS Kernel:</strong> It communicates with the hardware, such as memory, CPU, etc.</p>
</li>
<li><p><strong>Applications Layer:</strong> They run on the Kernel.</p>
</li>
</ol>
<p>Linux is a kernel. There are many Linux distributions each looking different from one another because the applications are different. But under the hood, they use the same kernel i.e. Linux.</p>
<p>Coming back to the difference between VM and Containers:-</p>
<p><img src="https://arnabsen.dev/images/docker-and-containers/containers-vs-virtual-machines.jpg" alt="img" class="image--center mx-auto" /></p>
<p>VMs run on something called a <em>Hypervisor</em>. Each virtual machine includes a full-blown OS and its own processes, which makes VMs heavy and slow to start.</p>
<p>Containers, on the other hand, don't include a full-blown OS, only a set of OS-specific files. They are just processes that share the kernel with other containers, and their isolation is provided by Linux namespaces. Since they run directly on top of the host kernel, they are fast and lightweight. In other words, we get the isolation benefits of a VM without the heaviness that comes with one.</p>
<p>But remember:</p>
<blockquote>
<p>Containers don't replace VMs. Both have their own purpose.</p>
</blockquote>
<h2 id="heading-what-is-docker">What is Docker?</h2>
<p>An early implementation of container technology was added to <strong>FreeBSD</strong> in <strong>2001</strong>. Docker itself debuted to the public at <strong>PyCon in Santa Clara in 2013</strong> and was made open source in <strong>March 2013</strong>. At the time, the tooling for using Linux containers was lacking, and that's where Docker comes into play. Basically, <strong>Docker is tooling to manage containers.</strong></p>
<p><strong>Docker allows developers to package their applications into containers and directly use them in their CI/CD pipeline.</strong></p>
<p>It helps achieve:</p>
<blockquote>
<p>Build once, run everywhere</p>
</blockquote>
<h2 id="heading-why-should-we-at-all-bother">Why should we at all bother?</h2>
<p>There are a number of advantages to using Docker and containerization (no wonder it is so popular):</p>
<ol>
<li>You can't say "But... it works on my machine" anymore. Because we package the application with all the dependencies and configuration it requires, it behaves the same way even when it runs on a different machine.</li>
</ol>
<p><img src="https://arnabsen.dev/images/docker-and-containers/works-on-my-machine.jpeg" alt="Meme" class="image--center mx-auto" /></p>
<ol start="2">
<li><p><strong>They are very lightweight and fast.</strong> We've already discussed this; one thing worth adding is that containers being lightweight is the main reason they are so portable.</p>
</li>
<li><p><strong>Docker has its own ecosystem</strong> provided by the community and has many tools that come with it, which helps solve a lot of issues.</p>
</li>
</ol>
<hr />
<p>Let's get our hands dirty now.</p>
<p>To install docker you can look into the <a target="_blank" href="https://docs.docker.com/">official docs</a>. Else if you want to just try your hands first, then you can use this: <a target="_blank" href="https://labs.play-with-docker.com/">https://labs.play-with-docker.com/</a></p>
<h2 id="heading-practical-1">Practical 1</h2>
<p>Let's go through some popular docker commands. The docker commands are the same for all environments so it won't be an issue.</p>
<h3 id="heading-run-a-container">Run a container</h3>
<p>Run this command</p>
<pre><code class="lang-bash">docker container run -t ubuntu ls
</code></pre>
<p><code>docker container run</code> will run the image that you provided, in this case <code>ubuntu</code>. If you are running this command for the first time, chances are the image has not been downloaded yet, so it will show something like this:</p>
<pre><code class="lang-bash">Unable to find image <span class="hljs-string">'ubuntu:latest'</span> locally
latest: Pulling from library/ubuntu
</code></pre>
<p>If you noticed, here we just mentioned <code>ubuntu</code>, not <code>ubuntu:latest</code>. If we don't specify a version, Docker takes the latest one. But if you wanted to use <code>ubuntu 18.04</code>, you could specify it like this:</p>
<pre><code class="lang-bash">docker container run -t ubuntu:18.04 ls
</code></pre>
<p>It will again show:</p>
<pre><code class="lang-bash">Unable to find image <span class="hljs-string">'ubuntu:18.04'</span> locally
18.04: Pulling from library/ubuntu
</code></pre>
<p>But this time, notice the version. In both cases you will also see output like this:</p>
<pre><code class="lang-bash">bin   dev  home  lib64  mnt  proc  run   srv  tmp  var
boot  etc  lib   media  opt  root  sbin  sys  usr
</code></pre>
<p>Basically, it is the output of the <code>ls</code> command.</p>
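<p>Besides running one-off commands like <code>ls</code>, you can also get an interactive shell inside a fresh container (a quick aside; these are standard Docker flags):</p>

```bash
# -i keeps STDIN open and -t allocates a pseudo-terminal;
# together they give you an interactive bash session.
# Type `exit` to leave; the container stops when bash exits.
docker container run -it ubuntu bash
```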
<p>Now let's run this command</p>
<pre><code class="lang-bash">docker container run -t ubuntu top
</code></pre>
<p>Now open a new shell and continue. If you are using Play with Docker, create a new instance and then ssh into the previous one.</p>
<p><img src="https://arnabsen.dev/images/docker-and-containers/practical-11.png" alt="img" class="image--center mx-auto" /></p>
<p>Now type this command</p>
<pre><code class="lang-bash">$ docker ps
CONTAINER ID        IMAGE               COMMAND             CREATED             STATUS              PORTS               NAMES
6d2a990df65d        ubuntu              <span class="hljs-string">"top"</span>               3 seconds ago       Up 2 seconds                            blissful_austin
</code></pre>
<p>This command shows you the running containers and all the necessary information about them.</p>
<p>Usually, containers are used to run a process, maybe a server or some application, and as soon as the main process exits, the container stops too. So let's create a very simple process.</p>
<p>So run this command</p>
<pre><code class="lang-bash">docker run -d ubuntu sleep 300
</code></pre>
<p>Yeah, this is our process: we ask the container to sleep for 300 seconds.</p>
<p>Every container has a unique container id. So in most of the commands which deal with a particular container, we have to provide the container id. Now let's hop into the <em>sleeping container</em>.</p>
<pre><code class="lang-bash">docker <span class="hljs-built_in">exec</span> -it 6d2a990df65d bash
</code></pre>
<blockquote>
<p><strong>Note:</strong> <code>6d2a990df65d</code> is the docker container id in my case. Just do <code>docker ps</code> look at the container id and paste it there.</p>
</blockquote>
<p>It will spawn a bash terminal inside the container. You can run all the basic commands of Ubuntu.</p>
<p>To get the list of all the containers (even the ones that exited) run</p>
<pre><code class="lang-bash">docker ps -a
</code></pre>
<h3 id="heading-stop-a-container">Stop a container</h3>
<p>To stop the container we need to use <code>docker stop &lt;container-id-1&gt; &lt;container-id-2&gt;</code>.</p>
<pre><code class="lang-bash">docker stop 6d2a990df65d d9da0526d987
</code></pre>
<p>To remove individual containers you can use <code>docker rm &lt;container-id&gt;</code>; to clean up all stopped containers at once, use <code>docker system prune</code>.</p>
<p>There is a nice collection of important docker commands by <a target="_blank" href="https://gist.github.com/garystafford">garystafford</a> <a target="_blank" href="https://gist.github.com/garystafford/f0bd5f696399d4d7df0f">here</a> which I find very helpful.</p>
<h3 id="heading-debugging-container">Debugging container</h3>
<p>Usually, for debugging, we need to look at the logs. For that use the command:</p>
<pre><code class="lang-bash">docker logs &lt;container-id&gt;
</code></pre>
<p>Also, <code>docker exec</code> sometimes helps in the debugging process.</p>
<blockquote>
<p><strong>Remember</strong> that containers use kernel-level features to achieve isolation and that containers run on top of the kernel. Your container is just a group of processes running in isolation on the same host, and you can use the command <code>docker exec</code> to enter that isolation with the bash process. After you run the command <code>docker exec</code>, the group of processes running in isolation (in other words, the container) includes <code>sleep</code> and <code>bash</code>.</p>
</blockquote>
<hr />
<h2 id="heading-where-are-all-these-images-stored">Where are all these images stored?</h2>
<p>Docker maintains a public repository of all the images called <a target="_blank" href="https://hub.docker.com/">Dockerhub</a>.</p>
<p>Also, you can run more than one container simultaneously.</p>
<p>While running containers you can do a lot more stuff, like</p>
<ul>
<li><p><code>--name</code> tag will allow you to name the container.</p>
</li>
<li><p><code>--detach or -d</code> will allow you to run the docker in detached mode i.e. in the background.</p>
</li>
<li><p><code>--publish or -p</code> to publish the ports to the host (very important)</p>
</li>
</ul>
<p>You can find the list of all the options <a target="_blank" href="https://docs.docker.com/engine/reference/commandline/run/#options">here</a>.</p>
<p>Containers are self-contained and isolated, which means you can avoid potential conflicts between containers with different system or runtime dependencies. You can run multiple NGINX containers that all use port <code>80</code> as their default listening port. If you're exposing ports on the host using the <code>--publish</code> flag, the ports selected on the host must be unique. These isolation benefits are possible because of Linux namespaces. Running multiple containers on the same host also lets us fully use the resources (CPU, memory, and so on) available on that host, which can result in huge cost savings for an enterprise.</p>
<p>Although running images directly from the Docker Store can be useful at times, it is more useful to create custom images and refer to official images as the starting point for these images.</p>
<h2 id="heading-docker-images-and-docker-containers">Docker Images and Docker Containers</h2>
<p>Now, if you are confused, let's look into the differences between <strong>Docker Images</strong> and <strong>Docker Containers</strong>. A Docker image is a tar file, or archive, of a filesystem, together with metadata about that filesystem.</p>
<p>A container, on the other hand, is the running instance of an image. It's a process (an isolated process, to be specific). The filesystem of the container is virtual, i.e. it has its own abstraction.</p>
<p>Images are used to create containers (more than one). You can consider it as a blueprint. We can share our Docker Image and then we can create containers using the image. We can also push images to the Docker Hub.</p>
<h2 id="heading-how-to-create-docker-image">How to create Docker Image?</h2>
<p>To create a Docker image, we use a special file called a <code>Dockerfile</code> (no extension), which consists of a list of instructions to build our image. Once we have written the Dockerfile, we pass it to <code>docker build</code>, which builds the image.</p>
<pre><code class="lang-bash">docker build -f Dockerfile .
</code></pre>
<p>A Docker Image is a set of layers where each layer represents an instruction from the Dockerfile. The layers are stacked on top of each other. Each new layer is only a set of differences from the previous one.</p>
<p><img src="https://arnabsen.dev/images/docker-and-containers/image-layer.png" alt="img" class="image--center mx-auto" /></p>
<p>The best part of the image layer is that they are cached. If you change say the 5th line then the docker engine will reuse the first 4 layers and then start building from the 5th line. This improves the time during build and also in the context of CI/CD once the base is pushed, the subsequent pushes will be very fast. To optimize the caching, we need to organize the Dockerfile in a way that the line that will change the most is located at the bottom of the Dockerfile.</p>
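<p>To make the caching idea concrete, here is a sketch of a cache-friendly ordering for a hypothetical Python app (the <code>requirements.txt</code> file is illustrative, not part of this tutorial's project):</p>

```dockerfile
FROM python:3.6.1-alpine
# Dependencies change rarely, so install them first;
# this layer stays cached across most rebuilds.
COPY requirements.txt /requirements.txt
RUN pip install -r /requirements.txt
# Source code changes often, so copy it last;
# an edit to app.py only rebuilds from this layer onwards.
COPY app.py /app.py
CMD ["python", "app.py"]
```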
<p>Let's dive into this.</p>
<h2 id="heading-practical-2">Practical 2</h2>
<h3 id="heading-creating-flask-application">Creating flask application</h3>
<p>We are going to create a simple Flask application.</p>
<ul>
<li><p>Download this Python script <a target="_blank" href="https://gist.github.com/arnabsen1729/1fa19228e4451963bbb64563da98f880">https://gist.github.com/arnabsen1729/1fa19228e4451963bbb64563da98f880</a> and save it as <code>app.py</code>.</p>
</li>
<li><p>Install the package <code>flask</code> by <code>pip3 install flask</code> and run the app <code>python3 app.py</code>.</p>
</li>
<li><p>When the app is running you can visit your <code>0.0.0.0:5000</code>, you will see <code>hello world!</code>.</p>
</li>
</ul>
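<p>For reference, the gist is essentially a minimal Flask app along these lines (a sketch of the idea; see the gist for the exact code):</p>

```python
from flask import Flask

app = Flask(__name__)

@app.route('/')
def hello():
    # The page you see when visiting 0.0.0.0:5000
    return 'hello world!'

if __name__ == '__main__':
    # Bind to 0.0.0.0 so the server is reachable from outside
    # the container later on.
    app.run(host='0.0.0.0', port=5000)
```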
<p>Hence we have set up a basic Flask server. To close this process press <code>Ctrl+C</code>.</p>
<p>Now we will dockerize this flask app.</p>
<h3 id="heading-building-dockerfile">Building Dockerfile</h3>
<p>So create a file <code>Dockerfile</code> and open your text editor and paste this</p>
<pre><code class="lang-dockerfile"><span class="hljs-keyword">FROM</span> python:<span class="hljs-number">3.6</span>.<span class="hljs-number">1</span>-alpine
<span class="hljs-keyword">RUN</span><span class="bash"> pip install flask</span>
<span class="hljs-keyword">CMD</span><span class="bash"> [<span class="hljs-string">"python"</span>,<span class="hljs-string">"app.py"</span>]</span>
<span class="hljs-keyword">COPY</span><span class="bash"> app.py /app.py</span>
</code></pre>
<p>Let's go through this line by line</p>
<h3 id="heading-from-python361-alpine"><code>FROM python:3.6.1-alpine</code></h3>
<p>This is the starting point for your Dockerfile. Every Dockerfile typically starts with a <code>FROM</code> line that is the starting image to build your layers on top of. In this case, you are selecting the <code>python:3.6.1-alpine</code> base layer because it already has the version of Python and pip that you need to run your application. The Alpine version means that it uses the Alpine distribution, which is significantly smaller than an alternative flavour of Linux. A smaller image means it will download (deploy) much faster, and it is also more secure because it has a smaller attack surface.</p>
<blockquote>
<p><em>It is highly recommended to only use official images found in the Docker Hub, or noncommunity images found in the Docker Store.</em></p>
</blockquote>
<h3 id="heading-run-pip-install-flask"><code>RUN pip install flask</code></h3>
<p>The <code>RUN</code> instruction executes commands needed to set up your image for your application, such as installing packages, editing files, or changing file permissions. In this case, you are installing Flask. <code>RUN</code> commands are executed at build time and are added as layers of your image. For a Node application this would typically mean installing all the <code>node_modules</code>; for a larger Python application, you would install its dependencies here as well.</p>
<h3 id="heading-cmd-python-apppy"><code>CMD ["python", "app.py"]</code></h3>
<p><code>CMD</code> is the command that is executed when you start a container. Here, you are using CMD to run your Python application. There can be only one <code>CMD</code> per Dockerfile.</p>
<blockquote>
<p><em>If you specify more than one CMD, then the last CMD will take effect.</em></p>
</blockquote>
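<p>As an aside, <code>CMD</code> comes in two forms; the exec form used here is generally preferred (a brief sketch for comparison):</p>

```dockerfile
# Exec form: runs python directly as the container's main process,
# so it receives signals (e.g. on `docker stop`) directly.
CMD ["python", "app.py"]

# Shell form: runs the command via `/bin/sh -c` instead.
# CMD python app.py
```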
<h3 id="heading-copy-apppy-apppy"><code>COPY app.py /app.py</code></h3>
<p>This line copies the <code>app.py</code> file from the local directory (where you will run the image build) into a new layer of the image. This instruction is deliberately the last line in the Dockerfile. Layers that change frequently, such as copying source code into the image, should be placed near the bottom of the file to take full advantage of the Docker layer cache. This lets you avoid rebuilding layers that could otherwise be cached. Conversely, a change to an early instruction, such as <code>FROM</code>, invalidates the cache for all subsequent layers of the image.</p>
<p>But how can the <code>CMD</code> command run if we copy <code>app.py</code> later? <strong>CMD is the command that is executed when you start a container</strong>; it is not executed until you actually run the container. And since the command to run the application will not change, we placed it higher up.</p>
<p>Here is the list of <a target="_blank" href="https://docs.docker.com/engine/reference/builder/">all commands</a></p>
<h3 id="heading-running-the-container">Running the container</h3>
<p>Now, to build the image from the <code>Dockerfile</code>, run:</p>
<pre><code class="lang-bash">$ docker build -t flask-app . <span class="hljs-comment"># if the Dockerfile is in the current directory</span>
<span class="hljs-comment">#or</span>
$ docker build -t flask-app -f /path/to/Dockerfile .
</code></pre>
<p>It will pull <code>python:3.6.1-alpine</code> and go through the build process. At the end it will output something like this:</p>
<pre><code class="lang-bash">Successfully built 38f35dd1d2a4
Successfully tagged flask-app:latest
</code></pre>
<p>Now run the command <code>docker images</code> and you will see <code>flask-app</code> or whatever name you mentioned in the list. So now you have successfully created the image.</p>
<p>For the final part let's run the container with this image.</p>
<pre><code class="lang-bash">docker run -p5001:5000 -d --name flask-container flask-app
</code></pre>
<p>What are we doing here?</p>
<ul>
<li><p><code>docker run</code> to run the container</p>
</li>
<li><p><code>-p5001:5000</code> to map port 5000 of the container to port 5001 of the host machine</p>
</li>
<li><p><code>--name flask-container</code> giving the name of my container</p>
</li>
<li><p><code>flask-app</code> the image we want to run</p>
</li>
</ul>
<p>Now do <code>docker ps</code> you will see</p>
<pre><code class="lang-bash">CONTAINER ID        IMAGE               COMMAND             CREATED             STATUS              PORTS                    NAMES
940f215ceea0        flask-app           <span class="hljs-string">"python app.py"</span>     2 seconds ago       Up 2 seconds        0.0.0.0:5001-&gt;5000/tcp   flask-container
</code></pre>
<p>Now go to <code>localhost:5001</code>; if everything went well, you will see <code>hello world!</code></p>
<p>To check the logs we can run <code>docker logs</code></p>
<pre><code class="lang-bash">$ docker logs flask-container
 * Serving Flask app <span class="hljs-string">"app"</span> (lazy loading)
 * Environment: production
   WARNING: This is a development server. Do not use it <span class="hljs-keyword">in</span> a production deployment.
   Use a production WSGI server instead.
 * Debug mode: off
 * Running on http://0.0.0.0:5000/ (Press CTRL+C to quit)
172.17.0.1 - - [03/Jan/2021 14:55:58] <span class="hljs-string">"GET / HTTP/1.1"</span> 200 -
172.17.0.1 - - [03/Jan/2021 14:55:59] <span class="hljs-string">"GET / HTTP/1.1"</span> 200 -
</code></pre>
<p>Docker images contain all the dependencies that they need to run an application within the image. This is useful because you no longer need to worry about environment drift (version differences) when you rely on dependencies that are installed on every environment you deploy to. You also don't need to follow more steps to provide these environments. Just one step: install docker, and that's it.</p>
<p>Now if you change <code>app.py</code>, only the last step in the Dockerfile needs to be rebuilt; the rest is already cached.</p>
<p>And you have successfully dockerized your application.</p>
<h2 id="heading-docker-compose">Docker Compose</h2>
<p>Sometimes the docker run command becomes very long and tedious, for example when we need to specify lots of environment variables. Running those long commands every single time is a pain, and if you are working with, say, 3 or 4 containers, you would have to type such long run commands every time you want to start them. So there is a simple way to express the commands in a structured way, save them in a file, and simply run that file.</p>
<p>That file is <code>docker-compose.yml</code>, a YAML file. (<em>Fun fact: YAML stands for 'YAML Ain't Markup Language'. Recursive, huh!</em>)</p>
<p>Writing a <code>docker-compose</code> file is not very difficult, you just need to know how to structure it. This article written by Gabriel Tanner explains it nicely.</p>
<p>Link to the article: <a target="_blank" href="https://gabrieltanner.org/blog/docker-compose">gabrieltanner.org/blog/docker-compose</a></p>
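<p>As a sketch, the <code>docker run</code> command from Practical 2 could be expressed as a compose file like this (the service name and layout are illustrative):</p>

```yaml
version: "3"
services:
  flask-app:
    build: .              # build the image from the Dockerfile here
    container_name: flask-container
    ports:
      - "5001:5000"       # host:container, same as -p5001:5000
```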
<p>If you have reached this far and you have understood Docker, it's also important that you keep the best practices in mind. Here is a good guide for that: <a target="_blank" href="http://spacelift.io/blog/dockerfile#dockerfile-best-practices">spacelift.io/blog/dockerfile#dockerfile-best-practices</a></p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>In this article, we learned about Docker and how to dockerize a simple application. We also learned about Docker Compose and how to use it.</p>
<p>A really nice collection of Docker study material is available here: <a target="_blank" href="https://docker.farhan.info/">Docker Handbook 2021 Edition</a></p>
<p>Hope you liked my article, do follow me on Hashnode and on Twitter (handle: @ArnabSen1729) for updates.</p>
]]></content:encoded></item><item><title><![CDATA[Quick HTML tips: Enhancing UX and Accessibility with `enterkeyhint`.]]></title><description><![CDATA[In the world of web development, creating a delightful user experience (UX) goes hand in hand with ensuring accessibility for all users. Accessibility, often referred to as "a11y" (short for "accessibility" and the 11 letters between the "a" and "y")...]]></description><link>https://arnabsen.dev/quick-html-tips-enterkeyhint</link><guid isPermaLink="true">https://arnabsen.dev/quick-html-tips-enterkeyhint</guid><category><![CDATA[HTML5]]></category><category><![CDATA[Accessibility]]></category><category><![CDATA[tips]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Thu, 25 May 2023 10:04:14 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1685008669874/d33adcde-9325-4492-97fb-c11fe1bb4d10.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In the world of web development, creating a delightful user experience (UX) goes hand in hand with ensuring accessibility for all users. <strong>Accessibility</strong>, often referred to as "<strong>a11y</strong>" (short for "accessibility" and the 11 letters between the "a" and "y"), focuses on making web content usable by individuals with disabilities. While major accessibility considerations involve things like proper semantic structure and alternative text for images, even small details can significantly impact the overall user experience. One such detail is the <code>enterkeyhint</code> attribute in HTML.</p>
<p>While using various apps on your smartphone, you might have noticed that the key usually labeled "enter" sometimes gets replaced with labels like "search", "next", etc. This small detail might seem insignificant, but it gives a lot of clarity about what will happen next. Ever wondered how that is implemented?</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1685000903849/c013279d-539c-4248-b8fc-dbf029da143e.png" alt="3 different keywords showing 3 different actions" class="image--center mx-auto" /></p>
<p>It's through the <code>enterkeyhint</code> attribute in HTML.</p>
<p>The <code>enterkeyhint</code> attribute provides a means for developers to communicate the expected action to the browser when the user presses the Enter key within a form. By utilizing this attribute effectively, developers can enhance both the UX and accessibility of web forms. By guiding the browser to understand the intended action, users can navigate forms more easily, without relying solely on mouse or touch input.</p>
<p>So, let's dive in and explore how this seemingly minor attribute can contribute to a more inclusive and user-friendly web!</p>
<h2 id="heading-what-is-the-enterkeyhint-attribute">What is the <code>enterkeyhint</code> attribute?</h2>
<p>The <code>enterkeyhint</code> attribute is an HTML attribute that can be added to form input elements such as <code>&lt;input&gt;</code> and <code>&lt;textarea&gt;</code>. It is used to suggest to the browser the type of action that should be taken when the user presses the Enter key while the input element has focus.</p>
<h2 id="heading-how-does-enterkeyhint-work">How does <code>enterkeyhint</code> work?</h2>
<p>The <code>enterkeyhint</code> attribute accepts a few predefined values that represent different actions. Here are the possible values:</p>
<ul>
<li><p><code>enter</code>: Indicates that the default action for the Enter key should be performed. This is the default value if the attribute is not specified.</p>
</li>
<li><p><code>done</code>: Suggests that pressing Enter should submit the form or perform the action that signifies completion.</p>
</li>
<li><p><code>go</code>: Suggests that pressing Enter should initiate a "go" operation, such as navigating to a URL or starting a search.</p>
</li>
<li><p><code>next</code>: Indicates that pressing Enter should move the input focus to the next input field or control in the form.</p>
</li>
<li><p><code>previous</code>: Suggests that pressing Enter should move the input focus to the previous input field or control in the form.</p>
</li>
<li><p><code>search</code>: Indicates that pressing Enter should initiate a search operation.</p>
</li>
</ul>
<p>The browser may use this hint to display an appropriate keyboard layout or provide other UI cues to the user.</p>
<h2 id="heading-example-usage">Example usage</h2>
<p>Let's consider a simple example of a login form to demonstrate the usage of the <code>enterkeyhint</code> attribute. We have two input fields: one for the username and another for the password. Here's how we can use the <code>enterkeyhint</code> attribute effectively:</p>
<pre><code class="lang-html"><span class="hljs-tag">&lt;<span class="hljs-name">form</span>&gt;</span>
  <span class="hljs-tag">&lt;<span class="hljs-name">label</span> <span class="hljs-attr">for</span>=<span class="hljs-string">"username"</span>&gt;</span>Username:<span class="hljs-tag">&lt;/<span class="hljs-name">label</span>&gt;</span>
  <span class="hljs-tag">&lt;<span class="hljs-name">input</span> <span class="hljs-attr">type</span>=<span class="hljs-string">"text"</span> <span class="hljs-attr">id</span>=<span class="hljs-string">"username"</span> <span class="hljs-attr">enterkeyhint</span>=<span class="hljs-string">"next"</span>&gt;</span>

  <span class="hljs-tag">&lt;<span class="hljs-name">label</span> <span class="hljs-attr">for</span>=<span class="hljs-string">"password"</span>&gt;</span>Password:<span class="hljs-tag">&lt;/<span class="hljs-name">label</span>&gt;</span>
  <span class="hljs-tag">&lt;<span class="hljs-name">input</span> <span class="hljs-attr">type</span>=<span class="hljs-string">"password"</span> <span class="hljs-attr">id</span>=<span class="hljs-string">"password"</span> <span class="hljs-attr">enterkeyhint</span>=<span class="hljs-string">"done"</span>&gt;</span>

  <span class="hljs-tag">&lt;<span class="hljs-name">input</span> <span class="hljs-attr">type</span>=<span class="hljs-string">"submit"</span> <span class="hljs-attr">value</span>=<span class="hljs-string">"Login"</span>&gt;</span>
<span class="hljs-tag">&lt;/<span class="hljs-name">form</span>&gt;</span>
</code></pre>
<p>In this example, we set the <code>enterkeyhint</code> attribute for the username input field to <code>"next"</code> and for the password input field to <code>"done"</code>. This provides a hint to the browser about the expected action when the user presses the Enter key.</p>
<p>By setting the <code>enterkeyhint</code> to <code>"next"</code> for the username field, we suggest that pressing Enter should move the input focus to the password field. This helps users navigate through the form easily without having to use the mouse or touch input.</p>
<p>For the password field, we set the <code>enterkeyhint</code> to <code>"done"</code>. This indicates that pressing Enter should submit the form or perform the action that signifies completion, which, in this case, is logging in.</p>
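<p>The attribute is also reflected as the <code>enterKeyHint</code> DOM property, so the hint can be changed from JavaScript at runtime, for example once the user reaches the last field (a browser-only sketch, reusing the ids from the form above):</p>

```javascript
// Switch the password field's hint dynamically; equivalent to
// setting enterkeyhint="done" in the markup.
const password = document.getElementById('password');
password.enterKeyHint = 'done';
```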
<h2 id="heading-browser-support">Browser support</h2>
<p>The <code>enterkeyhint</code> attribute is supported by all major browsers.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1685000008048/ccfba274-e20b-4c68-ae16-0dd44c4d76c0.png" alt class="image--center mx-auto" /></p>
<blockquote>
<p>Source: <a target="_blank" href="https://developer.mozilla.org/en-US/docs/Web/HTML/Global_attributes/enterkeyhint#browser_compatibility">developer.mozilla.org/enterkeyhint#browser_compatibility</a></p>
</blockquote>
<p>Here is how all of them look:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1685008949747/31b281f7-33a7-4cf6-a534-f3142db9c14a.gif" alt class="image--center mx-auto" /></p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Hope you learned something new through this blog. Do check out my other blogs and follow me on Twitter at <a target="_blank" href="https://twitter.com/ArnabSen1729">ArnabSen1729</a> for more such interesting updates. You can also subscribe to my Hashnode newsletter to get updates every time I publish a new article.</p>
<p>Have a nice day 😄 👋.</p>
]]></content:encoded></item><item><title><![CDATA[Google I/O 2023 Highlights: Unveiling Google's Latest Innovations and Improvements]]></title><description><![CDATA[In this blog, I will cover all the latest developments happening within Google, as presented at Google I/O. I won't delve deeply into the workings and technology, as that would make the blog excessively long. Instead, I will provide an overview of th...]]></description><link>https://arnabsen.dev/google-io-2023-highlights</link><guid isPermaLink="true">https://arnabsen.dev/google-io-2023-highlights</guid><category><![CDATA[Google]]></category><category><![CDATA[Developer]]></category><category><![CDATA[technology]]></category><category><![CDATA[news]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Fri, 12 May 2023 06:06:28 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1683870146621/6ea2eab7-437f-48cc-b410-d967be935afa.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In this blog, I will cover all the latest developments happening within Google, as presented at Google I/O. I won't delve deeply into the workings and technology, as that would make the blog excessively long. Instead, I will provide an overview of the topics discussed and explain them as concisely as possible.</p>
<blockquote>
<p>I will mainly cover the new software developments in the field of AI and not the hardware and android updates.</p>
</blockquote>
<h2 id="heading-tldr">TL;DR</h2>
<p>Google I/O 2023 showcased AI advancements such as LaMDA, PaLM2, Imagen, and new AI features in Google Search. They also introduced AI integrations in products like Gmail, Google Maps, and Google Photos, along with the launch of Bard, an AI chatbot. Other announcements include AI tools for developers like Vertex AI and Project Tailwind, an AI-first notebook.</p>
<p>Let's learn what Google I/O is all about.</p>
<h2 id="heading-what-is-google-io">What is "Google I/O"?</h2>
<p>Google I/O is an annual developer conference held by Google in Mountain View, California. The name "I/O" is taken from the number <strong>googol</strong>, with the "I" representing the <code>1</code> in googol and the "O" representing the first <code>0</code> in the number.</p>
<blockquote>
<p>A <strong>googol</strong> is the large number <strong>10<sup>100</sup></strong>. In decimal notation, it is written as the digit 1 followed by one hundred zeroes: <strong>10,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000,​000.</strong></p>
</blockquote>
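<p>The digit count is easy to sanity-check in Python:</p>

```python
# A googol is 10^100: the digit 1 followed by one hundred zeros.
googol = 10 ** 100

digits = str(googol)
print(len(digits))        # 101 characters in total: one '1' plus 100 zeros
print(digits.count("0"))  # 100 zeros
```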
<p>The conference also features a number of educational sessions and workshops, where developers can learn about the latest technologies and best practices.</p>
<p>In addition to being a valuable resource for developers, Google I/O is also a great opportunity for Google to connect with its users and get feedback on its products. The conference is open to the public, and there are a number of activities and events that are available to all attendees.</p>
<p><strong>Google I/O 2023</strong> was a two-day event held on <strong>May 11-12, 2023</strong>, at the Shoreline Amphitheatre in <strong>Mountain View, California</strong>. The event was attended by over <strong>50,000</strong> developers from around the world.</p>
<p><strong>Sundar Pichai</strong>, CEO of Google, delivered the keynote, announcing new products and features across Google's various platforms.</p>
<p><img src="https://storage.googleapis.com/gweb-uniblog-publish-prod/images/hero-1.width-1200.format-webp.webp" alt="Sundar Pichai at the front left of a large stage with a colorful I/O logo behind him." /></p>
<h2 id="heading-what-were-my-expectations-from-google-io-2023">💡 What were my expectations from Google I/O 2023?</h2>
<p>We all know that Google has been a leader in the field of artificial intelligence (AI) for decades. In 2017, Google researchers published the revolutionary Transformer architecture, which is now widely used in natural language processing (NLP) tasks such as machine translation, text summarization, and question answering.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683818052782/1af441c4-65bc-4ed7-8517-839633384ae4.png" alt class="image--center mx-auto" /></p>
<blockquote>
<p>You can check out the amazing paper by folks at Google titled "Attention is All You Need" <a target="_blank" href="https://arxiv.org/pdf/1706.03762.pdf">here</a>.</p>
</blockquote>
<p>It was therefore no surprise that Google's annual developer conference, Google I/O 2023, was full of announcements about new AI advancements.</p>
<p>Some of the most notable announcements included:</p>
<ul>
<li><p>The release of <strong>PaLM2</strong>, the successor to the 540-billion-parameter PaLM, an LLM that can generate text, translate languages, write different kinds of creative content, and answer your questions in an informative way.</p>
</li>
<li><p>The release of <strong>Imagen</strong>, a new AI model that can generate realistic images from text descriptions.</p>
</li>
<li><p>The announcement of new AI features for <strong>Google Search</strong>, including the ability to answer your questions in more natural language and to provide more relevant results.</p>
</li>
<li><p>The launch of new <strong>AI tools for developers</strong>, including the Google AI Platform and the Google AI Test Kitchen.</p>
</li>
</ul>
<p>These are just a few of the many AI advancements that were announced at Google I/O 2023. It is clear that Google is committed to leading the way in the field of AI, and these advancements are sure to have a significant impact on the way we live and work.</p>
<p>A significant part of Google I/O focused on integrating Generative AI into their products. So, what exactly is Generative AI?</p>
<h2 id="heading-understanding-generative-ai">🤖 Understanding Generative AI</h2>
<p>Generative AI, as the name suggests, refers to a class of artificial intelligence (AI) models, including Large Language Models (LLMs), that <strong>can create new content</strong>, such as text, images, audio, and video. Generative AI models are trained on large datasets of existing content, and they use this data to learn the patterns and rules that govern how that content is created. Once trained, a generative model can use this knowledge to create new content that is similar to the content it was trained on.</p>
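<p>A toy Markov-chain text generator illustrates the core loop of "learn patterns from data, then sample new content that mimics them." This is only a sketch of the idea; real LLMs use neural networks, not lookup tables, and the corpus here is made up.</p>

```python
import random
from collections import defaultdict

def train(corpus, order=1):
    """Record which word tends to follow which: the 'patterns' in the data."""
    model = defaultdict(list)
    words = corpus.split()
    for i in range(len(words) - order):
        key = tuple(words[i:i + order])
        model[key].append(words[i + order])
    return model

def generate(model, length=8, seed=None):
    """Sample new text that follows the learned word-to-word patterns."""
    rng = random.Random(seed)
    key = rng.choice(list(model.keys()))
    out = list(key)
    for _ in range(length):
        followers = model.get(tuple(out[-len(key):]))
        if not followers:  # dead end: no word ever followed this one
            break
        out.append(rng.choice(followers))
    return " ".join(out)

corpus = "the cat sat on the mat and the cat ran to the hat"
model = train(corpus)
print(generate(model, seed=42))  # new-ish text in the style of the corpus
```

<p>The generated sentence is never a verbatim copy of the training text, yet every transition in it was observed in the data, which is the essence of generative modeling.</p>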
<p>Here is a nice video on the recent updates in Generative AI by Google.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/628ANvH1jH0">https://youtu.be/628ANvH1jH0</a></div>
<p> </p>
<h2 id="heading-ai-integrations-in-google-products">📲 AI integrations in Google products</h2>
<h3 id="heading-gmails-new-help-me-write">📪 Gmail's new "Help me write"</h3>
<p>This AI feature will generate the entire email for you. All you have to do is specify the details of the email in the prompt. It can also pull context from earlier emails in the thread.</p>
<p>This is something I will personally use quite often. Currently, there are some excellent tools in the form of browser extensions built on OpenAI's GPT model that serve the same purpose. However, it appears that this new feature from Google could be detrimental to those tools.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtube.com/clip/Ugkxf9r4MA7Fztdkgzi6MjvJrh_Baz5ze8dt">https://youtube.com/clip/Ugkxf9r4MA7Fztdkgzi6MjvJrh_Baz5ze8dt</a></div>
<p> </p>
<h3 id="heading-immersive-view-for-routes-in-maps">🌍 Immersive View for routes in Maps</h3>
<p>Google Maps' Immersive View is being expanded to allow users to see their entire route in advance, whether they're walking, cycling, or driving. This will allow users to get a better feel for their route and make informed decisions about where to go. Immersive View will begin to roll out over the summer and will be available in 15 cities by the end of the year.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtube.com/clip/UgkxcxMUAvh_dh2v1vf-QCk_W8bl2pf42mBb">https://youtube.com/clip/UgkxcxMUAvh_dh2v1vf-QCk_W8bl2pf42mBb</a></div>
<p> </p>
<h3 id="heading-magic-eraser-for-google-photos">✏️ Magic Eraser for Google Photos</h3>
<p>Magic Eraser is an AI-powered tool that allows users to remove unwanted distractions from photos. Building on it, Magic Editor is a new AI-powered tool that will allow users to do much more with their photos, such as repositioning objects, changing the sky, and more. Magic Editor is expected to roll out later this year.</p>
<p><img src="https://storage.googleapis.com/gweb-uniblog-publish-prod/original_images/I-0_MM_BALLOONS_Updated_Disclaimer.gif" alt="https://blog.google/technology/ai/google-io-2023-keynote-sundar-pichai/#helpful-ai" /></p>
<h2 id="heading-the-second-gen-palm-model-palm-2">✨ The second-gen PaLM model: PaLM 2</h2>
<p>PaLM is a large language model (LLM) from Google AI, but it is not the only LLM Google has developed; others include LaMDA, Meena, and T5. PaLM was the largest and most powerful LLM Google had built to date.</p>
<p>At this I/O, Google announced a more advanced version called PaLM2, a second-generation LLM built on the same architecture as PaLM but trained on a larger dataset of text and code.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/yAANQypgOo8">https://youtu.be/yAANQypgOo8</a></div>
<p> </p>
<p>PaLM2 is a family of models, each with its own strengths and weaknesses. Here are some of the different models in PaLM2:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683825768659/c6df367f-3fd0-4080-b568-506fefe6e31c.png" alt class="image--center mx-auto" /></p>
<ul>
<li><p><strong>Gecko:</strong> Gecko is the smallest model in PaLM2. It is fast and efficient, but it is not as powerful as the other models.</p>
</li>
<li><p><strong>Otter:</strong> Otter is a medium-sized model in PaLM2. It is more powerful than Gecko, but it is not as powerful as the larger models.</p>
</li>
<li><p><strong>Bison:</strong> Bison is a large model in PaLM2. It is more powerful than Otter, but it is not as powerful as the largest model.</p>
</li>
<li><p><strong>Unicorn:</strong> Unicorn is the largest model in PaLM2. It is the most powerful model in PaLM2, but it is also the slowest and least efficient.</p>
</li>
</ul>
<p>PaLM2 is particularly good at:</p>
<ol>
<li><p><strong>Math:</strong> It can solve math problems, such as algebra, calculus, and geometry. It can also generate mathematical proofs.</p>
</li>
<li><p><strong>Coding:</strong> PaLM2 can generate code that is both correct and efficient. It can write code in a variety of programming languages, and translate between them.</p>
</li>
<li><p><strong>Learning &amp; Reasoning:</strong> PaLM2 is able to learn new tasks and concepts quickly. It can be trained on a new task with just a few examples.</p>
</li>
<li><p><strong>Translating languages:</strong> PaLM2 can translate languages with high accuracy. It can translate between any pair of languages that it has been trained on.</p>
</li>
</ol>
<p>PaLM2 models perform even better when fine-tuned for particular applications. Some of my favorite fine-tuned models are Sec-PaLM and Med-PaLM2. Sec-PaLM can explain the behavior of potentially malicious scripts and better detect which scripts are actual threats to people and organizations. Med-PaLM2 can answer questions and summarize insights from a variety of dense medical texts.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683826123107/4007da7c-6efa-4183-bd2e-661f5f1007d5.gif" alt class="image--center mx-auto" /></p>
<p>Google is also working on another next-generation foundation model called <strong>Gemini.</strong> Gemini is being designed to be multimodal, which means that it will be able to process and understand information from a variety of sources, such as text, code, images, and audio. This will make Gemini much more versatile than previous AI models, and it will allow it to be used in a wider range of applications.</p>
<h2 id="heading-bard">💬 Bard</h2>
<p>The part of Google I/O that I was most excited about was the launch of Bard. It was initially rolled out only to the US and UK, but now it's publicly available and I am really loving it.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683827214728/e082c306-431c-4e09-81b2-7c99ab366904.png" alt class="image--center mx-auto" /></p>
<p>A very popular alternative that we all know is ChatGPT. Here are some of the features Bard provides that are not yet available in ChatGPT:</p>
<ul>
<li><p><strong>Information cutoff:</strong> ChatGPT's knowledge cuts off at September 2021, whereas Bard can perform an internet search and respond accordingly. This lets you get the latest information and updates.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683827746292/d2e4940b-6855-4b8e-8635-abab165ed1ee.png" alt class="image--center mx-auto" /></p>
</li>
<li><p><strong>Export options:</strong> Bard allows you to export the response as Docs or a Draft in Gmail. If it detects code then it also gives an option to "Export to Colab" but ChatGPT doesn't have any such functionality yet.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683828156525/9cf966d9-35a8-4a29-b20b-7b2929b6a085.png" alt class="image--center mx-auto" /></p>
</li>
<li><p><strong>Citations:</strong> Bard can also cite sources, something which ChatGPT still can't do.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683828366830/cdd7630b-2f3c-4e03-b122-301aaab30ce8.png" alt class="image--center mx-auto" /></p>
</li>
</ul>
<p>Further, Google promises that in the next few weeks, Bard will become more visual. Bard will also be integrated with Google Lens, so it can analyze photos and then respond according to your prompt.</p>
<h3 id="heading-bard-x-adobe-firefly">🎨 Bard x Adobe Firefly</h3>
<p>This collaboration will allow Bard users to generate images directly from their text descriptions, using Firefly's state-of-the-art technology.</p>
<p>To use this feature, Bard users will simply need to type a description of the image they want to generate into the chatbot. Firefly will then use its AI to create an image that matches the description. Users will be able to edit the image as needed, and they can even share it directly on social media.</p>
<p><img src="https://storage.googleapis.com/gweb-uniblog-publish-prod/original_images/Bard_Demo_03_Image_Generation_v10_.gif" alt="Bard prompt asking to create an image of a unicorn with birthday cake" /></p>
<h2 id="heading-workspaces">💼 Workspaces</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683830425929/8d2cd0b1-6ee8-43b9-9374-05da08c64299.png" alt class="image--center mx-auto" /></p>
<p>Popular tools in Google Workspace, like Docs and Sheets, will let users prompt for exactly what they need: generating job descriptions, creating tables pre-filled with data, even creating images in specific styles, all with a simple prompt directly from these tools. A side panel called "Sidekick" keeps track of the context and surfaces relevant information, making prompting very easy.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683830922641/287d624f-991e-4a05-b60c-5d84c70d7fc6.png" alt class="image--center mx-auto" /></p>
<p>All of this will be generally available to business and consumer Workspace users later this year via a new service called <strong>Duet AI</strong> for Workspace.</p>
<h2 id="heading-generative-ai-in-search">🔍 Generative AI in Search</h2>
<p>A typical Google Search can be quite challenging, to be honest. To obtain very specific information, you must break down the text you are searching for and then skim through the results to find the precise answer.</p>
<p>But with this new integration of Generative AI, you don't have to anymore. Search will automatically extract the information, gather relevant results, and compile them into a single response for you.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683864157787/10c3205b-0081-447a-9c89-73487e6194d5.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-vertex-ai">✳️ Vertex AI</h2>
<p><img src="https://miro.medium.com/v2/resize:fit:549/1*g-YZo7s0j46lDQfMmQ955A.png" alt class="image--center mx-auto" /></p>
<p>Vertex AI is a managed machine learning (ML) platform that helps you build, deploy, and scale ML models. It offers a unified experience for managing the entire ML lifecycle, from data preparation to model training and deployment. Vertex AI also provides a variety of tools and services to help you accelerate your ML projects.</p>
<p>Other well-known companies are already using this functionality in their apps, including:</p>
<ul>
<li><p>replit</p>
</li>
<li><p>Uber</p>
</li>
<li><p>Canva</p>
</li>
<li><p>character.ai</p>
</li>
</ul>
<p>Vertex AI supports three new models, in addition to PaLM2:</p>
<ol>
<li><p><strong>Imagen:</strong> which powers image generation, editing, and customization from text inputs.</p>
</li>
<li><p><strong>Codey:</strong> for code completion and generation; it can be fine-tuned on your codebase to help build applications faster.</p>
</li>
<li><p><strong>Chirp:</strong> universal speech model which brings speech-to-text accuracy for over 300 languages.</p>
</li>
</ol>
<p>All of these features are already in Preview and can be used.</p>
<h2 id="heading-project-tailwind">📔 Project Tailwind</h2>
<p>Project Tailwind is an experimental AI-first notebook that learns from your documents and helps you learn faster. You can upload the docs using Google Drive and that creates a custom fine-tuned model for you to interact with.</p>
<p>It can be used to:</p>
<ul>
<li><p>create study guides</p>
</li>
<li><p>find sources</p>
</li>
<li><p>generate quizzes</p>
</li>
<li><p>run a quick Google search to get more information</p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683868215514/593378d7-0e8c-4930-bbfe-b12fd0129178.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Google I/O 2023 unveiled groundbreaking AI advancements and integrations, showcasing the power of AI to revolutionize the way we live, work, and interact with technology. These innovations promise to make our lives more efficient, connected, and inspired as we move toward a brighter future.</p>
<p>As we embrace the future of AI, let's harness its power to innovate, improve our lives, and create a world where technology is a catalyst for positive change, empowering us to reach new heights and achieve our greatest potential.</p>
<h2 id="heading-references">References</h2>
<ul>
<li><p><a target="_blank" href="https://blog.google/technology/ai/google-io-2023-keynote-sundar-pichai/">Google I/O 2023: Keynote</a></p>
</li>
<li><p><a target="_blank" href="https://blog.google/technology/ai/google-palm-2-ai-large-language-model/">Introducing PaLM 2</a></p>
</li>
</ul>
<hr />
<p>If you found this blog useful, do leave a comment and like this blog, it really motivates me to put forward more such content. You can follow me on Twitter for further updates. My Twitter handle: <a target="_blank" href="https://twitter.com/ArnabSen1729">@ArnabSen1729</a>.</p>
<p>You can find the rest of my social links at <a target="_blank" href="https://arnabsen.dev/links">arnabsen.dev/links</a>. Thank you and have a wonderful day.</p>
]]></content:encoded></item><item><title><![CDATA[Unlocking the Power of the Cloud: A Beginner’s Guide to Google Cloud]]></title><description><![CDATA[As software developers, we are constantly on the lookout for solutions that will allow us to publicize our projects and work for people all over the world to use and benefit from. The thought of managing our own servers and infrastructure, on the oth...]]></description><link>https://arnabsen.dev/unlocking-the-power-of-the-cloud</link><guid isPermaLink="true">https://arnabsen.dev/unlocking-the-power-of-the-cloud</guid><category><![CDATA[Cloud]]></category><category><![CDATA[GCP]]></category><category><![CDATA[Beginner Developers]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Mon, 08 May 2023 01:30:00 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288152481/ea9e4009-8224-4211-9640-867c287a3cbc.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>As software developers, we are constantly on the lookout for solutions that will allow us to publicize our projects and work for people all over the world to use and benefit from. The thought of managing our own servers and infrastructure, on the other hand, can be intimidating and exhausting. Fortunately, companies like Google recognized this need and provided a solution in the form of <strong>Google Cloud Platform (GCP)</strong>. With GCP, we can easily leverage the power of the cloud to bring our projects to the masses.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288095581/74753aa0-90f8-49e7-b1da-1130405d1ff6.png" alt /></p>
<p><strong>Google Cloud is a comprehensive suite of cloud computing services that can assist you in easily building, running, and scaling your applications.</strong> Google Cloud provides a wide range of solutions to help you achieve your goals, from data storage and analysis to machine learning and artificial intelligence.</p>
<p>In this blog, we’ll go over the fundamentals of Google Cloud, such as what it is, how it works, and why it’s such a valuable tool for both businesses and developers. We’ll go over the various Google Cloud services and provide real-world examples of how businesses are using Google Cloud to achieve their goals.</p>
<h3 id="heading-a-little-introduction-about-me">A little introduction about me</h3>
<p>Hi, I am <strong>Arnab Sen</strong>. A Software Developer from India. I was an <strong>SWE Intern</strong> at Google and will be joining them as a Full Time Engineer. I was also a <strong>Google Cloud Facilitator</strong> where I taught 150+ students about Google Cloud and its tools. Feel free to reach out to me on my social handles at <a target="_blank" href="https://www.arnabsen.dev/links">arnabsen.dev/links</a></p>
<h3 id="heading-getting-a-brief-idea">Getting a brief idea</h3>
<p>Before starting out, I urge you to watch this video by Google Cloud Tech once. It will really inspire you to learn about this new domain of technology.</p>
<iframe src="https://www.youtube.com/embed/pF08WX0IMGY?feature=oembed" width="700" height="393"></iframe>

<p>Wow, after watching that video, it’s clear that Google Cloud is the bomb! It’s like a Swiss Army knife for building innovative solutions that can do some seriously cool stuff. Whether you’re trying to build a chatbot that can talk to customers or a machine learning model that can predict the future, Google Cloud has got your back.</p>
<p>One of the things that really stood out to me was how easy it is to use Google Cloud. The video showed how you can set up a machine-learning model with just a few clicks, and I was blown away by how simple it was. With Google Cloud, you don’t have to worry about all the nitty-gritty details of setting up servers and managing infrastructure. <strong>You can focus on building your application and let Google Cloud handle the rest.</strong></p>
<h3 id="heading-what-is-cloud-technology-after-all">What is Cloud Technology after all?</h3>
<blockquote>
<p>Cloud computing is the delivery of different services through the Internet, including data storage, servers, databases, networking, and software.</p>
</blockquote>
<p>Ehh !! That sounds complicated.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288097457/2d48880d-0bcc-4e9b-b70a-bb4e4d24f8cd.gif" alt class="image--center mx-auto" /></p>
<p>So, let’s imagine you have a lot of toys but your room is small and you can’t keep them all. So you approach a friend and ask if you can keep some of your toys in their room. You can still play with your toys whenever you want, but they won’t take up as much space in your room.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288099284/b1a3f339-5d89-4410-b229-281c94eb4012.png" alt class="image--center mx-auto" /></p>
<p><strong>Where do Cloud Platforms come into the picture?</strong></p>
<p>Cloud computing is similar in this regard. You have a lot of computer stuff instead of toys, such as pictures, videos, and documents, but your computer doesn’t have enough storage space to keep everything. So you ask the cloud (which is similar to your friend’s room) if you can store your belongings there.</p>
<h3 id="heading-lets-rewind-a-little-bit">Let’s rewind a little bit</h3>
<p>How do you think Cloud became mainstream? There were 3 stages to this. Let’s go through them one by one.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288101082/23fee0d5-54ad-4e86-912d-7cee3d975fa1.png" alt class="image--center mx-auto" /></p>
<h4 id="heading-stage-1-on-premise"><strong>Stage 1: On-Premise</strong></h4>
<p>Back in the day, let's say you created an e-commerce platform. It had 5 users, you had just one server on your desk, and you managed everything yourself. Everything was going great. Then all of a sudden someone tweets about your platform and now you have thousands of users. But your poor server can't handle that many users, so it starts crashing. So you had to do something called "scaling": buy a bigger, more powerful computer to handle all the users. Can you imagine how tedious that was back in the day?</p>
<h4 id="heading-stage-2-time-sharing"><strong>Stage 2: Time Sharing</strong></h4>
<p>That led to the 2nd stage. Big companies like IBM would buy huge numbers of computers because someday they would definitely need them. But sometimes some of those computers sat idle, unused. So what did they do? They started renting them out to other companies and small startups. That way it was cheaper and more affordable for the startups, and companies like IBM earned passive income, which is awesome.</p>
<h4 id="heading-stage-3-cloud"><strong>Stage 3: Cloud</strong></h4>
<p>This became more and more popular, and renting servers became mainstream for startups. Then bigger companies like Google and Amazon saw an opportunity to earn more by providing additional services along with the raw resources. And this is what we call the Cloud.</p>
<h3 id="heading-career-prospects-in-cloud">Career Prospects in Cloud</h3>
<p>Cloud is a booming technology, with big and small tech companies alike now shifting to cloud-based architectures. This shift has generated demand for skilled cloud engineers who are proficient with cloud technologies and platforms like GCP.</p>
<p>In fact, the Cloud Computing Market is estimated to hit USD <strong>791.48 Billion</strong> by <strong>2028</strong>. So, if you’re looking to build a career in cloud computing, <em>there’s never been a better time to do so.</em></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288103870/914f7aa5-d455-470b-8891-e33b3551bc27.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288105697/d0a3cc43-1368-4a37-8bb0-55997d86a580.png" alt class="image--center mx-auto" /></p>
<p>Salary? Let’s see how much a Cloud architect makes in Bangalore India:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288107346/2836b432-14ad-4e0d-9672-dfce5ecbc415.png" alt class="image--center mx-auto" /></p>
<p>Yes, <strong>21L per annum</strong> 🤑.</p>
<h3 id="heading-googles-philosophy">Google’s philosophy</h3>
<p>Google has this very interesting philosophy:</p>
<blockquote>
<p>“Every Company is a Data Company”</p>
</blockquote>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288108879/6c4566e3-5720-45a2-97af-c1f8ca454379.png" alt class="image--center mx-auto" /></p>
<p>In the future, every company, regardless of its size and industry, will differentiate itself from its competitors through technology. That technology will come in the form of software, and <em>great software always revolves around data</em>. So, every company will in some way be a data company. And the cloud gives you the services to manage the large volume, velocity, and variety of that data.</p>
<p>With platforms like GCP, you can further analyze those data and gain further insights. By leveraging the power of data analytics and machine learning, businesses can gain valuable insights into customer behavior, market trends, and internal operations. <em>This data-driven approach can help companies identify opportunities for growth, improve efficiency, and ultimately drive business success.</em></p>
<h3 id="heading-how-do-people-use-the-cloud">How do people use the cloud?</h3>
<p>Cloud is usually broken down into 3 different umbrellas.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288111018/654615c3-d12e-46d9-97f2-db637f48f8c1.png" alt class="image--center mx-auto" /></p>
<h4 id="heading-infrastructure-as-a-service-iaas">Infrastructure as a Service (IaaS)</h4>
<p>This is basically <strong>Time-Sharing</strong>. In simple terms, it allows users to access computing infrastructure, such as servers, storage, and networking, without having to purchase and maintain physical hardware themselves.</p>
<h4 id="heading-platform-as-a-service-paas">Platform as a Service (PaaS)</h4>
<p>This takes it one step further. Previously, if you needed 10 more servers, you could reach out to an IaaS provider and ask for them, but you would then have to handle the deployment and setup manually. With PaaS, all the provider needs is your application's code. The provider takes care of handling the load, setting up the OS, and everything else, so you only have to focus on the business logic.</p>
<h4 id="heading-software-as-a-service-saas">Software as a Service (SaaS)</h4>
<p>For SaaS, think of software that is directly available to end users, like this blog-hosting platform, which provides a service for publishing and reading blogs.</p>
<p>In the SaaS model, the software provider is responsible for maintaining the application, including security, availability, and performance. Customers typically pay for SaaS on a subscription basis, with fees based on usage or the number of users.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288113883/05c1950d-a2b6-40c6-8787-bd1ef39ececb.png" alt class="image--center mx-auto" /></p>
<p><strong>Where does GCP fall in these categories?</strong> Well with GCP you can get your own servers, storage disks, and everything. But at the same time, it has features that handle them for you. <em>So it covers both IaaS and PaaS domains</em>.</p>
<h3 id="heading-lets-dive-into-some-of-the-gcp-tools">Let’s dive into some of the GCP tools</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288115855/ccbe031b-4fa5-4772-bb49-b3299728d43f.png" alt class="image--center mx-auto" /></p>
<h4 id="heading-compute-engine">Compute Engine</h4>
<p>Compute Engine is a service that allows users to run virtual machines (VMs) on Google's infrastructure. In simple words, it lets you run any code on Google's servers. Say you have a giant script that takes hours or days to run. Instead of running it on your laptop and worrying about your battery dying, you can create a VM instance and run the code there.</p>
<h4 id="heading-cloud-storage">Cloud Storage</h4>
<p>A scalable and highly available object storage service for storing and accessing data on GCP. In other words, Google Cloud Storage is like your very own storage unit in the cloud, where you can keep all your digital “stuff” — like files, photos, videos, and more — safe and sound. But here’s the really cool part — Google Cloud Storage is designed to be super reliable and secure. It’s backed up across multiple servers and data centers, so even if one server goes down, your files will still be safe and accessible. Plus, you can control who has access to your files, so you can make sure only the right people can see and edit them.</p>
<h4 id="heading-cloud-load-balancing">Cloud Load Balancing</h4>
<p>Cloud Load Balancing in GCP is a tool that distributes incoming network traffic across multiple virtual machines (VMs) or managed instance groups in different zones and regions. It balances traffic in real time so that no single instance or zone gets overwhelmed, resulting in faster response times and better reliability.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288117468/e3484408-58be-405c-a677-e3f10cadb0fc.png" alt class="image--center mx-auto" /></p>
<p>In simple terms, let’s say you have an online store that is getting a lot of traffic. Without load balancing, all the traffic would hit a single server, potentially causing it to slow down or crash. But with Cloud Load Balancing, the traffic is distributed evenly across multiple servers, ensuring that the website stays up and running even during peak traffic times.</p>
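<p>The core idea can be illustrated with a toy round-robin scheduler. This is a plain Python sketch, not how Cloud Load Balancing actually works internally (the real service adds health checks, geo-aware routing, and much more), and the server names are made up:</p>
<pre><code class="lang-python">from itertools import cycle

# Hypothetical pool of backend VM instances.
servers = ["vm-asia-south1-a", "vm-asia-south1-b", "vm-asia-south1-c"]

# Round-robin: each incoming request goes to the next server in the pool,
# so no single instance absorbs all the traffic.
next_server = cycle(servers)

def route(request_id: int) -> str:
    """Return the backend that should handle this request."""
    return next(next_server)

# Six requests get spread evenly: two per server.
assignments = [route(i) for i in range(6)]
print(assignments)
</code></pre>
<p>Even this naive strategy keeps the load even when every server is equally capable; the managed service improves on it by also accounting for server health and client location.</p>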
<h4 id="heading-bigquery">BigQuery</h4>
<p>BigQuery allows you to analyze massive amounts of data quickly and efficiently using a fully managed, serverless cloud data warehouse. With BigQuery, you can easily load and query data sets, perform data transformations, and visualize your results using Google Data Studio or other third-party tools. It is highly scalable, secure, and easy to use, making it a popular choice for businesses of all sizes that need to analyze large volumes of data.</p>
<h4 id="heading-iam-and-admin-tools">IAM and Admin Tools</h4>
<p>IAM (Identity and Access Management) enables you to manage access control by defining who (identity) has what type of access (role) to which resource. It provides a centralized view of permissions and access for all cloud resources, including Google Cloud Storage, BigQuery, and Compute Engine. IAM allows you to create and manage service accounts, grant permissions to users, and set up custom roles to meet specific business needs.</p>
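<p>Conceptually, an IAM policy is just a mapping from an identity to a role (a bundle of permissions) on a resource. Here is a heavily simplified Python sketch of that idea; the role and permission names are illustrative stand-ins, not the real GCP role catalog:</p>
<pre><code class="lang-python"># Roles bundle permissions together.
ROLES = {
    "storage.viewer": {"storage.objects.get", "storage.objects.list"},
    "storage.admin": {"storage.objects.get", "storage.objects.list",
                      "storage.objects.create", "storage.objects.delete"},
}

# A policy for one hypothetical bucket: which identity holds which role.
policy = {
    "alice@example.com": "storage.admin",
    "bob@example.com": "storage.viewer",
}

def has_permission(identity: str, permission: str) -> bool:
    """Check whether an identity's role grants a specific permission."""
    role = policy.get(identity)
    return role is not None and permission in ROLES[role]

print(has_permission("bob@example.com", "storage.objects.delete"))    # False
print(has_permission("alice@example.com", "storage.objects.delete"))  # True
</code></pre>
<p>The real system layers this idea across projects, folders, and organizations, but the "who can do what on which resource" question stays the same.</p>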
<p>And there are many more such tools. These tools have been used by companies to drive their software and manage the data that they receive.</p>
<p>Take Spotify for example. Spotify launched a music-streaming service in late 2008, surpassed 1 million customers in early 2011, and today offers 248 million monthly active users in 79 markets access to more than 50 million songs and podcasts. How are they handling this massive user base? Yes, with Google Cloud.</p>
<blockquote>
<p>“Google Cloud removes a lot of the operational complexity from our ecosystem. That frees up time,” said <strong>Tyson Singer</strong>, vice president of technology and platform at Spotify.</p>
<p>“We can iterate quicker on key needs, like data insights and machine learning. Having infrastructure managed for us, with the lower-value details taken away, streamlines our ability to concentrate on what’s important to our users and give them the experiences they know and love about Spotify.”</p>
</blockquote>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288120090/5c2b8684-022b-4c9c-bc73-229505fb1ec1.jpeg" alt /></p>
<p>Here is a great video on the Spotify case study:</p>
<iframe src="https://www.youtube.com/embed/55xgR_o4PGs?feature=oembed" width="700" height="393"></iframe>

<p>The video teaches us a valuable lesson: “Why waste time building something that is already done?” Google is the world’s largest search engine, and it also happens to build the best data centers. And it makes sense; why would a company waste time building a data center when they can simply use a platform like GCP and focus on the business logic and idea they are developing?</p>
<p>Take a quick glance at the scale that Google Cloud works at:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288121963/f31671e7-24a3-4ebb-874c-615c30f6bb47.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-how-can-you-as-a-student-use-gcp">How can you as a student use GCP?</h3>
<p>As we saw in the first video, as a student you can fully leverage the various tools that GCP provides for a variety of applications, like training ML models, starting a server, and many more. Let's get our hands dirty and learn how to use the Google Cloud Console.</p>
<p>Let’s go through it step by step:</p>
<ol>
<li><p>Visit the <a target="_blank" href="https://console.cloud.google.com/home/dashboard">Google Cloud Console</a>, log in, and accept the terms and conditions.</p>
</li>
<li><p>Once you are in, you will see a dashboard that looks something like this:</p>
</li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288124391/63ff235b-bd45-44b5-b3ba-03ef0be05461.png" alt /></p>
<p>Currently, it’s pretty empty but once you start working on your project you will be able to see some stats.</p>
<p>Let’s dig into the Project Details section. Here we have:</p>
<ul>
<li><p><strong>Project Name:</strong> This is to make it easy for you to differentiate between the projects that you have. It is not globally unique.</p>
</li>
<li><p><strong>Project ID and Project Number:</strong> These are in fact globally unique and different for different projects.</p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288126277/c16d1401-c19b-40c5-a859-ea9d29a73e58.png" alt /></p>
<p>Now, let’s build a resource. Let’s create our own Virtual Machine. If you don’t know what a VM is, it’s like a computer inside a computer. It’s a software-based representation of a computer that can run its own operating system and applications just like a physical computer.</p>
<p>So, now click on the hamburger menu at the top-left. Look for <strong>“Compute Engine”</strong> and then click on <strong>“VM Instances”</strong>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288128592/3f03eb4b-62ee-49b7-a9c8-5b15698bdbb7.png" alt /></p>
<p>This will open up another dashboard showing all the instances we have created. In my case, I don't have any, so let's create one by clicking <strong>“Create Instance”</strong>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288131514/8e933034-7968-4f49-8472-3eebf39cddfe.png" alt /></p>
<p>That will open up a form for us where we will have to provide details regarding our VM instance.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288133408/7ebd25ba-4ee2-40fd-9ad6-246562c2674b.png" alt /></p>
<p>First, we have to choose the name of our instance. You can name it anything you want, I will be leaving it as “instance-1”.</p>
<p><img src="https://cdn-images-1.medium.com/max/800/1*0PkK-e2L0rpkovgMkg7y4g.png" alt /></p>
<p>Then we have <strong>Region</strong> and <strong>Zone</strong>. These determine where in the world our instance will be located. We already talked about how Google has its data centers spread across the world; we can leverage that scale and set up our VM anywhere we want. This way, you can place your VM closer to your user base, which reduces latency. I will choose <strong>“asia-south1”</strong> because that is closest to where I am right now.</p>
<p>Scrolling down, you can select the Machine Type that suits your requirements.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288139469/10ae4784-1b02-494c-97b3-02312b27d0d4.png" alt /></p>
<p>I will just go with e2-micro because this is just for demo purposes. Next, in the Boot Disk section, you can choose the type of OS you want your VM to have.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288142030/4fc76730-3771-4268-b618-0e2cbf4e1232.png" alt /></p>
<p>You can skip the rest of it and then go to the bottom and click on “Create”. That will take you back to your dashboard and you can see the status of your VM instance.</p>
<p><img src="https://cdn-images-1.medium.com/max/800/1*D0Z3jaCYuYEAzh-ONQQlcg.png" alt /></p>
<p>Once the setup is complete you will see a ✅ in the status. And then click on SSH.</p>
<p><img src="https://cdn-images-1.medium.com/max/800/1*J6rqHJwIzYm8kDP0FKfynA.png" alt /></p>
<p>It will now open up a console, and you can use the VM like your own machine: set up a server, run some scripts, and so on. I created a Python Hello World program and executed it, as you can see:</p>
<p><img src="https://cdn-images-1.medium.com/max/800/1*S8kPVydaMl79RRnPqfqXxw.png" alt /></p>
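<p>For reference, the program was nothing fancier than the classic one-liner, which you could save and run inside the VM's SSH console:</p>
<pre><code class="lang-python"># hello.py -- run inside the VM with: python3 hello.py
print("Hello, World!")
</code></pre>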
<p>Once you are done with the work, it’s always a good practice to clean up the resources. So, let’s delete our VM instance. Head back to the VM instances dashboard. Click on the 3 dots and choose Delete.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1683288149283/f2b50168-be60-47f9-956d-8d05c38e9d22.png" alt /></p>
<p>It will ask for confirmation and then after some time, the VM instance that was assigned to you will be wiped out.</p>
<h3 id="heading-whats-next">What’s next?</h3>
<p>Now you must be very interested in learning more about Google Cloud. Well, Google has tonnes of great resources for you. Here are some of them:</p>
<ol>
<li><p><a target="_blank" href="https://www.cloudskillsboost.google/journeys">https://www.cloudskillsboost.google/journeys</a> This is a great place to learn Google Cloud. You will have step-by-step milestones to cover and learn about a particular tool.</p>
</li>
<li><p>Take part in events like 30 days of Google Cloud where you also earn cool swags along with learning about Google Cloud.</p>
</li>
<li><p><a target="_blank" href="https://youtu.be/kzKFuHk8ovk?list=PLIivdWyY5sqKh1gDR0WpP9iIOY00IE0xL">https://youtu.be/kzKFuHk8ovk?list=PLIivdWyY5sqKh1gDR0WpP9iIOY00IE0xL</a> A playlist of Google Cloud Essentials.</p>
</li>
</ol>
<h3 id="heading-conclusion">Conclusion</h3>
<p>So, now you have learned what cloud is, what GCP is, and how to use the platform to build resources. In conclusion, the Google Cloud Platform has revolutionized the way we approach computing and data management. With its vast array of tools and services, businesses of all sizes can harness the power of the cloud to scale their operations, improve efficiency, and drive innovation. From startups to global enterprises, GCP offers a world of possibilities, and with the right knowledge and skills, anyone can leverage it to unlock their potential.</p>
<p><strong>So, don’t be afraid to explore, experiment, and learn. Who knows, maybe the next big thing will be built on the cloud, and it could be yours!</strong></p>
]]></content:encoded></item><item><title><![CDATA[$ whoami]]></title><description><![CDATA[Hi, I am Arnab
I currently work as a Software Engineer at Google in the Ads team.
I did my B.tech in 2023 in Computer Science and Engineering from the Indian Institute of Engineering Science and Technology, Shibpur.
I've always had a strong enthusias...]]></description><link>https://arnabsen.dev/about</link><guid isPermaLink="true">https://arnabsen.dev/about</guid><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 30 Apr 2023 18:30:00 GMT</pubDate><content:encoded><![CDATA[<h1 id="heading-hi-i-am-arnab">Hi, I am Arnab</h1>
<p>I currently work as a <strong>Software Engineer</strong> at <strong>Google</strong> in the Ads team.</p>
<p>I did my <strong>B.Tech</strong> in 2023 in <strong>Computer Science and Engineering</strong> from the <strong>Indian Institute of Engineering Science and Technology, Shibpur</strong>.</p>
<p>I've always had a strong enthusiasm for technology, education, and problem-solving. I also enjoy using my blogs and tweets to share my knowledge.</p>
<p>My interest in coding peaked in 11th standard, when I had Computer Science in my curriculum. I wanted to do more than the pattern-printing problems that used to come up in school exams, and that's how I came across <strong>Competitive Programming</strong>. I attempted INOI (Indian National Olympiad in Informatics) in my 12th standard with around 3 months of CP experience, and it was a disaster. But even after that, I continued with CP and reached "Expert" on <a target="_blank" href="https://codeforces.com/">Codeforces</a> by the end of my 1st year.</p>
<p>During the COVID lockdown, I found more interest in <strong>Ethical Hacking</strong> and participated in lots of CTFs with <a class="user-mention" href="https://hashnode.com/@sadn1ck">Anik Das</a> and we were ranked 65th all India according to <a target="_blank" href="https://ctftime.org/">CTFTime</a>.</p>
<p>Then I started delving into <strong>Development</strong> and <strong>Open Source</strong>. I got the Coronasafe Pupilfirst Fellowship. Then I was also selected for the first cohort of Summer of Bitcoin.</p>
<p>In my pre-final year, I interned at Google as a SWE Intern and by the end of the internship, I was able to get a PPO (Pre-Placement Offer).</p>
<p>Even in my final year, I interned at a bunch of different startups:</p>
<ul>
<li><p>Cypherock in the field of web3 and blockchain</p>
</li>
<li><p>GMetriXR in the field of AR/VR</p>
</li>
</ul>
<p>To summarise these are the fields I have worked on:</p>
<ul>
<li><p>Competitive Programming</p>
</li>
<li><p>Ethical Hacking</p>
</li>
<li><p>Open Source</p>
</li>
<li><p>Development</p>
<ul>
<li><p>Web Development</p>
</li>
<li><p>Web3 and Blockchain</p>
</li>
<li><p>Embedded Systems</p>
</li>
<li><p>AR/VR</p>
</li>
</ul>
</li>
</ul>
<p>... and I am still looking forward to learning more.</p>
<h2 id="heading-achievements">Achievements</h2>
<p>Here are some of my achievements (most of them were during my college times):</p>
<ul>
<li><p>Ranked <strong>2nd</strong> position among <strong>22k</strong> participants in the Cybersecurity Hackathon conducted by RISE. Got featured in <a target="_blank" href="https://theprint.in/ani-press-releases/get-set-hack-by-rise-marks-massive-success-with-more-than-22k-participants/1113059/?amp"><strong>ThePrint</strong></a> , <a target="_blank" href="https://www.theweek.in/news/sci-tech/2022/10/18/why-hackathons-are-a-competitive-but-fun-way-of-getting-students-tech-ready-for-the-future.html"><strong>TheWeek</strong></a> , <a target="_blank" href="https://www.news18.com/news/education-career/delhi-boy-wins-cryptography-hackathon-hosted-by-ap-govt-top-5-winners-get-placement-offers-5950933.html"><strong>News18</strong></a> , <a target="_blank" href="https://www.aninews.in/news/business/business/get-set-hack-by-rise-marks-massive-success-with-more-than-22k-participants20220902131355/"><strong>ANI News</strong></a> , <a target="_blank" href="https://www.mid-day.com/brand-media/article/driving-holistic-learning-edtech-startup-rise-enriching-students-through-hackathons-23258081"><strong>Mid-day</strong></a> .</p>
</li>
<li><p>Was among the top <strong>100</strong> candidates selected out of over <strong>10,000</strong> applications and <strong>2,500</strong> project proposals for the <em>XROS Fellowship Program 2023</em>, an initiative of FICCI, supported by <em>Meta</em>, and implemented by <em>Reskilll</em>.</p>
</li>
<li><p>Ranked <strong>98</strong> globally among <strong>136,054</strong> students (from 34 countries) in Round 2 of <strong>Codevita 2022</strong>.</p>
</li>
<li><p>Ranked <strong>25 in India</strong> and <strong>311</strong> globally in <em>Google Hashcode 2021</em>.</p>
</li>
<li><p>Selected for <a target="_blank" href="http://summerofbitcoin.org/"><strong>Summer of Bitcoin 2021</strong></a> . Around <strong>55</strong> were selected from <strong>4800+</strong> applicants.</p>
</li>
<li><p>Ranked <strong>72</strong> among 200+ teams in <strong>ICPC Asia Kanpur-Mathura Regionals 2020</strong>.</p>
</li>
<li><p><strong>Best Hardware Hack</strong> in SnakesNHackers Hackathon by <strong>MLH</strong>. The project link: <a target="_blank" href="https://github.com/arnabsen1729/HackersNLadders"><strong>arnabsen1729/HackersNLadders</strong></a> .</p>
</li>
<li><p>Secured a rank in the <strong>top 24 out of 50,452</strong> applications in <em>CoronaSafe Engineering Fellowship Program by AICTE</em>.</p>
</li>
<li><p><em>Leetcode</em> Rating 2006 (top 2.31%) (handle: <a target="_blank" href="https://leetcode.com/arnabsen1729/"><strong><em>arnabsen1729</em></strong></a> ) solved <strong>800+ problems</strong>.</p>
</li>
<li><p><em>Leetcode</em> ranked <strong>431st out of 15000+</strong> in India.</p>
</li>
<li><p><em>Codeforces</em> Rating <strong>Expert</strong> 1614 (handle: <a target="_blank" href="https://codeforces.com/profile/arnab1729"><strong><em>arnab1729</em></strong></a> )</p>
</li>
<li><p><em>Codechef</em> Div1 4star 1802 (handle: <a target="_blank" href="https://www.codechef.com/users/arnab1729"><strong><em>arnab1729</em></strong></a> )</p>
</li>
<li><p><strong>1003rd out of 17k+</strong> in <em>Credit Suisse Global Coding Contest</em>.</p>
</li>
<li><p><strong>1st Position in CodeRush 2020</strong> by <em>IEEE-MIT</em> out of 70+ teams.</p>
</li>
<li><p>Ranked <strong>103</strong> in <em>rgbCTF2020</em> out of 1034</p>
</li>
<li><p>My CTF Team <strong>0xw3bs3c</strong> was ranked <strong>65th in India</strong> according to <em>CTFTime</em>.</p>
</li>
</ul>
]]></content:encoded></item><item><title><![CDATA[The Evolution of a Tech Enthusiast: My Tech Journey]]></title><description><![CDATA[I've been working in this domain for a while now and intend to continue. One thing that has always fascinated me, is how other people got into this magical world of Tech. And here's my story.
Why did I use the term "magical"? ✨
To me, coding is like ...]]></description><link>https://arnabsen.dev/my-tech-journey</link><guid isPermaLink="true">https://arnabsen.dev/my-tech-journey</guid><category><![CDATA[WeMakeDevs]]></category><category><![CDATA[software development]]></category><category><![CDATA[journey into tech]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 23 Apr 2023 14:36:13 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1682260430948/e145b010-9a4c-4573-b3e0-5c14538ab412.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I've been working in this domain for a while now and intend to continue. One thing that has always fascinated me, is how other people got into this magical world of Tech. And here's my story.</p>
<p><strong>Why did I use the term "magical"?</strong> ✨</p>
<p>To me, coding is like casting a magical spell, through every line of code that brings a digital creation to life.</p>
<p>Believe it or not, a few lines of code propelled humanity to the moon, an unimaginable feat. The Apollo 11 mission that landed humans on the moon in 1969 was powered by a computer with a processor speed of just 1.024 MHz, which is slower than most modern calculators. The computer had only <strong>4</strong> KB of memory, which is less than what we use to store a single photo today. But the code that powered this computer was nothing short of miraculous.</p>
<p><img src="https://media.wired.com/photos/5954741abe605811a2fdd19d/191:100/w_1280,c_limit/001_3-featured.jpg" alt="Her Code Got Humans on the Moon—And Invented Software Itself | WIRED" /></p>
<blockquote>
<p>Let me know in the comments if you know who the lady in the picture above is. Hint: she played a very crucial role in the Apollo 11 mission.</p>
</blockquote>
<h2 id="heading-lets-introduce-ourselves-first">👋 Let's introduce ourselves first</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682258192902/25443f28-56ce-4c13-874b-69585c23e156.png" alt class="image--center mx-auto" /></p>
<p>I am Arnab Sen, a Software Developer from India. As I write this, I have just 1 more month of college left. In these 4 years, I have learned a lot and grown as a software developer. Here are some achievements that I am really proud of:</p>
<ul>
<li><p>Was one of the <strong>24</strong> people selected for the <strong>Coronasafe Fellowship by AICTE</strong> out of <strong>50,000+</strong> applicants and built an open-source healthcare platform.</p>
</li>
<li><p>Selected for the <strong>Summer of Bitcoin</strong> project and spent the summer of 2021 contributing to Bitcoin-Core.</p>
</li>
<li><p>Selected for <strong>SWE Internship at GOOGLE</strong>, and on top of that ended up with a PPO opportunity.</p>
</li>
<li><p>Was selected for the <strong>XROS Fellowship</strong> which is powered by FICCI and supported by Meta.</p>
</li>
<li><p>Interned at multiple remote startups: <strong>Dynopii, Cypherock, GMetriXR</strong>.</p>
</li>
</ul>
<h2 id="heading-where-it-all-started">🏗️ Where it all started</h2>
<p><img src="https://www.drury.edu/wp-content/uploads/2021/04/HemSheela.jpg" alt="Hem Sheela Model School | Elementary | Secondary School" class="image--center mx-auto" /></p>
<p>The first time I did something close to programming was <strong>LOGO</strong> in class 4. For those who aren't aware, LOGO is an educational programming language designed to teach the basic concepts of programming. There is a turtle, and we can give it directions with keywords like <strong>"FORWARD"</strong>, <strong>"RIGHT"</strong>, etc., and it will draw the path.</p>
<p><img src="https://upload.wikimedia.org/wikipedia/commons/thumb/7/76/IBM_LCSI_Logo_Circles.png/640px-IBM_LCSI_Logo_Circles.png" alt class="image--center mx-auto" /></p>
<p>As a kid, I always loved creating different kinds of patterns in LOGO. Then in class 5, we had QBASIC in our syllabus. That sparked a little fire in me: maybe I should pursue Software Development in the future, because I had uncovered the magic beneath.</p>
<p><img src="https://upload.wikimedia.org/wikipedia/en/0/01/QBasic_Opening_Screen.png" alt="QBasic - Wikipedia" class="image--center mx-auto" /></p>
<p>Luckily, I had internet access (the max speed was 500 kbps, but that was still a lot in those days). So I started learning a few other programming languages. My dad's colleague used to give computer science tuition to 11th and 12th standard students. He asked me if I was interested in joining his classes, and I started doing so in class 6.</p>
<p>By the time I was in class 8, I already had strong fundamentals in C and C++. But after class 8, I joined an IIT-JEE coaching center, and just like everyone else, I had to leave behind my creative knacks to run in this rat race. With the JEE preparation pressure, I lost all interest in programming 🥺. That should have been the end, but...</p>
<p>In 11th standard, I had computer science in my course again, and that rekindled the fire in me. During that time I also watched movies like <strong>The Social Network</strong>, <strong>Jobs</strong>, <strong>Snowden</strong>, and many more, and I became more passionate about coding than ever.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682233545647/7ec729de-202e-4a98-8f36-38f51955cc05.png" alt class="image--center mx-auto" /></p>
<p>I came across an olympiad competition called the <strong>IOI (International Olympiad in Informatics)</strong>; it's like the <strong>IMO (International Mathematical Olympiad)</strong>, but for programming.</p>
<p><img src="https://ioinformatics.org/icons/ioi-logo-2020.png" alt="International Olympiad in Informatics (IOI)" class="image--center mx-auto" /></p>
<p>To be eligible for IOI, one has to qualify for <strong>INOI (Indian National Olympiad in Informatics)</strong>, and to get into that, one has to qualify for either <strong>ZIO (Zonal Informatics Olympiad)</strong> or <strong>ZCO (Zonal Computing Olympiad)</strong>. But when I was in class 11, I had already missed the ZIO/ZCO registration date. So I prepared for the class 12 ZIO and ZCO, qualified for them, and then went to INOI, but failed miserably. I left the exam center an hour early because I was unable to solve any of the questions. But that didn't stop me, because I knew it was just the beginning.</p>
<p>So far, I had done programming as a hobby or whenever I had a break. But after JEE I opted for CS at IIEST Shibpur, and I was finally able to give my full attention to programming.</p>
<h2 id="heading-college-life">🎓 College Life</h2>
<p>In the initial few days, I was really interested in building small games. Here are some of them.</p>
<p>I built a small TicTacToe game in Python 3 with one of my hostel mates.</p>
<blockquote>
<p>Here is a fun fact: the logic behind Computer Moves is just a bunch of if-else statements 🤣🤣🤣🤣</p>
</blockquote>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682236912863/5d64cb77-b2a3-4701-8728-fe41e9c1100f.gif" alt class="image--center mx-auto" /></p>
<p>Here is another beautiful yet very simple project that I built with plain HTML, CSS, and JS. It's based on Chaos Theory and shows how pure randomness can generate beautiful patterns.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682237143983/a1177d85-4f2d-4c3d-9be6-9db0587965c2.gif" alt class="image--center mx-auto" /></p>
<p>But as time passed, I started to focus on building things that could be useful to some extent, like a Discord bot for IPL scores, a simple website-to-PDF converter, and a Markdown-to-PDF generator.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682238210809/10fde85a-0738-406b-998d-de4bab36cac0.png" alt class="image--center mx-auto" /></p>
<p>For learning, I never really paid for any courses. There are so many great folks online who share their knowledge and are such amazing teachers. For learning web development especially I followed these channels and I feel they helped me strengthen my fundamentals:</p>
<ul>
<li><p><a target="_blank" href="https://www.youtube.com/@freecodecamp">https://www.youtube.com/@freecodecamp</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/@TraversyMedia">https://www.youtube.com/@TraversyMedia</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/@akshaymarch7">https://www.youtube.com/@akshaymarch7</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/@WesBos">https://www.youtube.com/@WesBos</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/@academind">https://www.youtube.com/@academind</a></p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682406093916/87906cb7-430b-41e1-a77d-3a1e52ea7c20.png" alt class="image--center mx-auto" /></p>
<p>For theoretical topics related to computer science like Data Structures and Algorithms, Operating Systems, and Database Management Systems, I always preferred the MIT OpenCourseWare courses. I think they are goldmines for anyone looking to dive deep into these topics.</p>
<p><img src="https://i.pinimg.com/originals/6c/b3/51/6cb351d785d6786158ec59ec63c0988f.png" alt="MIT OpenCourseWare (@MITOCW) | Twitter | Free courses, Learning and  development, Science and nature" class="image--center mx-auto" /></p>
<h2 id="heading-met-open-source">🤍 Met Open Source</h2>
<p>When I was in my 2nd year of college, I came across open-source software and realized that through OSS I could make more impact with my knowledge. I was a part of the Coronasafe Fellowship where I got to work on multiple open-source healthcare projects.</p>
<p>One of the projects is <a target="_blank" href="https://github.com/coronasafe/arike">Arike</a>, a software system for palliative care. The other project is called <a target="_blank" href="https://github.com/coronasafe/life">Life</a>, which acted as a verified, crowd-sourced directory of emergency services during the peak of the COVID second wave. I am grateful to have been a part of such a great cause and to use my development skills to help people.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682240201508/e3157fdc-694d-4d93-a55c-4038a4cb8b7d.png" alt class="image--center mx-auto" /></p>
<p>My other contributions can be found on my <a target="_blank" href="https://github.com/arnabsen1729">GitHub</a> profile. Through contributing to open source, I have been able to gain valuable experience in real-world software development problems, collaborate with other talented developers, and learn new programming languages, tools, and frameworks. Contributing to open source has also allowed me to receive feedback from other developers and improve my ability to communicate my ideas effectively.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682248351081/00630606-b825-4521-85be-d3994537374b.png" alt class="image--center mx-auto" /></p>
<p>Open Source went on to change my career and brought me even better opportunities in the future.</p>
<p>Even though everything seems smooth, like every other thing in life there are always ups and downs.</p>
<p><img src="https://live.staticflickr.com/8642/30524599555_bc86ec20e0_b.jpg" alt="Maysix_failure success path | Failure success paths | Flickr" /></p>
<p>If you are also interested in pursuing a career in software development, here are a few things to keep in mind that I learned from my personal experience:</p>
<h3 id="heading-start-with-the-basics">📚 Start with the basics</h3>
<p>Before you jump into coding and building cool stuff, it's important to have a solid foundation of the basics. So, what are the basics? Well, at a minimum, you should have a solid understanding of at least one programming language. There are many languages to choose from, but some of the most popular ones include Python, Java, C++, and JavaScript. Each language has its own strengths and weaknesses, and choosing the right one for you will depend on your personal preferences and the type of projects you want to work on.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682248965290/4f0823dc-fa7d-49c4-bb2e-1a24a6d3d94c.png" alt class="image--center mx-auto" /></p>
<p>Also, it's worth noting that learning the basics isn't a one-and-done task. As you progress in your career, you'll continue to build on your knowledge of programming languages, data structures, and algorithms. The key is to start with a strong foundation and continue to learn and grow from there.</p>
<h3 id="heading-keep-learning">🧠 Keep Learning</h3>
<p>The field is constantly evolving, and there are always new technologies and tools being developed. As a result, it's crucial to stay up-to-date with the latest trends and developments to remain competitive and relevant.</p>
<p>Reading industry blogs and publications is a great way to stay informed about the latest trends and developments. Many tech publications offer daily or weekly newsletters that summarize the latest news and events in the industry. Following these publications can help you keep up with advancements in the field and identify emerging trends that may be relevant to your work. If you prefer learning from videos, my suggestion would be to start watching conference talks.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682406258408/bf3e3ff0-4298-44c7-9f04-fc5baf1195a4.png" alt class="image--center mx-auto" /></p>
<p>I also love to watch documentaries like the ones by <a target="_blank" href="https://www.youtube.com/playlist?list=PLtEPUaeDclku1ECmuN3IsUimHApukWIOf">Honeypot.io</a>. The amount of insight packed into these 30-minute Netflix-style documentaries is just crazy, and trust me, you will never get bored.</p>
<p><img src="https://pbs.twimg.com/media/FgSoltjVEAAoh2-?format=jpg&amp;name=large" alt="Image" /></p>
<h3 id="heading-embrace-challenges">🏔️ Embrace Challenges</h3>
<p>Challenges are a natural part of any career, but in the tech industry, they can be particularly prevalent due to the constantly evolving nature of technology. However, it's important to embrace these challenges as opportunities for growth and development.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682253092815/2ea39282-06fb-4438-a962-04948add3802.png" alt class="image--center mx-auto" /></p>
<p>One way to embrace challenges is to take on projects that push you outside of your comfort zone. By taking on projects that are unfamiliar or challenging, you'll be forced to learn new skills and adapt to new situations, which can help you grow both personally and professionally.</p>
<h3 id="heading-dealing-with-imposter-syndrome">🎭 Dealing w/ Imposter Syndrome</h3>
<p>This is a very very prevalent topic, especially in the software development domain.</p>
<p><img src="https://softwareengineeringdaily.com/wp-content/uploads/2019/09/what-is-imposter-syndrome-1024x506.jpg" alt="Finding the Benefits of Imposter Syndrome - Software Engineering Daily" /></p>
<p>Imposter syndrome is a nasty little bugger that can creep up on anyone in the tech industry. It's that feeling that you're not qualified or good enough for your job, even when all evidence points to the contrary. But here's the thing: you're not alone. Lots of successful people in the tech industry have felt imposter syndrome at some point in their careers. It's a pretty common experience.</p>
<p>So, the next time you start feeling like a fraud, remember that you're not alone, and remind yourself of all the awesome things you've accomplished. Seek out a mentor or coach for guidance, and connect with others who have gone through the same experience. With these strategies, you can kick imposter syndrome to the curb and achieve success in the exciting and dynamic world of tech!</p>
<p>Here is a great video by Ted-Ed on Imposter Syndrome:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/ZQUxL4Jm1Lo">https://youtu.be/ZQUxL4Jm1Lo</a></div>
<p> </p>
<h3 id="heading-share-your-learnings">🤝 Share your learnings</h3>
<p>Sharing knowledge as a software developer is one of the most rewarding things you can do! Not only does it help you solidify your own understanding of a topic, but it also benefits the wider community and can even lead to new career opportunities.</p>
<p>Blogging is a great way to share your knowledge and experiences with others. By writing about your insights, tips, tricks, or lessons learned on a particular topic, you can help others who are facing similar challenges or looking to learn new skills.<br />One of my blogs where I shared my resources and how I prepared to land a job at Google got featured on Hashnode and has around 1k views (even Hashnode's cofounder commented on the blog).</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1682257077749/faaca0ee-2288-429e-b040-f3a40bd2b479.png" alt class="image--center mx-auto" /></p>
<p>Link to the blog 👇</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.hashnode.dev/landing-a-google-swe-internship">https://arnabsen.hashnode.dev/landing-a-google-swe-internship</a></div>
<p> </p>
<p>Plus, blogging can help you establish yourself as an authority in your field and even lead to opportunities for speaking engagements or consulting gigs. My blogs on Bitcoin helped me get a web3-based internship.</p>
<p>Speaking at conferences or community events is another fantastic way to share your knowledge and connect with others in the industry. Not only do you get the chance to present your ideas and insights to a live audience, but you also get to network with other professionals and learn from their experiences as well. Plus, speaking engagements can be a great addition to your resume and help you stand out when you're looking for new job opportunities.</p>
<p><img src="https://miro.medium.com/v2/resize:fit:940/1*s05HQAVGGXH4VfnA9vwJnQ.jpeg" alt="How to tackle public speaking as a software developer? | by Jakub Kapuscik  | Level Up Coding" /></p>
<h2 id="heading-the-end">🎬 The End</h2>
<p>As I reflect on my tech journey, I realize that it's been a rollercoaster ride with its fair share of ups and downs. But it's also been a journey filled with wonder, innovation, and creativity. I've learned that being a software developer isn't just about writing code. It's about problem-solving, collaboration, and being passionate about making a difference in the world.</p>
<p>If you're just starting your own tech journey, remember that it's never too late to start. Whether you're fresh out of school or a seasoned professional, there's always something new to learn. Be open-minded, stay curious, and never stop exploring. And always remember to keep your eye on the end goal: creating something that can change the world.</p>
<p>And always remember "YOU GOT THIS !!"</p>
<p><img src="https://upload.wikimedia.org/wikipedia/en/f/ff/SuccessKid.jpg" alt="Success Kid - Wikipedia" class="image--center mx-auto" /></p>
]]></content:encoded></item><item><title><![CDATA[Landing a Google SWE Internship: Interview Experience and preparation journey]]></title><description><![CDATA[Even though, there is a plethora of articles on Google SWE Internship interview experience online, I am still frequently asked the following questions:

How did you crack Google SWE Internship?

Which resources did you follow?

Can you share some int...]]></description><link>https://arnabsen.dev/landing-a-google-swe-internship</link><guid isPermaLink="true">https://arnabsen.dev/landing-a-google-swe-internship</guid><category><![CDATA[Google]]></category><category><![CDATA[internships]]></category><category><![CDATA[tips]]></category><category><![CDATA[software development]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Tue, 04 Apr 2023 17:52:34 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1680630626315/2a50ef76-3810-4513-8853-a243c5dec24b.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Even though, there is a plethora of articles on Google SWE Internship interview experience online, I am still frequently asked the following questions:</p>
<ul>
<li><p>How did you crack Google SWE Internship?</p>
</li>
<li><p>Which resources did you follow?</p>
</li>
<li><p>Can you share some interview tips for my upcoming interview?</p>
</li>
<li><p>Was it on-campus or off-campus?</p>
</li>
</ul>
<p>... and many more. So, I decided to pen down my thoughts so that I can share this article next time someone asks me anything about my SWE Internship.</p>
<h1 id="heading-what-can-you-expect-from-this-blog">What can you expect from this blog?</h1>
<p>These are the few things that I would like to cover in this blog</p>
<ul>
<li><p>How I optimized my resume.</p>
</li>
<li><p>How I approached (or, usually approach) referrals.</p>
</li>
<li><p>Resources I referred to while preparing for DSA.</p>
</li>
<li><p>Interview tips</p>
</li>
</ul>
<blockquote>
<p>NOTE: Although this article is about Google, the recommendations will be useful if you apply to other product-based companies.</p>
</blockquote>
<h1 id="heading-introduction">Introduction</h1>
<p>Hi, I am Arnab Sen, a Software Developer from India 🇮🇳. I interned at Google as an SWE in the summer of 2022 and even grabbed a Pre-Placement Opportunity. I will be joining Google India full-time in July 2023. When I was in my third year, Google unfortunately didn't visit our campus, so I had to take the <mark>off-campus</mark> route to land at Google. Here is my journey:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680630458023/7280cea2-1be5-45ad-b8bf-c07b427764d0.jpeg" alt class="image--center mx-auto" /></p>
<h1 id="heading-decoding-the-google-application-process">Decoding the Google Application Process</h1>
<p>Let's understand how Google usually hires candidates for the Software Engineering Role:</p>
<ol>
<li><p>Resume Shortlisting</p>
</li>
<li><p>Online Aptitude Round</p>
</li>
<li><p>Telephonic Round</p>
</li>
<li><p>Technical Interviews</p>
</li>
<li><p>Hiring Committee</p>
</li>
</ol>
<p>I'll go through each round and talk about what happened and how I prepared.</p>
<blockquote>
<p>The same rounds are usually followed by most of the product-based companies like Microsoft, Amazon, Meta, etc.</p>
</blockquote>
<h2 id="heading-round-1-resume-shortlisting">Round 1: Resume Shortlisting</h2>
<p>It was in <mark>Aug 2021</mark> that I came across the Google India SWE Intern opening in <a target="_blank" href="https://careers.google.com/jobs">Google's career portal</a> and I applied with a referral from one of my seniors.</p>
<h3 id="heading-resume-building-tips">Resume building tips</h3>
<p>Your resume is the first thing that creates an impression. Even if you are super talented and skilled, a poor resume can negate all your abilities. So, you need to create a kickass resume that showcases your talents and impresses everyone!</p>
<p>Originally I used to create my resume in <strong>LaTeX</strong> but later on, it became a hassle to maintain and make updates. Hence I moved on to this website called <a target="_blank" href="https://flowcv.com/">flowcv.com</a>.</p>
<p>Their interface is <strong>user-friendly</strong> and offers <strong>pre-defined templates</strong> while allowing ample options for personalizing your resume. Check out the template I personally used.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680369059904/a00be9ab-5325-4961-b1ed-2d762e03f50b.png" alt="I used the right most template to create my resume" class="image--center mx-auto" /></p>
<p>I prefer a <strong>single-column</strong> resume. If you want to stick to <strong>LaTeX</strong> and you are looking for a resume template, my suggestion would be to use <a target="_blank" href="https://www.overleaf.com/latex/templates/deedy-cv/bjryvfsjdyxz">DeedyCV</a>. It looks very professional and compact.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680630800547/cfbfe964-0a20-4710-85c2-aeb8d93923cd.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-resume-tips-to-keep-in-mind">Resume tips to keep in mind</h3>
<p>Here are some of my tips for your resume:</p>
<ul>
<li><p>If you wanna look like a pro, make sure your stuff <strong>doesn't have any spelling or grammar mistakes</strong>. You can use tools like Google Docs spelling check or Grammarly to help you out.</p>
</li>
<li><p>Just get <strong>straight to the point</strong> and don't use fancy words. Nobody cares about your language skills, they just want to know what you can do as a developer.</p>
</li>
<li><p>It would be great if you could <strong>add some proof to your work</strong>. If you have built a project, published a paper, or completed a certification, you must provide a link for the respective work.</p>
</li>
<li><p><strong>Don't stuff everything</strong> into one page of your resume. Leave some space to breathe. It's totally fine to use 2 pages if you need to. I checked with some Google peeps and recruiters and they all said it's all good to have a 2-page resume if one page just ain't cutting it.</p>
</li>
</ul>
<p>Now, let me share with you some hacks that you can use:</p>
<ul>
<li><p><strong>If you don't have achievements</strong>, try to take part in Leetcode, GFG, Hackerrank, or HackerEarth contests or contests of college fests. You can put those ranks in your achievements. To know the upcoming contests, check out <a target="_blank" href="https://clist.by/">clist.by</a>.</p>
</li>
<li><p><strong>If you don't have projects</strong>, try to take part in hackathons with your peers. "I can't think of a good project" is what I hear from a lot of my juniors. Right now, if I ask you to think of anything, you won't be thinking of anything. But if I asked you to think of a fruit, you might be thinking of a banana, apple, mango, etc. Every hackathon has a unique theme, so by the end of the hackathon you will have a somewhat unique project. You can find such hackathons at <a target="_blank" href="https://devpost.com/">Devpost</a>.</p>
</li>
<li><p><strong>If you don't have enough work experience</strong>, pick up a good open-source project from GitHub and try to make contributions to that for like 1-2 months. By the end of it, you will have enough experience and contributions that you can put it under the "Open Source Experience" category.</p>
</li>
</ul>
<p>If you are interested to know what my resume looks like, check out: <a target="_blank" href="https://arnabsen.dev/resume">arnabsen.dev/resume</a>.</p>
<h3 id="heading-elevate-your-referral-game">Elevate your referral game</h3>
<p>Companies like Google actually let their own employees refer potential candidates. Although a referral doesn't <strong>necessarily guarantee you a job</strong> offer, it does mean your <strong>application will stand out</strong> from the rest of the applicants, which can only mean one thing: your chances of getting hired just got a whole lot higher. In some cases, a <strong>Good Resume + Referral</strong> can fast-forward you through the application rounds. In my case, I skipped directly to the Technical Interview rounds.</p>
<p>Now few things to keep in mind:</p>
<ul>
<li><p><strong>The number of referrals doesn't matter.</strong> It's the quality of the referral that makes the difference. I'm familiar with Google's internal referral portal. Employees must respond to questions such as, why do you believe this candidate is a good fit, have you previously worked with the candidate, and so on. Hence, if the person referring you understands your strengths and weaknesses, the referral will be stronger, and your chances will be better.</p>
</li>
<li><p>I was lucky enough to have a senior in my college who was ready to refer me. We knew each other since day 1 of my college so he was able to write a good referral for me. Shoutout to <a target="_blank" href="https://www.linkedin.com/in/surikumkaran/">Suryansh Gupta</a> ✨. So you can always reach out to your seniors regarding referrals.</p>
</li>
<li><p>If you don't have such seniors in your college, no worries, you can get referrals over <strong>LinkedIn</strong>, <strong>Twitter</strong>, etc. When asking for referrals make sure you provide all the necessary information:</p>
<ul>
<li><p>The <strong>job id</strong> of the role you are applying for.</p>
</li>
<li><p>Your <strong>resume</strong>.</p>
</li>
<li><p>Your <strong>achievements</strong> and <strong>relevant skills</strong>.</p>
</li>
<li><p>Why do you think you are a good fit for the role?</p>
</li>
</ul>
</li>
</ul>
<blockquote>
<p>College tip: active interaction with seniors in your college can have a positive effect on your career. So, don't be shy.</p>
</blockquote>
<p>Now, let's say your resume got shortlisted. Then what?</p>
<h2 id="heading-round-2-online-aptitude-round">Round 2: Online Aptitude Round</h2>
<p>Even though I skipped the OA round, here are some preparation tips:</p>
<ul>
<li><p>Google usually asks pretty standard questions which are of the Leetcode Medium-Hard difficulty. So don't just solve easy questions. Also, no need to waste time by solving very difficult questions either.</p>
</li>
<li><p>Make sure you know all the standard algorithms, I will share the list after this.</p>
</li>
<li><p>I also prepared by participating in Competitive Programming contests, especially from <a target="_blank" href="https://atcoder.jp/">Atcoder</a> and <a target="_blank" href="https://leetcode.com">LeetCode</a>. This was done to ensure that I could answer the questions swiftly and within the time limit, exactly like an actual OA.</p>
</li>
</ul>
<p>My advice would be to go through the past OA rounds questions. Here is the list I referred to for Google's questions:</p>
<ul>
<li><p><a target="_blank" href="https://leetcode.com/discuss/interview-question/352460/Google-Online-Assessment-Questions">https://leetcode.com/discuss/interview-question/352460/Google-Online-Assessment-Questions</a></p>
</li>
<li><p><a target="_blank" href="https://algo.monster/problems/google_online_assessment_questions">https://algo.monster/problems/google_online_assessment_questions</a></p>
</li>
<li><p><a target="_blank" href="https://www.codingninjas.com/codestudio/problem-lists/top-google-coding-interview-questions">https://www.codingninjas.com/codestudio/problem-lists/top-google-coding-interview-questions</a></p>
</li>
<li><p><a target="_blank" href="https://www.interviewbit.com/google-interview-questions/">https://www.interviewbit.com/google-interview-questions/</a></p>
</li>
</ul>
<p>You can find similar collections of questions for other companies as well.</p>
<h3 id="heading-standard-topics-to-prepare-for-oas">Standard topics to prepare for OAs</h3>
<ul>
<li><p>Binary search</p>
</li>
<li><p>BFS/DFS/Flood fill</p>
</li>
<li><p>Tree traversals</p>
</li>
<li><p>Hash tables</p>
</li>
<li><p>Linked list, Stacks, Queues, Two pointers/Sliding window</p>
</li>
<li><p>Binary heaps</p>
</li>
<li><p>Dynamic programming</p>
</li>
<li><p>Union find</p>
</li>
<li><p>Ad hoc/string manipulations</p>
</li>
<li><p>Trie <em>(good to know)</em></p>
</li>
<li><p>Segment trees/Fenwick trees <em>(good to know)</em></p>
</li>
<li><p>Bitmasks <em>(good to know)</em></p>
</li>
</ul>
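<p>To make one of the topics above concrete, here is a minimal Union-Find (disjoint set union) sketch. This is my own illustration, not a question from any actual OA:</p>

```python
class DSU:
    """Union-Find with path compression and union by size."""

    def __init__(self, n):
        self.parent = list(range(n))
        self.size = [1] * n

    def find(self, x):
        # Walk up to the root, compressing the path as we go.
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return False  # already in the same component
        if self.size[ra] < self.size[rb]:
            ra, rb = rb, ra
        self.parent[rb] = ra
        self.size[ra] += self.size[rb]
        return True

# Example: count connected components among 5 nodes.
dsu = DSU(5)
dsu.union(0, 1)
dsu.union(3, 4)
components = len({dsu.find(i) for i in range(5)})
print(components)  # 3 components: {0, 1}, {2}, {3, 4}
```

This pattern shows up constantly in graph connectivity questions, which is why it made my list.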
<p>Once you are done with the OA you will be taken to the 3rd round.</p>
<h2 id="heading-round-3-telephonic-round">Round 3: Telephonic Round</h2>
<p>So, for this round, they're gonna call you up and hit you with some theoretical Data Structures and Algorithms questions. Some might even have multiple-choice answers. They might throw in some programming language questions too. Here are some examples of questions that you can expect to get an idea:</p>
<ul>
<li><p>What are the pros and cons of using an adjacency list over an adjacency matrix for a graph problem?</p>
</li>
<li><p>What is the data structure used in a priority queue?</p>
</li>
<li><p>What is the difference between Binary Search Tree and Heap?</p>
</li>
<li><p>How is a C++ <code>map</code> different from <code>unordered_map</code> from an implementation point of view?</p>
</li>
<li><p>Which algorithm would you use for a network flow problem?</p>
</li>
<li><p>What are the time complexities of Bellman-Ford and Dijkstra's algorithm?</p>
</li>
</ul>
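<p>Taking one of the questions above as an example: a priority queue is typically backed by a binary heap, which gives O(log n) insertion and removal of the minimum element. A quick sketch using Python's <code>heapq</code> (my own illustration, not an actual interview answer):</p>

```python
import heapq

# heapq maintains a min-heap over a plain list; the tuple's first
# element (the priority) decides which task comes out first.
tasks = []
heapq.heappush(tasks, (3, "write tests"))
heapq.heappush(tasks, (1, "fix prod bug"))
heapq.heappush(tasks, (2, "code review"))

order = [heapq.heappop(tasks)[1] for _ in range(len(tasks))]
print(order)  # ['fix prod bug', 'code review', 'write tests']
```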
<p>If you have a strong fundamental understanding of Data Structures and Algorithms then these won't be very difficult for you.</p>
<h3 id="heading-resources-to-study-dsa">Resources to study DSA</h3>
<p>I studied DSA mainly from these two sources:</p>
<ol>
<li><a target="_blank" href="https://sd.blackball.lv/library/Introduction_to_Algorithms_Third_Edition_(2009).pdf">Introduction to Algorithm by CLRS (a.k.a the Bible of DSA)</a></li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680610732770/afd6849a-e717-4220-8660-86f46d41632a.png" alt class="image--center mx-auto" /></p>
<ol start="2">
<li><p><a target="_blank" href="https://www.youtube.com/playlist?list=PLUl4u3cNGP61Oq3tWYp6V_F-5jb5L2iHb">MIT 6.006 Introduction to Algorithms, Fall 2011</a></p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680610861206/e2c4a5ee-c526-41e6-9c14-70aa166435f1.png" alt class="image--center mx-auto" /></p>
</li>
</ol>
<p>So, I made sure to take notes while watching those videos and reading the book chapters. And when I got stuck with any problem, I just Googled it and checked out the first link - be it an article or a video.</p>
<p>Once you complete the telephonic round you will be invited for the technical interview rounds.</p>
<h2 id="heading-round-4-interview-rounds">Round 4: Interview Rounds</h2>
<p>It was on <mark>13th September</mark> that I received a mail stating that my interview was lined up a week later. I freaked out a little. First, I didn't expect to hear back from them, and second, how was I supposed to prepare for a Google interview in 7 days?</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680611208993/dd626c79-24d1-4cff-881a-9ca080407529.png" alt class="image--center mx-auto" /></p>
<p>I took a smart approach. I checked out other people's experiences with Google Interviews and saw that they like to ask about DP, Graphs, Binary Search, Lazy Sum, and Sliding Windows stuff. I went back and reviewed all the notes I took from that MIT course and honed in on those types of questions. I practiced topic-wise from LeetCode itself.</p>
<blockquote>
<p>Another question I get is whether LeetCode premium is really worth it. To be honest, these days it isn't anymore; there are tonnes of other platforms that offer, for free, questions similar to those locked behind LeetCode premium. But 2 years back, I felt spending Rs 10.5k for 1 year of LC premium might be worth it: if it helped me crack a job like Google, the amount invested would be just a fraction of the monthly salary. So, I got 1 year of LC premium.</p>
</blockquote>
<h3 id="heading-my-style-of-practicing-for-interview">My style of practicing for interview</h3>
<p>After discussing with my seniors I realized that solving a question in an interview setting is completely different than practicing questions by myself. Most of these Google interviews happen on an editor-based portal where you cannot run the code and it also doesn't support intellisense. So, I would create a Google doc where I would solve all the questions and write code. This way I didn't have the privilege of syntax highlighting or intellisense. As they say, <em>"Hope for the best, Prepare for the worst".</em></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680611571331/b349c1fc-bda6-4903-ad35-04394cc293fa.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-the-strategy-of-breaking-down-google-interviews">The strategy of breaking down Google interviews</h3>
<p>A Google Interview happens for 45 mins and they usually ask 2 questions. So, I divided the total time into (20 + 20 + 5) mins. 20 mins for each question and 5 mins as a buffer (for an emergency like network issues or Q&amp;A at the end).</p>
<p>Within those 20 mins, I would spend 3-5 mins understanding the question, asking clarifying questions, and explaining my approach; 10 mins implementing the optimized solution; and 5 mins dry running or debugging. Note that you have to come up with the optimal solution as quickly as possible, so I would insist on not wasting time implementing the brute-force solution.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680612063934/3ed32d8a-003a-4021-a47f-e4e5e026de48.png" alt class="image--center mx-auto" /></p>
<p>So, even during practice when I was solving the questions myself I followed the same strategy. I would pretend that I am in an interview setting. I would try to read my thoughts out loud and explain everything. I even made the habit of doing a dry run every time I wrote the final code for a problem.</p>
<h3 id="heading-mock-interviews-are-an-absolute-necessary">Mock Interviews are an absolute necessity</h3>
<p>To succeed in any field, preparation is key. Just as astronauts undergo rigorous training to prepare for the unique conditions of zero gravity, so too must job seekers prepare for the challenge of a job interview. One effective way to do this is by practicing mock interviews. That way you can identify your weaknesses and work on them before the actual interview.</p>
<p>I also watched a lot of Mock Interview videos on YouTube to get an idea of what an interview setting would be like. I created a playlist of all the videos that I referred to.</p>
<p><a target="_blank" href="https://www.youtube.com/playlist?list=PLeXWGmu4fYL76f_nnnGyCu9Tr9k5NYtko">Link to the playlist 📼.</a></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680612908848/af5630e9-e9b0-4d51-a684-4caf97898c74.png" alt class="image--center mx-auto" /></p>
<p>I gave a bunch of mock interviews (around 5-6) by taking the help of my seniors and using online free platforms like <a target="_blank" href="https://www.pramp.com/invt/lYAEXB2Pa1Hxra4p3zMP">Pramp</a>.</p>
<h3 id="heading-importance-of-clarifying-questions">Importance of clarifying questions</h3>
<p>The initial question given to you will be intentionally vague. The interviewer will expect you to ask clarifying questions to pin down the fine details. Let's take a very simple example: you are given an array of n numbers, and you have to find the subarray with the maximum sum.</p>
<p>Now, your first thought might be to use Kadane's Algorithm. But wait, what if all the numbers are positive? In that case, the answer is simply the entire array. As you can see, one small change in the constraints has a big effect on the final solution. So always practice asking clarifying questions.</p>
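<p>For reference, here is a minimal Kadane's Algorithm sketch for that maximum-subarray example (my own illustration, not an actual interview solution):</p>

```python
def max_subarray_sum(nums):
    """Kadane's Algorithm: O(n) time, O(1) extra space."""
    best = current = nums[0]
    for x in nums[1:]:
        # Either extend the running subarray or start fresh at x.
        current = max(x, current + x)
        best = max(best, current)
    return best

print(max_subarray_sum([-2, 1, -3, 4, -1, 2, 1, -5, 4]))  # 6, from [4, -1, 2, 1]
print(max_subarray_sum([1, 2, 3]))  # 6: all positive, so the whole array wins
```

Notice how the all-positive case falls out naturally here, which is exactly the edge case a clarifying question would have surfaced.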
<h3 id="heading-interview-tips-to-keep-in-mind">Interview tips to keep in mind</h3>
<ul>
<li><p>Treat the interview more like a conversation with a fellow software developer and not like a viva. This will also help you tackle your nervousness.</p>
</li>
<li><p>During the interviews, you should also think about other ways to improve the solution. Share the different options or tradeoffs that you're considering.</p>
</li>
<li><p>Think out loud because that way the interviewer will know your thought process and can give you signals if you are going in the wrong direction.</p>
</li>
<li><p>Discuss tradeoffs if possible. How would you improve your solution? How do you make it faster? How to optimize the space complexity? Know the time and space tradeoffs of the solution/data structures that you are picking.</p>
</li>
<li><p>Try to avoid taking hints, but always remember solving with hints is much better than not solving at all.</p>
</li>
<li><p>Avoid syntactical errors like missing semicolons, unclosed parentheses, wrong indentation, etc. Even though they are not deal-breakers, writing high-quality code can showcase your skills as a developer.</p>
</li>
<li><p>Finally, use proper variable names and if required add meaningful comments. But remember you have limited time so just don't spend minutes trying to come up with an appropriate variable name.</p>
</li>
<li><p>Make it a habit to always calculate the Time and Space complexity of your solution.</p>
</li>
</ul>
<blockquote>
<p>If you don't remember the exact syntax of a library class or method, that is fine. You can just let the interviewer know, and replace it with a meaningful substitute.</p>
</blockquote>
<p>All of these points might be hard to keep in mind during the interview, which is why the more mock interviews you give, the better you get at remembering these subtle points.</p>
<p>Here are some more resources that I referred to:</p>
<ul>
<li><a target="_blank" href="https://www.bigocheatsheet.com/">Big-O-Cheatsheet</a></li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680614967428/1ac9e05e-b247-4408-b4c6-d203619806fd.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680615018778/0809eacd-3a5d-45e9-9d80-fb1f129ed7f8.png" alt class="image--center mx-auto" /></p>
<p>For some last-minute revision, I also followed this collection of LC questions.</p>
<p><a target="_blank" href="https://seanprashad.com/leetcode-patterns/">https://seanprashad.com/leetcode-patterns/</a></p>
<h3 id="heading-my-interview-experience">My interview experience</h3>
<p>I cannot reveal the actual questions because of NDA, but in my first interview, I had a question on Sliding Windows and Dynamic Programming.</p>
<p>In my second interview, held the next day, I had 3 questions. One was a Scheduling Problem based on a heap; the second was again a DP question. I answered these 2 quickly, with around 10-12 mins left, so the interviewer gave me another DP question for which he just asked me to write the recursive function and explain how I would memoize it. I was able to do that as well.</p>
<p>Both my interviews went well from my end: I didn't take any hints, wrote high-quality code, and was able to successfully dry run all my solutions with edge cases.</p>
<h2 id="heading-round-5-hiring-committee">Round 5: Hiring Committee</h2>
<p>After I finished my interviews, I had a long wait. From what I've heard, there is a hiring committee comprising four to five people who have prior interview experience and are familiar with the hiring standards. They review your resume, the code you wrote during the interviews, and the feedback from the interviewers before making a final decision.</p>
<p>For me, it was a long long wait. I very clearly remember the day. It was Maha Shasthi (the onset of Durga Puja), I woke up to a mail saying:</p>
<p><mark>"Congratulations on your SWE Summer Intern offer with Google India!"</mark></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1680617576127/fcacca0a-ef4b-4c7a-9127-fc2d9345b183.png" alt class="image--center mx-auto" /></p>
<p>It was a perfect blessing to me. I was ecstatic and overjoyed. It was an absolute dream come true moment for me and my family. Looking back I remember I felt very depressed when I was rejected by Microsoft twice. I gave my best in the Microsoft Engage Aptitude round yet wasn't even selected for the project building round. Then again when Microsoft came on-campus I was rejected after the OA round which still baffles me. But as they say "Whatever happens, happens for Good".</p>
<p>So whenever something unexpected or bad happens to you, remember these lines:</p>
<p><img src="https://qph.cf2.quoracdn.net/main-qimg-c1e71c29e562ff5498718a64c3d287f0-lq" alt="What was the 'Connecting the dots theory' of Steve Jobs? - Quora" class="image--center mx-auto" /></p>
<hr />
<p>I hope you liked my blog, do follow me for more such content. I wrote another blog sharing some tips on how to convert your internship to PPO.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://arnabsen.hashnode.dev/important-tips-on-bagging-ppo-during-internship-and-securing-a-full-time-role">https://arnabsen.hashnode.dev/important-tips-on-bagging-ppo-during-internship-and-securing-a-full-time-role</a></div>
<p> </p>
<p>You can learn more about me at <a target="_blank" href="https://www.arnabsen.dev/about">arnabsen.dev/about</a>. Have a nice day !!</p>
]]></content:encoded></item><item><title><![CDATA[From Burnout to Balance: How I recovered from burnout]]></title><description><![CDATA[Have you experienced a sudden loss of motivation, a lack of energy to complete tasks, a decrease in inventiveness, and a drop in productivity?
It happened to me, and after speaking with one of my seniors, I discovered that this condition is known as ...]]></description><link>https://arnabsen.dev/from-burnout-to-balance</link><guid isPermaLink="true">https://arnabsen.dev/from-burnout-to-balance</guid><category><![CDATA[WeMakeDevs]]></category><category><![CDATA[Developer]]></category><category><![CDATA[software development]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Mon, 13 Mar 2023 15:10:24 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1678714028993/1f64dd91-3698-4342-88b2-7bb4b7dc8ea3.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Have you experienced a sudden loss of motivation, a lack of energy to complete tasks, a decrease in inventiveness, and a drop in productivity?</p>
<p>It happened to me, and after speaking with one of my seniors, I discovered that this condition is known as burnout. I then began investigating the topic, and it turns out that it's a rather common occurrence that can impact anyone, particularly those in high-stress professions like software development. However, many individuals are unaware of it and fail to take the required steps to recover. The sooner you recuperate, the more energized you will feel about your work, which will eventually result in better outcomes and increased productivity.</p>
<p>In this article, I'll try to go over some of the indicators that you might be burnt out, and then I'll provide some suggestions that helped me get back on track. It should be noted that burnout can progress to depression, in which case you should consult a medical expert.</p>
<blockquote>
<p>A little note: Because I am a Software Developer, my thoughts on dealing with Burnout will be slightly skewed and will be more relevant towards this profession.</p>
</blockquote>
<h1 id="heading-my-burnout-experience">My burnout experience</h1>
<p>I was attempting to achieve several things at the same time in the second half of 2021.</p>
<ol>
<li><p>I was a mentee at Summer of Bitcoin, working on adding USDT (User Statically-Defined Tracing) tracepoints to the UTXO (Unspent Transaction Output) set of Bitcoin. Furthermore, I had no prior knowledge of Bitcoin, tracepoints, or working with a huge C++ codebase.</p>
</li>
<li><p>My internship placements were ongoing, and I was being turned down both on and off campus. I was devastated when I was rejected from Microsoft's Engage program, and then again after their OA round.</p>
</li>
</ol>
<p>But at the time, I was all in on landing a decent internship, so I gave it everything. I ended up cracking the Wells Fargo SDE Internship and the Google SWE Internship. After completing my Summer of Bitcoin project, I also got an internship opportunity at Chaincode Labs.</p>
<p>But then I changed from being a highly driven, hyperactive person to someone who had very little ambition to accomplish anything and felt sluggish all day. My productivity dropped, which made me feel even worse, and I became trapped in a vicious loop.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1678707975384/efb5eae5-4f03-4f65-8d06-0c93ee23a6d5.png" alt="Burnout Cycle" class="image--right mx-auto mr-0" /></p>
<h1 id="heading-signs-of-burnout-how-to-recognize-them-early">Signs of Burnout: How to Recognize Them Early</h1>
<p>These will be some of the early signs that you will notice:</p>
<ol>
<li><p><strong>Decreased productivity:</strong> You will find yourself struggling to complete tasks or meet deadlines.</p>
</li>
<li><p><strong>Lack of motivation:</strong> There is a feeling of disengagement or lack of interest in your work. You might not even find your work pleasurable anymore.</p>
</li>
<li><p><strong>Chronic fatigue:</strong> You're constantly tired, despite getting enough sleep.</p>
</li>
<li><p><strong>Irritability:</strong> Burnout can lead to feelings of frustration, anger, and irritability.</p>
</li>
<li><p><strong>Decreased quality of work:</strong> Burnout can cause you to make more mistakes or produce lower-quality work than you normally would.</p>
</li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1678713346441/b3aca953-9115-4917-8b97-f13ada452630.png" alt="Image generated with Dall-E" class="image--center mx-auto" /></p>
<p>Now, let's understand the causes of Burnout and how to avoid them.</p>
<h1 id="heading-avoiding-burnout">Avoiding Burnout</h1>
<h2 id="heading-maintaining-balance"><strong>Maintaining balance</strong></h2>
<p>In life, balance is crucial. It's incredibly easy for software engineers to work longer hours because all we need is a laptop and an internet connection. As a result, we occasionally forget our limits and push ourselves too hard. It may produce results in the short term, but it is hazardous in the long run. If we spend all our time and energy at work and neglect our personal life and self-care, it can lead to physical, mental, and emotional exhaustion.</p>
<p>As a result, we must strike a healthy and sustainable balance between the many aspects of our lives, such as work, hobbies, family, and friends. This varies from person to person, so you must reflect and establish ground rules for yourself.</p>
<ul>
<li><p>Post 6 pm, no work-related calls/talks.</p>
</li>
<li><p>Go outside to play some sports with friends.</p>
</li>
<li><p>Taking a vacation once in a while and travelling.</p>
</li>
<li><p>Having dinners with family or friends.</p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1678713722372/93b1d9d3-5aca-49d3-ac10-27aa2589951f.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-taking-breaks">Taking Breaks</h2>
<p>Taking breaks is incredibly important in avoiding burnout. Breaks allow us to rest, recharge, and reset, which can help us manage stress and prevent exhaustion. When we continuously work without breaks, we can become overwhelmed, stressed, and exhausted, leading to burnout.</p>
<p>Breaks can take many forms, from short breaks throughout the day to longer vacations. Short breaks, such as taking a few minutes to stretch, walk around, or chat with a colleague, can help us refocus and re-energize during the workday. Longer breaks, such as weekends, holidays, or vacations, allow us to disconnect from work and engage in activities that bring us joy and relaxation.</p>
<p>Next time you are stuck fixing a bug, take a walk and come back. Eight times out of ten, you will be able to fix it.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1678714142108/e898a35f-628d-49a1-8f57-2b76a1129323.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-having-a-clear-understanding-of-your-work">Having a clear understanding of your work</h2>
<p>Employees can experience stress, dissatisfaction, and burnout when they are confused about their tasks and responsibilities or are given ambiguous expectations.</p>
<p>Employees who lack clear direction may struggle to prioritize their tasks or become overwhelmed by competing requests. This can lead to feelings of powerlessness and a lack of control over their work, which can lead to burnout. Workers may also feel that they are not fulfilling expectations or that their efforts are not being recognized, leading to feelings of discontent and disengagement.</p>
<p><strong>Clear communication is the solution.</strong> If you have any questions about the tasks, speak with your manager, senior, or lead. If you believe you will have problems completing the work, mention this as well. This helps in two ways. First, your manager will have a clear picture of your understanding and can provide extra resources to assist you. Second, you will feel relieved, which will lower your stress levels.</p>
<h2 id="heading-self-care">Self-Care</h2>
<p>Finally, you must take care of yourself. By practicing self-care, individuals can reduce stress levels and promote overall well-being, which can help prevent burnout. Some self-care practices that can help prevent burnout include:</p>
<ol>
<li><p><strong>Prioritizing sleep:</strong> Getting enough sleep is crucial for overall health and well-being and can help individuals manage stress levels. Consistently getting 7-8 hours of sleep per night can help prevent burnout.</p>
</li>
<li><p><strong>Engaging in physical activity:</strong> Regular exercise can help reduce stress levels, promote relaxation, and improve overall health. Engaging in activities such as walking, jogging, or yoga can help prevent burnout.</p>
</li>
<li><p><strong>Practicing mindfulness:</strong> Mindfulness practices, such as meditation or deep breathing, can help individuals manage stress and improve emotional well-being. By staying present in the moment and reducing distractions, individuals can prevent burnout.</p>
</li>
<li><p><strong>Eating a healthy diet:</strong> Eating a balanced diet that includes whole foods, fruits, and vegetables can help individuals maintain physical and mental health. A healthy diet can help individuals manage stress levels and prevent burnout.</p>
</li>
<li><p><strong>Engaging in leisure activities:</strong> Engaging in activities that bring joy and relaxation, such as reading, painting, or spending time with friends, can help individuals manage stress and prevent burnout.</p>
</li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1678713685990/026e90ea-2715-456e-85a7-b13877aa0cec.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-seeking-professional-help">Seeking professional help</h2>
<p>If things are getting out of hand, it's always a good idea to seek expert assistance. Counseling or therapy can offer individuals assistance and guidance in dealing with stress and avoiding burnout. The faster you recuperate, the better you will be able to focus on your task, leading to an increase in total productivity.</p>
<h2 id="heading-final-thoughts">Final Thoughts</h2>
<p>I have been following these pieces of advice that I received from my seniors and leads at Google, and to be honest, the outcomes have been fantastic.</p>
<ul>
<li><p>I completed my Google internship securing a PPO.</p>
</li>
<li><p>I worked as a Technical Content Writer for Codedamn and authored a few high-quality posts. You can find them right here.</p>
</li>
<li><p>I won Showwcase's Develevate event.</p>
</li>
<li><p>I joined a Singapore-based web3 startup called Cypherock as an SDE Intern.</p>
</li>
<li><p>I joined Pupilfirst as a Teaching Assistant.</p>
</li>
<li><p>At the same time, I've been taking care of myself by going to the gym on a regular basis and eating healthily.</p>
</li>
</ul>
<p>Yet I didn't experience any burnout along the way. I hope these few tips will assist you in increasing your work productivity and avoiding burnout.</p>
<h1 id="heading-tldr">TLDR;</h1>
<p>Maintaining balance, taking breaks, having a clear understanding of work responsibilities, practicing self-care, and seeking professional help if needed are all effective strategies to prevent and recover from burnout. Prioritizing our well-being and seeking help when necessary is essential to achieving long-term success and productivity. Remember, taking care of ourselves is not a luxury but a necessity, and it is essential to make self-care a priority in our daily lives.</p>
<blockquote>
<p>All the images in this article were generated by Dall-E.</p>
</blockquote>
]]></content:encoded></item><item><title><![CDATA[Important tips on bagging PPO during Internship and securing a Full-Time Role]]></title><description><![CDATA[If you are here after you have got an internship, then a big congratulations to you for getting an internship. I hope you are enjoying it and learning a lot. I am sure you are. But now you want to make sure that you get a PPO at the end of it. So, le...]]></description><link>https://arnabsen.dev/important-tips-on-bagging-ppo-during-internship-and-securing-a-full-time-role</link><guid isPermaLink="true">https://arnabsen.dev/important-tips-on-bagging-ppo-during-internship-and-securing-a-full-time-role</guid><category><![CDATA[internships]]></category><category><![CDATA[Career]]></category><category><![CDATA[software development]]></category><category><![CDATA[learning]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sun, 29 Jan 2023 17:33:10 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1675013292252/d7a1a44b-34da-44d7-8fa2-899c16121e5b.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>If you are here after you have got an internship, then a big congratulations to you for getting an internship. I hope you are enjoying it and learning a lot. I am sure you are. But now you want to make sure that you get a PPO at the end of it. So, let's get started.</p>
<p>I have been wanting to write this blog for a long time. I will try to include all the important points that I followed and learned during my internship, which helped me bag a PPO. I will break them down into two main parts: <strong>Technical</strong> and <strong>Non-Technical</strong>. Before that, let me give you a little background.</p>
<h1 id="heading-background">Background</h1>
<p>I applied for the Google Summer SWE Intern opening on <a target="_blank" href="https://careers.google.com/">Google's careers portal</a> with a referral. <a target="_blank" href="https://arnabsen.dev/resume.pdf">My Resume</a> got selected and I was called for 2 rounds of interviews. After that, I got the offer and interned at Google for 10 weeks remotely with the Google Ads team. Finally, I was able to crack the PPO and got a full-time offer from Google as an SWE. I am planning to join them in July 2022.</p>
<p>From</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://twitter.com/ArnabSen1729/status/1528345010164613123">https://twitter.com/ArnabSen1729/status/1528345010164613123</a></div>
<p> </p>
<p>To</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://twitter.com/ArnabSen1729/status/1551660735440179201">https://twitter.com/ArnabSen1729/status/1551660735440179201</a></div>
<p> </p>
<h1 id="heading-conversion-process">Conversion Process</h1>
<p>Before sharing the pieces of advice, I feel it's important to learn about the conversion steps. This will vary from company to company but overall the process remains the same.</p>
<ol>
<li><p>Every Intern is assigned a <strong>project</strong> and a mentor (at Google we call them <strong>host</strong>) and a <strong>co-host</strong>. The project is usually a part of the product that the team is working on. The mentor is a senior engineer who will be your guide throughout the internship. You will be working with your mentor and other engineers on the team.</p>
</li>
<li><p>There will be two evaluations during the internship. The first evaluation called <strong>Mid Evaluation</strong> happens after 5 weeks (since my internship period was 10 weeks) and the <strong>Final Evaluation</strong> happens in the last week of your internship. The Mid Evaluation is more of a check-in to see how you are doing and if you are on track. The Final Evaluation is more of a review of your work and the project. The host and co-host also get to give feedback on your performance and your project progress. This feedback plays a huge role in the final conversion.</p>
</li>
</ol>
<h1 id="heading-technical-advice">Technical Advice</h1>
<h2 id="heading-1-learn-the-basics">1. Learn the basics</h2>
<p>You must have strong problem-solving abilities to work on the project that will be assigned to you. There are some basic Computer Science concepts that one should be familiar with. I will list them down below.</p>
<h3 id="heading-a-version-control-systems">a. Version Control Systems</h3>
<p>No matter which company you are working at, you will be using a version control system. You must be familiar with the basics of version control systems. I would recommend you learn Git and GitHub. You can learn it from here.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/RGOj5yH7evk">https://youtu.be/RGOj5yH7evk</a></div>
<p> </p>
<p>At Google, we didn't use <code>git</code>; instead, we used Mercurial. But the fundamental concepts are the same: if you know how <code>git</code> works, picking up a new version control system is mostly a matter of learning its specific commands.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1675005971140/1da0047e-f988-49ba-8471-75bf637bf95c.jpeg" alt="git logo" class="image--center mx-auto" /></p>
<h3 id="heading-b-basic-commandline"><strong>b. Basic Commandline</strong></h3>
<p>There is a high chance that you will be working in a cloud environment and you must be comfortable with the terminal. You should be able to navigate through the file system, create directories, create files, delete files, etc. You should also be able to use basic commands like <code>cat</code>, <code>grep</code>, <code>sed</code>, <code>awk</code>, etc. You can learn about these commands from the man pages. For example, to learn about the <code>cat</code> command, you can run <code>man cat</code> in your terminal.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/ZtqBQ68cfJc">https://youtu.be/ZtqBQ68cfJc</a></div>
<p> </p>
<h3 id="heading-c-data-structures-and-algorithms"><strong>c. Data Structures and Algorithms</strong></h3>
<p>You should be familiar with the basic data structures and algorithms. You should be able to implement them in your language of choice and most importantly be aware of when to use which data structure and algorithm.</p>
<blockquote>
<p><strong>Note:</strong> You don't need to be an expert in these topics. You just need to be familiar with them. But most of these companies have DSA questions in their interviews. So, you must be familiar with them.</p>
</blockquote>
<p>I love the MIT 6.006 Introduction to Algorithms lectures, they have helped me build the foundation of algorithms. You can find the playlist <a target="_blank" href="https://youtu.be/HtSuA80QTyo?list=PLUl4u3cNGP61Oq3tWYp6V_F-5jb5L2iHb">here</a>:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/HtSuA80QTyo">https://youtu.be/HtSuA80QTyo</a></div>
<p> </p>
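<p>To make the "when to use which" point concrete, here is a small illustrative Python sketch (the scenario and names are hypothetical): membership checks on a list take linear time, while a set answers them in constant time on average.</p>

```python
# Membership tests: a list scans every element (O(n)),
# while a set uses hashing (O(1) on average).

allowed_users = ["alice", "bob", "carol"]
allowed_users_set = set(allowed_users)

def can_access(user: str) -> bool:
    # Hash lookup: stays fast even with millions of users.
    return user in allowed_users_set

print(can_access("bob"))      # True
print(can_access("mallory"))  # False
```

<p>For a handful of items the difference is negligible, but on large inputs picking the right structure is often the difference between milliseconds and minutes.</p>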
<h2 id="heading-2-knowing-the-syntax-of-the-language-you-will-be-using">2. Knowing the syntax of the language you will be using</h2>
<p>Sometimes you can ask the recruiter about the tech stack you will be using, but most of the time, you won't know it beforehand. Even so, you can learn the basics of the languages popularly used at the company. For example, I knew that Google uses C++, Java, and Golang a lot in their backend. I already knew C++ and Golang, so I learned the basics of Java before joining. This saved me a lot of time picking up the language during the internship.</p>
<h2 id="heading-3-learn-design-patterns">3. Learn Design Patterns</h2>
<p>Design patterns are important for developers to understand because they provide proven solutions to common software design problems. By using design patterns, developers can improve the quality, reusability, and maintainability of their code.</p>
<p>While you don't need to know every design pattern in existence, it's helpful to have a good understanding of the most common design patterns and when to use them. This knowledge can save a lot of time and effort by allowing you to reuse tried-and-true solutions to common problems, rather than having to reinvent the wheel every time you encounter a new challenge.</p>
<p><img src="https://project-assets.showwcase.com/56320/1673195309976-design-pattern.png" alt="23 Design Patterns" /></p>
<h2 id="heading-4-write-clean-code">4. Write clean code</h2>
<p>I can't stress this enough. Writing clean code is very important, especially if you are interning at a big organization like Google. Every change you make goes through a rigorous review process, and if your code is not clean, it will be rejected. Here are some points to keep in mind while writing clean code.</p>
<ul>
<li><p><strong>Keep it simple:</strong> When writing code, it's important to avoid unnecessary complexity. Simple, easy-to-understand code is usually easier to maintain and debug.</p>
</li>
<li><p><strong>Use clear and descriptive names:</strong> Choosing clear and descriptive names for variables, functions, and other elements of the code can make it much easier to read and understand. Avoid using abbreviations or single-letter names, unless they are well-known and widely understood.</p>
</li>
<li><p><strong>Use comments:</strong> Comments can be a helpful tool for explaining the purpose and functionality of different parts of the code. However, it's important to use comments sparingly and only when they are truly necessary, as too many comments can make the code harder to read.</p>
</li>
<li><p><strong>Follow a consistent style:</strong> Consistency is key when it comes to writing clean code. Using a consistent style for indentation, spacing, and other elements of the code can make it easier to read and understand. It's also a good idea to follow established coding standards and guidelines, such as those provided by the programming language's community or the company you work for.</p>
</li>
</ul>
<p><img src="https://project-assets.showwcase.com/56320/1673195372005-clean-code-image.png" alt="Code Quality Meme" class="image--center mx-auto" /></p>
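<p>Here is a small before/after sketch of the naming and simplicity points above (a hypothetical function, not taken from any real review):</p>

```python
# Before: cryptic names, no stated intent, crashes on empty input.
def f(l):
    return sum(l) / len(l)

# After: a descriptive name, a docstring, and a guard make the intent obvious.
def average_response_time_ms(times_ms: list) -> float:
    """Mean of the recorded response times, in milliseconds."""
    if not times_ms:
        raise ValueError("no response times recorded")
    return sum(times_ms) / len(times_ms)

print(average_response_time_ms([100.0, 200.0, 300.0]))  # 200.0
```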
<h2 id="heading-5-learn-testing">5. Learn testing</h2>
<p>Testing is an important part of software development. It's important to test your code to make sure it works as expected and to catch any bugs that might have been introduced during development. Testing can also help you to ensure that your code is clean and easy to understand. There are different types of testing like unit testing, integration testing, etc. Knowing these testing types will help you a lot.</p>
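<p>As a minimal illustration, a unit test with Python's built-in <code>unittest</code> module might look like this (the function under test is a hypothetical example of mine):</p>

```python
import unittest

def apply_discount(price: float, percent: float) -> float:
    """Return the price after applying a percentage discount."""
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    return price * (1 - percent / 100)

class TestApplyDiscount(unittest.TestCase):
    def test_typical_discount(self):
        self.assertAlmostEqual(apply_discount(200.0, 25), 150.0)

    def test_invalid_percent_raises(self):
        with self.assertRaises(ValueError):
            apply_discount(100.0, 150)

if __name__ == "__main__":
    unittest.main(exit=False)
```

<p>Tests like these also act as documentation: they record the expected behaviour, including the edge cases.</p>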
<p>Here is a free course on <a target="_blank" href="https://thoughtbot.com/upcase/fundamentals-of-tdd">Test Driven Development</a>.</p>
<h2 id="heading-6-know-how-to-write-a-design-doc">6. Know how to write a Design Doc</h2>
<p>At Google, we heavily focus on the Design Doc. Even before we write a piece of code for a particular implementation we have to make sure we have analyzed every possible edge case, and technical and non-technical requirements. Design documents also help with communication among team members and can serve as a reference for future maintenance and updates to the software.</p>
<p>Certain key points need to be covered in a Design Doc, such as Architecture, Requirements, Design, User Interface, and Testing.</p>
<p><a target="_blank" href="https://www.industrialempathy.com/posts/design-docs-at-google/">Here</a> is a great post that talks in depth about Design Docs.</p>
<p>This is a <a target="_blank" href="https://docs.google.com/document/d/1pgMutdDasJb6eN6yK6M95JM8gQ16IKacxxhPXgeL9WY/edit">sample template of a Design Document</a>.</p>
<h1 id="heading-non-technical-advice">Non-Technical Advice</h1>
<p>In addition to the technical advice, I would also like to share some non-technical advice that I followed during my internship.</p>
<h2 id="heading-1-be-proactive">1. Be proactive</h2>
<p>This was something my host pointed out in my mid-evaluation: I was initially too dependent on my mentor, waiting for her to tell me the next steps. Once I took that feedback on board, I started planning my work, assigning tasks to myself, and taking the initiative to solve problems, while obviously keeping my mentor in the loop. That takes me to the next point.</p>
<h2 id="heading-2-keep-your-mentor-host-and-co-host-in-the-loop">2. Keep your mentor (host and co-host) in the loop</h2>
<p>Remember, your host and co-host will be evaluating you at the end of your internship. Also, they have their deadline and they will be busy with their work. So, you must keep them in the loop. You should be able to communicate with them and tell them what you are working on and what problems you are facing. This will help them to guide you better.</p>
<p>We had a weekly meeting with our host and co-host but even then I would update them on my progress every alternate day so that they are aware of what I am working on. I would also disclose the problems I faced and the solutions I found.</p>
<p>There is a really good article on <a target="_blank" href="https://jvns.ca/blog/brag-documents/?utm_source=pocket_mylist">Writing a brag document</a> that I would recommend you read.</p>
<h2 id="heading-3-be-open-to-feedback">3. Be open to feedback</h2>
<p>No matter how big the company you are interning at, remember that you are still very new to this field and there is a lot to improve. Feedback from your host/co-host or other team members can help you identify areas of your work that need improvement. It might be difficult to accept feedback at first, but you should take it positively and work on improving yourself.</p>
<p>Your mentor will not always reach out to you with feedback; if needed, you might have to reach out and ask for it. You can ask for feedback on your code, your communication skills, your problem-solving skills, and so on. This also shows that you are proactive and willing to make improvements.</p>
<p>A good place to get feedback from your host/co-host is through 1:1 meetings. Here is a good video on <a target="_blank" href="https://youtu.be/ADWkkJtZna4">The Art Of The 1:1 Meeting</a>.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/ADWkkJtZna4">https://youtu.be/ADWkkJtZna4</a></div>
<p> </p>
<h2 id="heading-4-time-management">4. Time management</h2>
<p>Plan out the tasks that you will have to do, assign priority to them and then start working on them. This will help you to manage your time better. You should also be able to estimate the time that you will need to complete a task. Initially, you may end up overestimating or underestimating the time but as you gain more experience, you will be able to estimate the time better.</p>
<h2 id="heading-5-ask-questions">5. Ask questions</h2>
<p>An internship usually lasts 10-12 weeks, which is not a lot of time. So if you are stuck at something, you can't afford to lose time. You should be able to ask questions to your mentor and other team members. But before that make sure you have done a fair amount of research on your end as well.</p>
<p>You have to ask the right questions to get quick answers. If you come across an error message, explain clearly how to reproduce it and what the message says. Share the code snippet that might be causing the error, or the files you are working on, along with the approaches you have already tried. This will help the person you are asking to understand the problem better. Also, make sure you are asking in the right channel. For example, if you are stuck on a coding problem, ask your mentor or other team members; but if you are stuck in a process, ask your host/co-host.</p>
<p>Here is a great article on <a target="_blank" href="https://jvns.ca/blog/good-questions/?utm_source=pocket_mylist">How to ask questions the smart way</a>.</p>
<h1 id="heading-podcast-w-aniket">Podcast w/ Aniket</h1>
<p>I was recently on a podcast where we went through some of these points in more depth. Watch the video here:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/0EB-np28BZU">https://youtu.be/0EB-np28BZU</a></div>
<p> </p>
<h1 id="heading-conclusion">Conclusion</h1>
<p>These are some of the things that I followed during my internship. I hope this article helps you prepare for yours. One thing you should also keep in mind is to enjoy the internship: it is a great opportunity to learn and grow, network with people, and build your resume. Don't burn yourself out before or during the internship. Take breaks, go out with your friends, play games, watch movies, etc. This will help you stay motivated and perform better.</p>
<p>If you have any questions, feel free to ask in the comments. I will try my best to answer them. Also, if you have any other advice that you would like to share, feel free to do so in the comments.</p>
]]></content:encoded></item><item><title><![CDATA[Step by Step learning Redis]]></title><description><![CDATA[In this blog, we will take a look at what Redis is, how it works, and how to use it.
Introduction

Redis stands for REmote DIctionary Server.
In simple words, Redis is an in-memory data structure store and is mainly popular for its very low-level lat...]]></description><link>https://arnabsen.dev/step-by-step-learning-redis</link><guid isPermaLink="true">https://arnabsen.dev/step-by-step-learning-redis</guid><category><![CDATA[Tutorial]]></category><category><![CDATA[Redis]]></category><category><![CDATA[Beginner Developers]]></category><category><![CDATA[Blogging]]></category><dc:creator><![CDATA[Arnab Sen]]></dc:creator><pubDate>Sat, 28 Jan 2023 12:22:10 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1674908437454/e1797651-6757-4ca6-8b14-7c128117c048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In this blog, we will take a look at what Redis is, how it works, and how to use it.</p>
<h2 id="heading-introduction">Introduction</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1674908134263/9c3989cd-36d0-428d-9022-8158fc4bec00.svg" alt="Redis Logo" /></p>
<p>Redis stands for <strong>RE</strong>mote <strong>DI</strong>ctionary <strong>S</strong>erver.</p>
<p>In simple words, Redis is an in-memory data structure store, popular mainly for its very low latency. Because it keeps data in memory, access times are far lower than those of on-disk databases. This makes it useful for applications that require fast data access and manipulation, such as real-time bidding systems or gaming servers.</p>
<p>Redis was created by <strong>Salvatore Sanfilippo</strong>, also known as <code>"antirez"</code>, in order to provide a fast, in-memory database that could be used for a variety of purposes. It has since become widely popular for its performance, reliability, and flexibility.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1674908212333/6bde5f21-f4e5-44f0-9e0f-7ef8592f6794.webp" alt class="image--right mx-auto mr-0" /></p>
<p>It is because of its low latency and high performance that Redis is used by a lot of major companies, such as:</p>
<ul>
<li><p>Twitter</p>
</li>
<li><p>Snapchat</p>
</li>
<li><p>GitHub</p>
</li>
<li><p>Pinterest</p>
</li>
<li><p>Craigslist</p>
</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1674908213031/a933d439-f1d5-49c2-957b-a47998decbc8.jpeg" alt="Companies that use Redis" /></p>
<h2 id="heading-how-does-redis-work">How does Redis work?</h2>
<p>Redis is a key-value store, which means that it stores data in the form of key-value pairs. The key is used to access the value, and the value can be of any type, such as a <code>string</code>, a <code>list</code>, a <code>set</code>, a <code>sorted set</code>, or a <code>hash</code>. These data types are supported natively by Redis and provide a rich set of commands for manipulating and querying the data.</p>
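<p>To get a feel for the key-value model, here is a toy Python sketch that mimics a few Redis commands (purely illustrative; real Redis is a server written in C, and this only imitates the semantics of <code>SET</code>, <code>GET</code>, <code>LPUSH</code>, and <code>LRANGE</code>):</p>

```python
class ToyRedis:
    """A tiny in-memory key-value store imitating a few Redis commands."""

    def __init__(self):
        self._data = {}  # key -> string or list

    def set(self, key, value):  # like: SET key value
        self._data[key] = value
        return "OK"

    def get(self, key):  # like: GET key
        return self._data.get(key)

    def lpush(self, key, value):  # like: LPUSH key value (prepends)
        self._data.setdefault(key, []).insert(0, value)
        return len(self._data[key])

    def lrange(self, key, start, stop):  # like: LRANGE key start stop
        items = self._data.get(key, [])
        if stop < 0:
            stop += len(items)  # Redis-style negative index: -1 is the last element
        return items[start:stop + 1]

store = ToyRedis()
store.set("name", "John")
print(store.get("name"))  # John
store.lpush("tasks", "write tests")
store.lpush("tasks", "fix bug")
print(store.lrange("tasks", 0, -1))  # ['fix bug', 'write tests']
```

<p>The real server does far more (persistence, replication, expiry), but the mental model of a dictionary mapping keys to typed values carries over directly.</p>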
<p>In addition to these data types, Redis can be extended with Redis Modules. Some of these modules are provided by Redis Labs, and some are provided by the community. Some of the modules provided by Redis Labs are:</p>
<ul>
<li><p><strong>RediSearch</strong>: This module provides full-text search capabilities for Redis, allowing users to index, search, and query text data stored in Redis similar to Elasticsearch.</p>
</li>
<li><p><strong>RedisGraph</strong>: This module provides graph database capabilities for Redis, allowing users to store, query, and manipulate graph data structures similar to Neo4j.</p>
</li>
<li><p><strong>RedisTimeSeries</strong>: This module provides time series data management capabilities for Redis, allowing users to store, query, and visualize time series data similar to InfluxDB.</p>
</li>
<li><p><strong>RedisJSON</strong>: This module provides native support for JSON data in Redis, allowing users to store, manipulate, and query JSON data using Redis commands similar to MongoDB.</p>
</li>
<li><p><strong>RedisBloom</strong>: This module provides probabilistic data structures for Redis, allowing users to store and query data using bloom filters, count-min sketches, and other data structures that can approximate data sets.</p>
</li>
</ul>
<p>These are just a few examples of the many Redis Modules that are available. There are many more modules that provide a wide range of additional functionality for Redis. For a complete list of available modules, you can visit the Redis Modules registry at the following URL: <a target="_blank" href="http://redis.io/resources/modules">redis.io/resources/modules</a>.</p>
<p>This makes Redis a <strong>multi-model database</strong>.</p>
<h2 id="heading-installing-redis">Installing Redis</h2>
<p>Redis can be installed on Linux, Windows, and macOS using the following steps:</p>
<h3 id="heading-linux">Linux</h3>
<p>Begin by updating your local package index with the latest package information:</p>
<pre><code class="lang-bash">curl -fsSL https://packages.redis.io/gpg | sudo gpg --dearmor -o /usr/share/keyrings/redis-archive-keyring.gpg

<span class="hljs-built_in">echo</span> <span class="hljs-string">"deb [signed-by=/usr/share/keyrings/redis-archive-keyring.gpg] https://packages.redis.io/deb <span class="hljs-subst">$(lsb_release -cs)</span> main"</span> | sudo tee /etc/apt/sources.list.d/redis.list

sudo apt-get update
sudo apt-get install redis
</code></pre>
<h3 id="heading-windows">Windows</h3>
<p>Redis is not officially supported on Windows. However, you can install Redis on Windows for development by following the instructions mentioned in the <a target="_blank" href="https://redis.io/docs/getting-started/installation/install-redis-on-windows/">website</a>.</p>
<h3 id="heading-macos">macOS</h3>
<p>Install the Redis server and command-line tools using Homebrew:</p>
<pre><code class="lang-bash">brew install redis
</code></pre>
<p>Once the installation is complete, you can start the Redis server using the following command:</p>
<pre><code class="lang-bash">brew services start redis
</code></pre>
<p>Starting Redis with <code>brew services</code> also registers it to launch automatically at login, so no separate step is needed. If you later want to stop the background service, run:</p>
<pre><code class="lang-bash">brew services stop redis
</code></pre>
<p>Once you have Redis installed, you can also start the server directly in the foreground by running the following command:</p>
<pre><code class="lang-bash">redis-server
</code></pre>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1674908266357/00d538ef-618a-4fe8-90cc-14d9580b0097.png" alt="redis-server command on terminal" /></p>
<h2 id="heading-using-redis">Using Redis</h2>
<p>Redis provides a command-line interface (CLI) that can be used to interact with the Redis server. You can start the Redis CLI using the following command:</p>
<pre><code class="lang-bash">redis-cli
</code></pre>
<h2 id="heading-basic-redis-commands">Basic Redis Commands</h2>
<p>You can set a key-value pair in Redis using the <code>SET</code> command. For example, to set the key <code>name</code> to the value <code>John</code>, you can use the following command:</p>
<pre><code class="lang-bash">SET name John
</code></pre>
<p>It will return <code>OK</code> if the operation is successful.</p>
<p>Now, to get the value of the key <code>name</code>, you can use the <code>GET</code> command:</p>
<pre><code class="lang-bash">GET name
</code></pre>
<p>It will return <code>John</code>.</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; <span class="hljs-keyword">SET</span> <span class="hljs-keyword">name</span> John
OK
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">GET</span> <span class="hljs-keyword">name</span>
<span class="hljs-string">"John"</span>
</code></pre>
<p>You can also delete a key using the <code>DEL</code> command:</p>
<pre><code class="lang-bash">DEL name
</code></pre>
<p>It will return <code>1</code> if the operation is successful. Now, if you try to get the value of the key <code>name</code>, it will return <code>(nil)</code>.</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; <span class="hljs-keyword">SET</span> <span class="hljs-keyword">name</span> John
OK
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">GET</span> <span class="hljs-keyword">name</span>
<span class="hljs-string">"John"</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; DEL <span class="hljs-keyword">name</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">1</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">GET</span> <span class="hljs-keyword">name</span>
(nil)
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt;
</code></pre>
<p>To get a list of all the keys in the database, you can use the <code>KEYS</code> command:</p>
<pre><code class="lang-bash">KEYS *
</code></pre>
<p>Here is what it looks like:</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; <span class="hljs-keyword">SET</span> fname John
OK
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">SET</span> lname Doe
OK
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">SET</span> age <span class="hljs-number">30</span>
OK
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">KEYS</span> *
<span class="hljs-number">1</span>) <span class="hljs-string">"lname"</span>
<span class="hljs-number">2</span>) <span class="hljs-string">"age"</span>
<span class="hljs-number">3</span>) <span class="hljs-string">"fname"</span>
</code></pre>
<p>To get the number of keys in the database, you can use the <code>DBSIZE</code> command:</p>
<pre><code class="lang-bash">DBSIZE
</code></pre>
<p>It will return <code>3</code> in this case.</p>
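<p>To make the behaviour of these basic commands concrete, here is a minimal Python sketch of the same key-value semantics, backed by a plain in-memory dictionary. It is purely illustrative: <code>ToyRedis</code> and its method names are made up for this sketch and are not part of any real Redis client API.</p>

```python
# Toy in-memory model of the SET/GET/DEL/KEYS/DBSIZE semantics shown
# above. Illustrative only -- a real client talks to a running
# redis-server instead of a local dict.
import fnmatch


class ToyRedis:
    def __init__(self):
        self._data = {}

    def set(self, key, value):
        self._data[key] = value
        return "OK"

    def get(self, key):
        return self._data.get(key)  # None mirrors the (nil) reply

    def delete(self, key):
        # DEL returns the number of keys actually removed
        return 1 if self._data.pop(key, None) is not None else 0

    def keys(self, pattern="*"):
        # KEYS supports glob-style patterns; "*" matches everything
        return [k for k in self._data if fnmatch.fnmatch(k, pattern)]

    def dbsize(self):
        return len(self._data)


r = ToyRedis()
r.set("fname", "John")
r.set("lname", "Doe")
r.set("age", "30")
print(r.get("fname"))    # John
print(sorted(r.keys()))  # ['age', 'fname', 'lname']
print(r.dbsize())        # 3
```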
<h2 id="heading-setting-expiration-time">Setting Expiration Time</h2>
<p>The expiration time of a key refers to the amount of time that the key will remain in the database before it is automatically deleted. We can set the expiration time of a key using the <code>EXPIRE</code> command. For example, to set the expiration time of the key <code>name</code> to 10 seconds, you can use the following command:</p>
<pre><code class="lang-bash">EXPIRE name 10
</code></pre>
<p>It will return <code>1</code> if the timeout was set successfully. After 10 seconds the key is deleted automatically, so trying to get the value of the key <code>name</code> will then return <code>(nil)</code>.</p>
<p>To find out the remaining time before the key expires, you can use the <code>TTL</code> command:</p>
<pre><code class="lang-bash">TTL name
</code></pre>
<p>It will return <code>-2</code> if the key does not exist, <code>-1</code> if the key exists but has no expiration time, and otherwise the number of seconds remaining before the key expires.</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; <span class="hljs-keyword">SET</span> <span class="hljs-keyword">id</span> <span class="hljs-number">1001</span>
OK
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">GET</span> <span class="hljs-keyword">id</span>
<span class="hljs-string">"1001"</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">EXPIRE</span> <span class="hljs-keyword">id</span> <span class="hljs-number">10</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">1</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; TTL <span class="hljs-keyword">id</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">7</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; TTL <span class="hljs-keyword">id</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">6</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; TTL <span class="hljs-keyword">id</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">4</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; TTL <span class="hljs-keyword">id</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">3</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; TTL <span class="hljs-keyword">id</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">1</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; TTL <span class="hljs-keyword">id</span>
(<span class="hljs-built_in">integer</span>) <span class="hljs-number">-2</span>
<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">6379</span>&gt; <span class="hljs-keyword">GET</span> <span class="hljs-keyword">id</span>
(nil)
</code></pre>
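<p>The <code>EXPIRE</code>/<code>TTL</code> return-value rules above can be sketched in a few lines of Python. Again this is a toy model for illustration only; <code>ToyExpiry</code> is a made-up name, not a real client API.</p>

```python
# Sketch of the EXPIRE/TTL semantics described above, using a
# monotonic clock and absolute expiry timestamps. Illustrative only.
import time


class ToyExpiry:
    def __init__(self):
        self._data = {}    # key -> value
        self._expiry = {}  # key -> absolute expiry timestamp

    def set(self, key, value):
        self._data[key] = value
        self._expiry.pop(key, None)  # SET clears any existing TTL
        return "OK"

    def _purge(self, key):
        # Lazily delete the key once its deadline has passed
        deadline = self._expiry.get(key)
        if deadline is not None and time.monotonic() >= deadline:
            self._data.pop(key, None)
            self._expiry.pop(key, None)

    def expire(self, key, seconds):
        self._purge(key)
        if key not in self._data:
            return 0  # EXPIRE on a missing key returns 0
        self._expiry[key] = time.monotonic() + seconds
        return 1

    def ttl(self, key):
        self._purge(key)
        if key not in self._data:
            return -2  # key does not exist
        if key not in self._expiry:
            return -1  # key exists but has no expiration
        return round(self._expiry[key] - time.monotonic())

    def get(self, key):
        self._purge(key)
        return self._data.get(key)
```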
<h2 id="heading-handling-lists">Handling Lists</h2>
<p>Redis provides a data structure called a list that can be used to store a collection of strings. You can create a list using the <code>RPUSH</code> command. For example, to create a list named <code>colors</code> and add the values <code>red</code>, <code>green</code>, and <code>blue</code> to it, you can use the following command:</p>
<pre><code class="lang-bash">RPUSH colors red green blue
</code></pre>
<p>Here <code>RPUSH</code> refers to the right push operation: it appends the values to the tail of the list. It will return <code>3</code>, the length of the list after the push, if the operation is successful.</p>
<blockquote>
<p><strong>Note:</strong> Like <code>RPUSH</code> there are other commands like <code>LPUSH</code> (left push), <code>LPOP</code> (left pop), and <code>RPOP</code> (right pop) that can be used to push and pop elements from the list.</p>
</blockquote>
<p>You cannot read the elements of a list with <code>GET</code>; that command only works on string values. Since the key holds a list, you have to use the <code>LRANGE</code> command:</p>
<pre><code class="lang-bash">LRANGE colors 0 -1
</code></pre>
<p>Here <code>0</code> is the starting index and <code>-1</code> is the ending index (both inclusive). Negative indices count from the end of the list, so <code>-1</code> means the last element; this call therefore returns all the elements in the list.</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; RPUSH colors red green blue
(integer) 3
127.0.0.1:6379&gt; GET colors
(error) WRONGTYPE Operation against a key holding the wrong kind of value
127.0.0.1:6379&gt; LRANGE colors 0 -1
1) "red"
2) "green"
3) "blue"
127.0.0.1:6379&gt;
</code></pre>
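<p>The push/range behaviour above, including the inclusive end index and negative offsets, can be modelled in a short Python sketch. <code>ToyList</code> is an illustrative stand-in, not a real Redis client API.</p>

```python
# Toy model of RPUSH/LPUSH/LRANGE semantics. Note that Redis's LRANGE
# end index is inclusive, unlike a Python slice. Illustrative only.
class ToyList:
    def __init__(self):
        self._items = []

    def rpush(self, *values):
        self._items.extend(values)          # append at the tail
        return len(self._items)             # returns the new list length

    def lpush(self, *values):
        for v in values:
            self._items.insert(0, v)        # prepend at the head
        return len(self._items)

    def lrange(self, start, stop):
        n = len(self._items)
        if start < 0:
            start = max(n + start, 0)       # negative = from the end
        if stop < 0:
            stop = n + stop
        return self._items[start:stop + 1]  # inclusive end index


colors = ToyList()
print(colors.rpush("red", "green", "blue"))  # 3
print(colors.lrange(0, -1))                  # ['red', 'green', 'blue']
```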
<h2 id="heading-handling-sets">Handling Sets</h2>
<p>Sets are similar to lists, but they do not allow duplicate elements. You can create a set using the <code>SADD</code> command. For example, to create a set named <code>employee_ids</code> and add the values <code>1001</code>, <code>1002</code>, and <code>1003</code> to it, you can use the following command:</p>
<pre><code class="lang-bash">SADD employee_ids 1001 1002 1003
</code></pre>
<p>To get all the elements in the set, you can use the <code>SMEMBERS</code> command:</p>
<pre><code class="lang-bash">SMEMBERS employee_ids
</code></pre>
<p>If you try to add an element that already exists in the set, it is ignored. For example, adding the value <code>1001</code> to <code>employee_ids</code> again returns <code>0</code> and leaves the set unchanged.</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; SADD employee_ids 1001 1002 1003
(integer) 3
127.0.0.1:6379&gt; SMEMBERS employee_ids
1) "1001"
2) "1002"
3) "1003"
127.0.0.1:6379&gt; SADD employee_ids 1001
(integer) 0 
127.0.0.1:6379&gt; SMEMBERS employee_ids
1) "1001"
2) "1002"
3) "1003"
</code></pre>
<p>To check whether an element exists in a set, you can use the <code>SISMEMBER</code> command; it returns <code>1</code> if the element is a member and <code>0</code> otherwise:</p>
<pre><code class="lang-bash">SISMEMBER employee_ids 1001
</code></pre>
<pre><code class="lang-sql">127.0.0.1:6379&gt; SISMEMBER employee_ids 1002
(integer) 1
127.0.0.1:6379&gt; SISMEMBER employee_ids 1005
(integer) 0
</code></pre>
<p>To remove an element from the set, you can use the <code>SREM</code> command:</p>
<pre><code class="lang-bash">SREM employee_ids 1001
</code></pre>
<pre><code class="lang-sql">127.0.0.1:6379&gt; SMEMBERS employee_ids
1) "1001"
2) "1002"
3) "1003"
127.0.0.1:6379&gt; SREM employee_ids 1001
(integer) 1
127.0.0.1:6379&gt; SMEMBERS employee_ids
1) "1002"
2) "1003"
</code></pre>
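<p>The set command return values shown above map naturally onto a Python <code>set</code>. The following sketch is illustrative only; <code>ToySet</code> is a made-up name, not a real client API.</p>

```python
# Toy model of SADD/SISMEMBER/SREM/SMEMBERS semantics, backed by a
# plain Python set. Illustrative only.
class ToySet:
    def __init__(self):
        self._members = set()

    def sadd(self, *values):
        added = 0
        for v in values:
            if v not in self._members:
                self._members.add(v)
                added += 1
        return added  # count of NEW members, so duplicates yield 0

    def sismember(self, value):
        return 1 if value in self._members else 0

    def srem(self, *values):
        removed = 0
        for v in values:
            if v in self._members:
                self._members.discard(v)
                removed += 1
        return removed  # count of members actually removed

    def smembers(self):
        return set(self._members)
```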
<h2 id="heading-handling-hashes">Handling Hashes</h2>
<p>A hash is itself a collection of field-value pairs stored under a single key, so using hashes in Redis is like nesting key-value pairs inside a key-value pair.</p>
<p>To create a hash, you can use the <code>HSET</code> command. For example, to create a hash named <code>employee</code> and add the values <code>id</code>, <code>fname</code>, and <code>lname</code> to it, you can use the following command:</p>
<pre><code class="lang-bash">HSET employee id 1001 fname John lname Doe
</code></pre>
<p>And to get the value of a field in the hash, you can use the <code>HGET</code> command:</p>
<pre><code class="lang-bash">HGET employee id
</code></pre>
<pre><code class="lang-sql">127.0.0.1:6379&gt; HSET employee id 1001 fname John lname Doe
(integer) 3
127.0.0.1:6379&gt; HGET employee id
"1001"
127.0.0.1:6379&gt; HGET employee fname
"John"
127.0.0.1:6379&gt; HGET employee lname
"Doe"
</code></pre>
<p>To get all the fields in the hash, you can use the <code>HKEYS</code> command:</p>
<pre><code class="lang-bash">HKEYS employee
</code></pre>
<p>To get all the values in the hash, you can use the <code>HVALS</code> command:</p>
<pre><code class="lang-bash">HVALS employee
</code></pre>
<p>To get everything in the hash, you can use the <code>HGETALL</code> command:</p>
<pre><code class="lang-bash">HGETALL employee
</code></pre>
<pre><code class="lang-sql">127.0.0.1:6379&gt; HGETALL employee
1) "id"
2) "1001"
3) "fname"
4) "John"
5) "lname"
6) "Doe"
127.0.0.1:6379&gt; HKEYS employee
1) "id"
2) "fname"
3) "lname"
127.0.0.1:6379&gt; HVALS employee
1) "1001"
2) "John"
3) "Doe"
</code></pre>
<p>To delete a field from the hash, you can use the <code>HDEL</code> command:</p>
<pre><code class="lang-plaintext">HDEL employee id
</code></pre>
<p>To check if a field exists in the hash, you can use the <code>HEXISTS</code> command:</p>
<pre><code class="lang-bash">HEXISTS employee id
</code></pre>
<pre><code class="lang-sql">127.0.0.1:6379&gt; HEXISTS employee id
(integer) 1
127.0.0.1:6379&gt; HDEL employee id
(integer) 1
127.0.0.1:6379&gt; HEXISTS employee id
(integer) 0
</code></pre>
<p>Initially the field <code>id</code> exists in the hash <code>employee</code>, so <code>HEXISTS</code> returns <code>1</code>. After deleting it, the field no longer exists and the command returns <code>0</code>.</p>
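<p>The hash commands in this section can be summarised with one more toy sketch: a dictionary of fields stored under one key. As before, <code>ToyHash</code> and its methods are illustrative names only, not a real client API, and <code>hgetall</code> here returns a dict where the CLI shows fields and values interleaved.</p>

```python
# Toy model of HSET/HGET/HKEYS/HVALS/HGETALL/HDEL/HEXISTS semantics.
# Illustrative only.
class ToyHash:
    def __init__(self):
        self._fields = {}

    def hset(self, *pairs):
        # HSET takes field/value pairs and returns the number of NEW
        # fields created (existing fields are overwritten, not counted)
        new = 0
        for field, value in zip(pairs[::2], pairs[1::2]):
            if field not in self._fields:
                new += 1
            self._fields[field] = value
        return new

    def hget(self, field):
        return self._fields.get(field)

    def hkeys(self):
        return list(self._fields)

    def hvals(self):
        return list(self._fields.values())

    def hgetall(self):
        return dict(self._fields)

    def hdel(self, field):
        return 1 if self._fields.pop(field, None) is not None else 0

    def hexists(self, field):
        return 1 if field in self._fields else 0
```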
<p>Another very useful command is <code>FLUSHALL</code>. It deletes all the keys in every database on the server (use <code>FLUSHDB</code> if you only want to clear the current database), so be careful with it outside of development:</p>
<pre><code class="lang-sql">127.0.0.1:6379&gt; KEYS *
1) "name"
2) "age"
3) "employee"
127.0.0.1:6379&gt; flushall
OK
127.0.0.1:6379&gt; KEYS *
(empty array)
</code></pre>
<h2 id="heading-conclusion">Conclusion</h2>
<ul>
<li><p>Redis is a popular and robust data management tool that is widely used by organizations for a variety of purposes.</p>
</li>
<li><p>Its key features and benefits include high performance, flexibility, and support for a wide range of data structures.</p>
</li>
<li><p>Once you have installed it, you can use a variety of basic Redis commands, such as <code>SET</code>, <code>GET</code>, <code>DEL</code>, and <code>KEYS</code>, to manage and manipulate data in Redis. By understanding and using these commands effectively, you can unlock the full potential of Redis to support your specific use cases.</p>
</li>
</ul>
<p>I would encourage you to experiment with Redis and explore its capabilities to gain a deeper understanding of how it can help your organization.</p>
<p>If you like this article, please share it with your friends and colleagues who are interested in Redis. If you have any questions or comments, please feel free to leave them below. I will be happy to answer them.</p>
<p>My Website: <a target="_blank" href="http://arnabsen.dev">arnabsen.dev</a></p>
<p>My Twitter: <a target="_blank" href="https://twitter.com/ArnabSen1729">@ArnabSen1729</a></p>
]]></content:encoded></item></channel></rss>