Practice Code Typing – SkillsTyping

Choose your language:

ℹ️ Select 'Choose Exercise', or randomize 'Next Random Exercise' in selected language.

Choose Exercise:

Ready

Exercise Algorithm Area

Correct typos to continue.

1;; Optimized Matrix Multiplication Kernel in WAT
2;; Computes C = A * B
3;; Assumes matrices are stored in row-major order.
4
5(func $matrix_multiply (param $a_ptr i32) (param $b_ptr i32) (param $c_ptr i32) (param $rows_a i32) (param $cols_a i32) (param $cols_b i32)
6;; $rows_a: Number of rows in matrix A and C
7;; $cols_a: Number of columns in matrix A and rows in matrix B
8;; $cols_b: Number of columns in matrix B and C
9
10(local $i i32) ;; Row index for matrix C (and A)
11(local $j i32) ;; Column index for matrix C (and B)
12(local $k i32) ;; Inner loop index (columns of A, rows of B)
13(local $sum i32) ;; Accumulator for C[i][j]
14(local $a_val i32) ;; Value from matrix A
15(local $b_val i32) ;; Value from matrix B
16(local $c_idx i32) ;; Index for matrix C
17(local $a_idx i32) ;; Index for matrix A
18(local $b_idx i32) ;; Index for matrix B
19
20;; Dimension compatibility check
21(if (i32.ne $cols_a $rows_b) ;; This check is implicit in the parameters, but good to be explicit
22(then (return (i32.const -1))) ;; Error: Incompatible dimensions
23)
24
25;; Initialize C to zeros
26(set_local $c_idx (i32.const 0))
27(loop $init_c_loop
28(if (i32.ge $c_idx (i32.mul (local.get $rows_a) (local.get $cols_b))) (br $end_init_c_loop))
29(memory.store (i32.add $c_ptr (i32.mul $c_idx (i32.const 4))) (i32.const 0))
30(set_local $c_idx (i32.add $c_idx (i32.const 1)))
31(br $init_c_loop)
32)
33(label $end_init_c_loop)
34
35;; Main matrix multiplication loops
36(set_local $i (i32.const 0))
37(loop $row_loop
38(if (i32.ge $i $rows_a) (br $end_row_loop))
39
40(set_local $j (i32.const 0))
41(loop $col_loop
42(if (i32.ge $j $cols_b) (br $end_col_loop))
43
44(set_local $sum (i32.const 0))
45(set_local $k (i32.const 0))
46(loop $inner_loop
47(if (i32.ge $k $cols_a) (br $end_inner_loop))
48
49;; Calculate indices for A[i][k] and B[k][j]
50;; A[i][k] = a_ptr + (i * cols_a + k) * 4
51;; B[k][j] = b_ptr + (k * cols_b + j) * 4
52(set_local $a_idx (i32.add (i32.mul $i $cols_a) $k))
53(set_local $b_idx (i32.add (i32.mul $k $cols_b) $j))
54
55;; Load values
56(set_local $a_val (memory.load (i32.add $a_ptr (i32.mul $a_idx (i32.const 4)))))
57(set_local $b_val (memory.load (i32.add $b_ptr (i32.mul $b_idx (i32.const 4)))))
58
59;; Accumulate sum
60(set_local $sum (i32.add $sum (i32.mul $a_val $b_val)))
61
62(set_local $k (i32.add $k (i32.const 1)))
63(br $inner_loop)
64)
65(label $end_inner_loop)
66
67;; Store the result C[i][j] = sum
68;; C[i][j] = c_ptr + (i * cols_b + j) * 4
69(set_local $c_idx (i32.add (i32.mul $i $cols_b) $j))
70(memory.store (i32.add $c_ptr (i32.mul $c_idx (i32.const 4)))) $sum
71
72(set_local $j (i32.add $j (i32.const 1)))
73(br $col_loop)
74)
75(label $end_col_loop)
76
77(set_local $i (i32.add $i (i32.const 1)))
78(br $row_loop)
79)
80(label $end_row_loop)
81
82(return (i32.const 0)) ;; Success
83)

Algorithm description viewbox

WAT Optimized Matrix Multiplication Kernel

Algorithm description:

This WAT program implements an optimized matrix multiplication routine. It calculates the product of two matrices, C = A * B, where matrices are stored in row-major order. The implementation focuses on optimizing the innermost loop, which performs the dot product of a row from A and a column from B, to maximize performance. This is a fundamental operation in scientific computing, machine learning, and graphics.

Algorithm explanation:

The `matrix_multiply` function computes the product of two matrices A (m x n) and B (n x p) to produce matrix C (m x p). It iterates through each element C[i][j] of the result matrix. For each element, it computes the dot product of the i-th row of A and the j-th column of B. The inner loop (over k) accumulates the sum of products A[i][k] * B[k][j]. Indices are carefully calculated to access elements in row-major order. The time complexity is O(m * n * p), which is inherent to matrix multiplication. Space complexity is O(1) beyond the storage for the matrices themselves. Edge cases include checking for compatible dimensions (columns of A must equal rows of B) and ensuring the result matrix C is initialized to zeros before accumulation. The optimization lies in minimizing memory loads and maximizing arithmetic operations within the tight inner loop.

Pseudocode:

function matrix_multiply(A, B, C):
  rows_A = number of rows in A
  cols_A = number of columns in A
  rows_B = number of rows in B
  cols_B = number of columns in B

  if cols_A is not equal to rows_B:
    return error (incompatible dimensions)

  initialize matrix C with zeros (rows_A x cols_B)

  for i from 0 to rows_A - 1:
    for j from 0 to cols_B - 1:
      sum = 0
      for k from 0 to cols_A - 1:
        sum = sum + A[i][k] * B[k][j]
      C[i][j] = sum

  return success

Library

Playable exercises

Choose Exercise

Filter, sort, and preview algorithm before you choose one to play.

Total 5 published items

Pages 1 pages available

Language

Sort by Direction

Browse published coding scenarios with filters, sorting, and pagination.

Published

WAT Specialist (wat)

WAT Loop Unrolling for Array Summation

Difficulty: writing: 1; length: 9
Popularity: 0
Created: 2026-01-26 13:09 UTC
Published: 2026-01-26 13:42 UTC
Author: Admin

#wat #loop unrolling #numeric kernels #optimization #linear memory #algorithm

goal: Sum elements of an array using loop unrolling. input: Pointer to an array of i32s and its length. output: The sum of the array elements as an i32.

Canonical Algorithm Text

;; Array Summation with Loop Unrolling (Factor of 4) in WAT ;; Computes the sum of elements in an i32 array. (func $sum_array_unrolled (param $array_ptr i32) (param $array_len i32) (result i32) (local $sum i32) ;; Accumulator for the sum (local $i i32) ;; Loop counter (local $unr...

Description Algorithm

This WAT program demonstrates loop unrolling for array summation. Instead of processing one element per iteration, the loop is unrolled by a factor of 4, meaning it processes four elements in each iteration. This reduces...

Explanation

The `sum_array_unrolled` function calculates the sum of elements in an i32 array. It first determines how many full groups of 4 elements can be processed and how many remain. The main loop iterates `unrolled_len` times, but in each iteration, it loads and adds four elements at once. This reduces the number of loop control instructions (increments, comparisons, branches). After the unrolled loop, a separate loop handles the `remainder` elements. The time complexity remains O(N), but the constant factor is reduced due to fewer loop overheads, potentially leading to faster execution. Space complexity is O(1). Edge cases include empty arrays and arrays whose lengths are not multiples of the unrolling factor (4). The `create_test_array` helper is for demonstration.

Pseudocode

function sum_array_unrolled(array, length): sum = 0 i = 0 unrolled_length = floor(length / 4) * 4 remainder = length - unrolled_length // Unrolled loop (process 4 elements at a time) while i < unrolled_length: sum = sum...

Published

WAT Specialist (wat)

WAT Custom Allocator for Fixed-Size Chunks

Difficulty: writing: 6; length: 10
Popularity: 0
Created: 2026-01-26 13:09 UTC
Published: 2026-01-26 13:42 UTC
Author: Admin

#wat #custom allocators #linear memory #linked list #memory management #algorithm

goal: Manage a pool of fixed-size memory chunks. input: None for initialization; pointer to chunk for deallocation. output: Pointer to allocated chunk or -1 for allocation failure; no return for deallocation.

Canonical Algorithm Text

;; Custom Allocator for Fixed-Size Chunks in WAT ;; Manages a pool of memory divided into fixed-size blocks. ;; Uses a free list to track available chunks. (module (memory 1) ;; 64KB initial memory (global $HEAP_START (mut i32) (i32.const 0)) (global $TOTAL_MEMORY (i32) (i32.cons...

Description Algorithm

This WAT program implements a custom memory allocator that manages a pool of fixed-size memory chunks. It uses a free list, implemented as a linked list within the memory pool itself, to keep track of available chunks. T...

Explanation

The custom allocator manages a contiguous block of memory divided into fixed-size chunks. A global variable, `$free_list_head`, points to the first available chunk. Each chunk, when not in use, contains a pointer (at a fixed offset, e.g., 4 bytes) to the next free chunk, forming a linked list. When `allocate_chunk` is called, it takes the chunk pointed to by `$free_list_head`, updates `$free_list_head` to point to the next chunk in the list, and returns the pointer to the allocated chunk. If `$free_list_head` is -1, the pool is full. `deallocate_chunk` takes a pointer, adds it to the front of the free list by updating its 'next' pointer and then setting `$free_list_head` to the deallocated chunk's pointer. It includes checks for invalid pointers and attempts to deallocate already freed chunks. The time complexity for allocation and deallocation is O(1) on average, assuming no need to check for already freed chunks. Space complexity is O(N) for the memory pool, where N is the number of chunks.

Pseudocode

initialize memory pool with fixed-size chunks initialize free_list_head to point to the first chunk for each chunk (except the last): set its 'next' pointer to the address of the next chunk set the last chunk's 'next' po...

Published

WAT Specialist (wat)

WAT Optimized Matrix Multiplication Kernel

Difficulty: writing: 4; length: 7
Popularity: 0
Created: 2026-01-26 13:09 UTC
Published: 2026-01-26 13:42 UTC
Author: Admin

#wat #matrix multiplication #numeric kernels #optimization #linear memory #algorithm

goal: Perform matrix multiplication C = A * B. input: Pointers to matrices A and B, pointer to matrix C (result), dimensions of A and B. output: 0 for success, -1 for dimension mismatch.

Canonical Algorithm Text

;; Optimized Matrix Multiplication Kernel in WAT ;; Computes C = A * B ;; Assumes matrices are stored in row-major order. (func $matrix_multiply (param $a_ptr i32) (param $b_ptr i32) (param $c_ptr i32) (param $rows_a i32) (param $cols_a i32) (param $cols_b i32) ;; $rows_a: Number...

Description Algorithm

This WAT program implements an optimized matrix multiplication routine. It calculates the product of two matrices, C = A * B, where matrices are stored in row-major order. The implementation focuses on optimizing the inn...

Explanation

The `matrix_multiply` function computes the product of two matrices A (m x n) and B (n x p) to produce matrix C (m x p). It iterates through each element C[i][j] of the result matrix. For each element, it computes the dot product of the i-th row of A and the j-th column of B. The inner loop (over k) accumulates the sum of products A[i][k] * B[k][j]. Indices are carefully calculated to access elements in row-major order. The time complexity is O(m * n * p), which is inherent to matrix multiplication. Space complexity is O(1) beyond the storage for the matrices themselves. Edge cases include checking for compatible dimensions (columns of A must equal rows of B) and ensuring the result matrix C is initialized to zeros before accumulation. The optimization lies in minimizing memory loads and maximizing arithmetic operations within the tight inner loop.

Pseudocode

function matrix_multiply(A, B, C): rows_A = number of rows in A cols_A = number of columns in A rows_B = number of rows in B cols_B = number of columns in B if cols_A is not equal to rows_B: return error (incompatible di...

Published

WAT Specialist (wat)

WAT Stack-Based Expression Evaluator with Operator Precedence

Difficulty: writing: 1; length: 10
Popularity: 0
Created: 2026-01-26 13:09 UTC
Published: 2026-01-26 13:42 UTC
Author: Admin

#wat #stack #postfix #expression evaluation #linear memory #algorithm #compiler

goal: Evaluate a postfix mathematical expression. input: A pointer to a null-terminated string representing a postfix expression (e.g., "3 4 + 5 * "). output: An i32 representing the result of the expression, or a negative error code.

Canonical Algorithm Text

;; Function to evaluate a postfix expression using a stack ;; Supports +, -, *, /, and parentheses (though parentheses are handled by the postfix conversion, not directly here). ;; Assumes input is a string of tokens separated by spaces. (func $evaluate_postfix (param $expression...

Description Algorithm

This WAT program implements a stack-based evaluator for postfix mathematical expressions. It parses a string representing a postfix expression, tokenizes it, and uses a stack to perform calculations. Numbers are pushed o...

Explanation

The `evaluate_postfix` function processes a postfix expression string. It iterates through tokens, pushing numbers onto a simulated stack and applying operators to popped operands. The stack is implemented using linear memory, with `stack_ptr` and `stack_top` managing its state. Operator handling involves popping two operands, performing the operation (addition, subtraction, multiplication, division), and pushing the result. Division by zero is explicitly checked. The algorithm's time complexity is O(N), where N is the number of tokens, as each token is processed once. Space complexity is O(N) in the worst case for the stack, though typically much less for balanced expressions. Edge cases include empty expressions, single-number expressions, invalid tokens, stack underflow/overflow, and division by zero. The correctness relies on the property that in postfix notation, operands appear before their operators, allowing for direct stack manipulation.

Pseudocode

function evaluate_postfix(expression_string): initialize stack set stack_top to -1 pointer = start of expression_string while pointer is not end of string: get next token and its length if token is empty, break loop if t...

Published

WAT Specialist (wat)

WAT Binary Search for Element in Sorted Array

Difficulty: writing: 3; length: 10
Popularity: 0
Created: 2026-01-26 13:09 UTC
Published: 2026-01-26 13:41 UTC
Author: Admin

#wat #binary search #sorting kernels #search algorithm #linear memory #algorithm #edge cases

goal: Find the index of a target value in a sorted array. input: Pointer to a sorted array of i32s, the length of the array, and the target i32 value. output: The index of the target value if found, otherwise -1.

Canonical Algorithm Text

;; Binary Search Implementation in WAT ;; Searches for a target value in a sorted array of i32s. ;; Returns the index of the target if found, otherwise -1. (func $binary_search (param $array_ptr i32) (param $array_len i32) (param $target i32) (result i32) (local $low i32) ;; Lowe...

Description Algorithm

This WAT program implements the binary search algorithm to efficiently find a target value within a sorted array. It repeatedly divides the search interval in half. If the value of the search key is less than the item in...

Explanation

The `binary_search` function takes a pointer to a sorted array of i32s, its length, and the target value. It initializes `low` to 0 and `high` to `length - 1`. The core loop continues as long as `low <= high`. In each iteration, it calculates the middle index `mid`. It then compares the value at `array[mid]` with the `target`. If they match, `mid` is returned. If `array[mid]` is less than `target`, the search continues in the right half by setting `low = mid + 1`. If `array[mid]` is greater than `target`, the search continues in the left half by setting `high = mid - 1`. The loop invariant is that the target, if present, must lie within the inclusive range `[low, high]`. If the loop terminates without finding the target (i.e., `low > high`), -1 is returned. Time complexity is O(log N), and space complexity is O(1). Edge cases include empty arrays, target not present, and targets at the array's boundaries.

Pseudocode

function binary_search(array, length, target): if length is 0: return -1 low = 0 high = length - 1 while low <= high: mid = low + (high - low) / 2 mid_value = array[mid] if mid_value == target: return mid else if mid_val...

Language (required)

Paste your code (required)

Max 20,000 characters. (Dangerous HTML/script fragments will be blocked)

Normalized preview:

I agree to Publish this, after admin approval (queued for review). (Not required) I confirm this submission is anonymous and does not contain personal data. I grant SkillsTyping a free, non-exclusive, worldwide, and irrevocable license to store, modify, adapt, and publish this content after review. I understand that publication is not guaranteed, that I retain no ownership or usage claims over published versions, and that no compensation will be provided. See Privacy Policy

Publish details

Shown only if you opt to submit for review. Required when consent is checked.

Title (required with consent)

Short description / summary (required with consent)

Algorithm explanation (required with consent)

Pseudocode (required with consent)