Data Storage And Compression (Copy)

Bit (b)
- Smallest unit of data in computing.
- Represents a single binary digit: 0 or 1.
- All larger units are multiples of bits.
Nibble
- Group of 4 bits.
- Can represent 2⁴ = 16 different values (0–15 in denary, 0–F in hex).
Byte (B)
- Group of 8 bits.
- Commonly used to store a single character in ASCII encoding.

Unit	Symbol	Value in bytes	Example equivalent
Kibibyte	KiB	1024 bytes	Small text file
Mebibyte	MiB	1024 KiB	Image or short audio
Gibibyte	GiB	1024 MiB	Video file
Tebibyte	TiB	1024 GiB	Hard drive
Pebibyte	PiB	1024 TiB	Data center storage
Exbibyte	EiB	1024 PiB	Large-scale backups

Formula for uncompressed image file size (in bits):
File size = Resolution (width × height) × Colour depth
- Resolution = total number of pixels.
- Colour depth = bits per pixel.
To convert bits → bytes: divide by 8.
To convert bytes → KiB/MiB: divide by 1024 accordingly.

Example:
Image size: 1920×1080 pixels, colour depth 24-bit.

Formula for uncompressed audio file size (in bits):
File size = Sample rate × Sample resolution × Duration (seconds) × Number of channels
Sample rate = number of samples per second (Hz).
Sample resolution = bits per sample.
Channels = 1 (mono) or 2 (stereo).

Example:
Stereo audio, 44,100 Hz, 16-bit, 5 seconds:

Purpose: Reduce file size to:
- Save storage space.
- Reduce transmission time over networks.
- Reduce bandwidth usage.
- Make files easier to send via email or upload/download.
Impact:
- Smaller files → faster downloads/uploads.
- Less data storage required on devices and servers.
- May affect quality (depending on method).

Definition: Reduces file size without losing any original data.
When decompressed, file is identical to original.
Example techniques:
- Run-Length Encoding (RLE): Stores sequences of repeated values as a single value and count.
  - Example: AAAAABBBCC → 5A3B2C.
- Huffman Coding: Assigns shorter binary codes to frequently used symbols and longer codes to less frequent ones.
Uses:
- Text documents (where accuracy is critical).
- Program files.
- Some image formats (PNG, GIF).

Definition: Reduces file size by permanently removing some data, often unnoticeable to human perception.
Common methods:
- Lower image resolution or colour depth.
- Lower audio sample rate or bit depth.
- Remove high-frequency sounds or visual details.
Uses:
- JPEG images.
- MP3 audio.
- MPEG video.
Advantages:
- Much smaller file sizes than lossless.
Disadvantages:
- Some quality is permanently lost.
- Not suitable where exact reproduction is required.

Want To Teach Online?