Abstract Datatypes and Data Structures: Stacks, Queues, ...

(抽象データ型とデータ構造、スタック、キューなど)

Data Structures and Algorithms

4th lecture, October 11, 2018

http://www.sw.it.aoyama.ac.jp/2018/DA/lecture4.html

Martin J. Dürst

AGU

© 2009-18 Martin J. Dürst 青山学院大学

 

Today's Schedule

 

Summary of Last Lecture

The asymptotic growth (order of growth) of a function and the time (and space) complexity of an algorithm can be expressed with the Big-O/Ω/Θ notation:

f(n) ∈ O(g(n)) ⇔ ∃c>0: ∃n0≥0: ∀n≥n0: f(n) ≤ c·g(n)
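To make the definition concrete, the following sketch tests numerically whether a candidate pair of witnesses (c, n0) satisfies f(n) ≤ c·g(n) for all n from n0 up to some limit. The functions f and g, the constants, and the helper name `bounded_by?` are assumptions chosen for illustration. A finite check can refute a candidate pair, but it cannot prove the bound for all n.

```ruby
# Hypothetical example functions: f(n) = 3n^2 + 10n, g(n) = n^2.
f = ->(n) { 3 * n**2 + 10 * n }
g = ->(n) { n**2 }

# Checks f(n) <= c * g(n) for every n in n0..limit.
# This only tests a finite range; it is evidence, not a proof.
def bounded_by?(f, g, c:, n0:, limit: 10_000)
  (n0..limit).all? { |n| f.call(n) <= c * g.call(n) }
end

puts bounded_by?(f, g, c: 4, n0: 10)  # true: 10n <= n^2 holds from n = 10 on
puts bounded_by?(f, g, c: 4, n0: 1)   # false: the bound fails at n = 1
puts bounded_by?(f, g, c: 3, n0: 1)   # false: c = 3 is too small for any n0
```

Here c = 4 and n0 = 10 are one valid choice of witnesses, so f(n) ∈ O(n^2); many other pairs work as well.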

The order of growth of a function can be found by:

When using Big-O notation, always try to simplify g() as much as possible.

 

Last Week's Homework

(no need to submit)

Review this lecture's material every day!

On the Web, find algorithms with time complexity O(1), O(log n), O(n), O(n log n), O(n^2), O(n^3), O(2^n), O(n!), and so on.

  

Frequent Orders

Example of another order: O(n^2.373), the fastest known algorithm for matrix multiplication

 

Polynomial versus Exponential Growth

Example:

1.1^n ≶ n^20

log(1.1)·n ≶ log(n)·20

n/log10(n) ≶ 20/log10(1.1) ≊483.2

n0 ≊ 1541

Conclusion: For a, b > 1, a^n will always eventually grow faster than n^b

(n^b is polynomial, a^n is exponential)
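The crossover point in the example can be checked with a short search. This is a minimal sketch; the method name `crossover` is an assumption made for illustration. It compares logarithms, as in the derivation above, so that no huge numbers have to be computed.

```ruby
# Returns the smallest n >= 2 for which a**n > n**b, comparing logarithms
# (n * log(a) > b * log(n)) instead of the values themselves.
# The search starts at 2 because a**1 > 1**b holds trivially for a > 1.
def crossover(a, b)
  n = 2
  n += 1 until n * Math.log(a) > b * Math.log(n)
  n
end

puts crossover(1.1, 20)  # 1541
```

Even with a base as close to 1 as 1.1 and an exponent as large as 20, the exponential function overtakes the polynomial at a fairly small n.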

 

The Importance of Polynomial Time

[We will discuss this in more detail in lecture 14]

 

Finding the (Asymptotic) Time Complexity of an Algorithm

  1. Find/define the variables that determine the size of the input (e.g. n)
  2. Find the basic operations (steps) in the algorithm that are most frequently executed
  3. Express the total number of basic operations (steps) using summation or a recurrence relation
  4. Determine the time complexity expressed with big-O notation

Simplifications possible for big-O notation can be applied early.
Example: Because constant factors are irrelevant in big-O notation, they can be eliminated when counting steps.
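The four steps can be followed on a small example. In this sketch (the method name `count_pair_comparisons` is hypothetical), the input size is n, the basic operation is the body of the inner loop, and the total number of steps is the sum of (n − 1 − i) for i = 0, ..., n−1, which is n(n−1)/2. Dropping the constant factor and the lower-order term gives O(n^2).

```ruby
# Counts how often the basic operation runs when every pair of
# positions (i, j) with i < j in an array of size n is visited once.
def count_pair_comparisons(n)
  steps = 0
  (0...n).each do |i|
    ((i + 1)...n).each do |_j|
      steps += 1  # one basic operation per inner iteration
    end
  end
  steps
end

[10, 100, 1000].each do |n|
  closed_form = n * (n - 1) / 2  # value of the summation
  puts "n=#{n}: counted #{count_pair_comparisons(n)}, formula #{closed_form}"
end
```

Multiplying n by 10 multiplies the count by roughly 100, as expected for quadratic growth.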

 

How to Define Input Size Variables

 

How to Identify the Most Frequent Basic Operations

Caution: Some methods/functions may hide complexity (e.g. Ruby sort, ...)
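One way to see the hidden work is to count the comparisons that a single call to sort performs. This is a sketch; the method name `sort_counting_comparisons` and the fixed random seed are assumptions for illustration, and the exact count depends on the Ruby implementation.

```ruby
# Sorts data (without modifying it) and returns
# [sorted_array, number_of_comparisons].
def sort_counting_comparisons(data)
  comparisons = 0
  sorted = data.sort do |a, b|
    comparisons += 1  # count every comparison made inside sort
    a <=> b
  end
  [sorted, comparisons]
end

data = (1..1_000).to_a.shuffle(random: Random.new(42))
_sorted, comparisons = sort_counting_comparisons(data)
puts "1 call to sort, #{comparisons} comparisons for #{data.size} items"
```

A single line of code that calls sort therefore cannot be counted as one basic operation.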

 

Counting Basic Operations using Summation

 

Counting Basic Operations using Recurrence Relations

 

Recurrence Relations

 

Comparing the Execution Time of Algorithms

(from previous lectures)

Possible questions:

Conclusion: Expressing time complexity in O() notation lets us evaluate the essence of an algorithm, ignoring hardware and implementation differences.

 

Abstract Data Type (ADT)

 

Typical Examples of Abstract Data Types

 

Stack

Principle:
last-in-first-out (LIFO)
General example:
Stack of trays in cafeteria
Example from IT:
Function stack (local variables, return address, ...)
Main methods:
new, add/push, delete/pop
Other methods:
empty? (check whether the stack is empty or not)
top (return the topmost element without removing it from the stack)
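The methods listed above can be sketched with a Ruby array as the underlying data structure. The class name `ArrayStack` is hypothetical, and this is not necessarily how the lecture's 4ADTs.rb implements a stack.

```ruby
# Minimal array-backed stack (illustrative sketch).
class ArrayStack
  def initialize
    @items = []
  end

  # Adds e on top of the stack; returns the stack to allow chaining.
  def push(e)
    @items.push(e)
    self
  end

  # Removes and returns the topmost element.
  def pop
    raise "stack is empty" if empty?
    @items.pop
  end

  # Returns the topmost element without removing it.
  def top
    raise "stack is empty" if empty?
    @items.last
  end

  def empty?
    @items.empty?
  end
end

trays = ArrayStack.new
trays.push("tray 1").push("tray 2").push("tray 3")
puts trays.top     # tray 3
puts trays.pop     # tray 3 (last in, first out)
puts trays.pop     # tray 2
puts trays.empty?  # false, tray 1 is still on the stack
```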

 

Axioms for Stacks

It is possible to define a stack using the following four axioms:

  1. Stack.new.empty? ↔ true
  2. s.push(e).empty? ↔ false
  3. s.push(e).top ↔ e
  4. s.push(e).pop ↔ s (here, pop returns the new stack, not the top element)

(s is any arbitrary stack, e is any arbitrary data item)

Axioms can define a contract between implementation and users
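The axioms can be turned into executable checks. The sketch below assumes a stack whose push and pop return a new stack and leave the receiver unchanged, matching the note on axiom 4. The class name `ImmutableStack` and the helper `axioms_hold?` are hypothetical. Copying the array makes push and pop O(n) here, which is acceptable for a check but not for an efficient implementation.

```ruby
# Stack whose operations return new stacks (illustrative sketch).
class ImmutableStack
  def initialize(items = [])
    @items = items.dup.freeze
  end

  def empty?
    @items.empty?
  end

  def push(e)
    ImmutableStack.new(@items + [e])
  end

  def top
    raise "stack is empty" if empty?
    @items.last
  end

  # Returns the stack without its topmost element.
  def pop
    raise "stack is empty" if empty?
    ImmutableStack.new(@items[0...-1])
  end

  # Two stacks are equal if they hold equal items in the same order.
  def ==(other)
    other.is_a?(ImmutableStack) && @items == other.items
  end

  protected

  def items
    @items
  end
end

# Checks the four axioms for one stack s and one data item e.
def axioms_hold?(s, e)
  ImmutableStack.new.empty? == true &&
    s.push(e).empty? == false &&
    s.push(e).top == e &&
    s.push(e).pop == s
end

s = ImmutableStack.new.push(1).push(2)
puts axioms_hold?(s, 3)                       # true
puts axioms_hold?(ImmutableStack.new, "x")    # true
```

Such checks only cover the sample values tried; the axioms themselves are stated for every stack s and every item e.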

 

Queue

Principle:
first-in-first-out (FIFO)
General example:
Queue in cafeteria waiting for food
Example from IT:
Queue of processes waiting for execution
Main methods:
add/enqueue, remove/delete/dequeue
Explain the meaning of GIGO: Garbage in, garbage out.
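The FIFO principle can be sketched in the same way as the stack, again with a Ruby array underneath. The class name `ArrayQueue` is hypothetical, and this is not necessarily the implementation in 4ADTs.rb.

```ruby
# Minimal array-backed queue (illustrative sketch).
class ArrayQueue
  def initialize
    @items = []
  end

  # Adds e at the end of the queue; returns the queue to allow chaining.
  def enqueue(e)
    @items.push(e)
    self
  end

  # Removes and returns the element that has waited longest.
  def dequeue
    raise "queue is empty" if empty?
    @items.shift
  end

  def empty?
    @items.empty?
  end
end

ready = ArrayQueue.new
ready.enqueue("process A").enqueue("process B").enqueue("process C")
puts ready.dequeue  # process A (first in, first out)
puts ready.dequeue  # process B
```

The only difference from a stack is the end from which elements are removed.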

 

Comparing ADTs

Implementation: 4ADTs.rb; some complexities can be improved by using additional variables

ADT | stack | stack | queue | queue
Implemented as | Array | LinearList | Array | LinearList
create | O(n) | O(1) | O(n) | O(1)
add | O(1) | O(1) or O(n) | O(n) or O(1) | O(n) or O(1)
delete | O(1) | O(n) or O(1) | O(1) or O(n) | O(1) or O(n)
empty? | O(1) | O(1) | O(1) | O(1)
length | O(1) | O(n) | O(1) | O(n)
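As an illustration of how additional variables can improve complexities, the following sketch of a queue on a linked list keeps a reference to the last node and a length counter. With these two extra variables, add, delete, and length are all O(1). The class name `LinkedQueue` and its internals are assumptions, not the code in 4ADTs.rb.

```ruby
# Linked-list queue with a tail reference and a length counter
# (illustrative sketch).
class LinkedQueue
  Node = Struct.new(:value, :next_node)

  attr_reader :length  # O(1): stored, not recomputed by walking the list

  def initialize
    @head = nil    # node to be removed next
    @tail = nil    # last node; makes adding at the end O(1)
    @length = 0
  end

  def enqueue(e)
    node = Node.new(e, nil)
    if @tail
      @tail.next_node = node
    else
      @head = node  # queue was empty
    end
    @tail = node
    @length += 1
    self
  end

  def dequeue
    raise "queue is empty" if empty?
    node = @head
    @head = node.next_node
    @tail = nil if @head.nil?  # queue became empty
    @length -= 1
    node.value
  end

  def empty?
    @length.zero?
  end
end

q = LinkedQueue.new
q.enqueue(:a).enqueue(:b).enqueue(:c)
puts q.length   # 3
puts q.dequeue  # a
puts q.length   # 2
```

Without the tail reference, adding at the end requires walking the whole list, which is the O(n) case in the table.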

 

Summary

 

Homework

(no need to submit)

  1. Order the following orders of growth, and explain the reason for your order:

    O(n^2), O(n!), O(n log log n), O(n log n), O(20^n)

  2. Write a simple program that uses the classes in 4ADTs.rb.
    Use this program to compare the implementations.
    Hint: Use the second part of 2search.rb as an example.
  3. Implement the priority queue ADT (use Ruby or any other programming language)

    A priority queue keeps a priority (e.g. integer) for each data item.
    In the simplest case, the only data is the priority.
    The items with the highest priority leave the queue first.
    Your implementation can use an array or a linked list or any other data structure.

 

Report: Manual Sorting

Deadline: October 24, 2018 (Wednesday), 19:00.

Where to submit: Box in front of room O-529 (building O, 5th floor)

Format:

Problem: Propose and describe an algorithm for manual sorting, for the following two cases:

  1. One person sorts 5'000 pages
  2. 12 people together sort 40'000 pages

Each page is a sheet of paper of size A4, where a 10-digit number is printed in big letters.

The goal is to sort the pages by increasing number. There is no knowledge about the distribution of the numbers.

You can use the same algorithm for both cases, or a different algorithm.

Details:

  

Glossary

polynomial growth
多項式増加
exponential growth
指数的増加
integers with unlimited precision
非固定長整数
summation
総和
recurrence (relation)
漸化式
ceiling function
天井関数
substitution
置換
abstract data type
抽象データ型
encapsulation
カプセル化
data integrity
データの完全性
modularization
モジュール化
type theory
型理論
object-oriented
オブジェクト指向 (形容詞)
type
型
class
クラス
member function
メンバ関数
method
メソッド
stack
スタック
cafeteria
食堂
axiom
公理
queue
待ち行列、キュー
ring buffer
リングバッファ
priority queue
順位キュー、優先順位キュー、優先順位付き待ち行列