Standard Template Library (STL) IV - Algorithms - 2020

bogotobogo.com site search:

Algorithms

The standard algorithms are found in <algorithms>. There are about 60 standard algorithms are defined in <algorithms>.

STL Algorithms Library can be characterized as following:

Sorting algorithms
Quicksort algorithm over elements a to b:
```
template <typename RandomAccessIterator>
void sort(RandomAccessIterator a, RandomAccessIterator b)
```
Stable sort algorithm over elements a to b:
```
template <typename RandomAccessIterator>
void sort(RandomAccessIterator a, RandomAccessIterator b)
```
Note: "Stable sorting algorithms maintain the relative order of records with equal keys (i.e. values). That is, a sorting algorithm is stable if whenever there are two records R and S with the same key and with R appearing before S in the original list, R will appear before S in the sorted list."
- wiki on stable sort
Non-mutating sequence algorithms
It does not modify contents of the containers they work on
Typical operation is searching container for particular element and returning its position.
Finds position of t in range a to b:
```
template <typeame InputIterator, typename T>
InputIterator find(InputIterator a, InputIterator b, const T& t);
```
Finds position of first element that makes predicate true in range of a to b, otherwise position b returned:
```
template <typeame InputIterator, typename Predicate>
InputIterator find(InputIterator a, InputIterator b, Predicate p);
```

Mutating sequence algorithms

STL's copy() function copies elements from a range to a location identified by an iterator:

template<typename InputIterator, typename OutputIterator>
OutputIterator copy(InputIterator first,	// beginning of source
			InputIterator last, 	// end of source
			OutputIterator result)	// beginning of destination
{
	while (first != last) {
		*result = *first;
		++first;
		++result;
	}
	return result;
);

A sample code is available at STL iterators - copy_algorithm.

Numerical algorithms

Sums and Inner products

#include <iostream>
#include <algorithm>
using namespace std;

int main()
{
  double u[3] = {1.1, 2.2, 3.3};
  double v[3] = {11.1, 22.2, 33.3};

  /* sum */
  double sum = accumulate(u, u+3, 0.0);

  /* inner product */
  double ip = inner_product(u, u+3, v, 0.0);

  cout << "sum = " << sum << endl
       << "inner product = " << ip << endl;

  return 0;
}

Output:

$ g++ -std=c++11 -o num num.cpp
$ ./num
sum = 6.6
inner product = 170.94

Adjacent difference

Generally use iterators to access containers instantiated on a given type

An input sequence is defined by a pair of iterators; an output sequence is defined by an iterator to its first element. In general, an algorithm is parameterized by one or more operations that can be defined as functions or as function objects. By returning the end of an input sequence, the algorithms indicate failure. For instance, find(begin, end, x) returns end if it can not find x.

find()

find is one of the non-modifying sequence operations. It finds the first occurrence of a value in a range:

#include <iostream>
#include <vector>

using namespace std;

template<class T, class U>
T find(T first, T last, const U&val;)
{
	while(first != last && *first != val) ++first;
	return first;
}

int main()
{
	vector<int> v;
	for(int i = 0; i < 10; i++) 
		v.push_back(i);
    
	int x1 = 5, x2 = 100;

	vector<int>::iterator it; 
	it = find(v.begin(),v.end(),x1);
	if(it != v.end()) {
		cout << "found " << x1 << endl;
	}
	else {
		cout << "could not find " << x1 << endl;
	}

	it = find(v.begin(),v.end(),x2);
	if(it != v.end()) {
		cout << "found " << x2 << endl;
	}
	else {
		cout << "could not find " << x2 << endl;
	}
	return 0;
}

Output is:

found 5
could not find 100

find() operates on a sequence defined by pair of iterators. We're trying to find val in the range [fist:last). The find() returns iterator as the result. The iterator points either to the first element of the sequence with the value val or to last. Returning an iterator to the one-beyond-the-last element of a sequence indicates not found.

The sequence consists of all the elements of a container, in this case, vector:

vector<int>::iterator it; 
it = find(v.begin(),v.end(),x1);

The iterator operations used are those of a vector<int>::iterator, in other words, ++ simply moves a pointer to the next location in memory and * dereferences the pointer.

while(first != last && *first != val) ++first;

The iterator comparison, first != last is a pointer comparison, and the value comparison *first != val simply compares two integers.

We check the returned iterator against the end of our sequence to see if we found the value:

if(it != v.end()) {

We could have written the implicit loop of the find() more explicitly:

template<class T, class U>
T find(T first, T last, const U&val;)
{
	for(T iter = first; iter != last; ++iter) 
		if(*iter == val) return iter;
	return last;
}

Note that we needed addition variable iter. So, the original find() may be more efficient and more compact.

The find() algorithm we wrote is generic. In other words, it can be used for different data types.

Here is an example using list instead of vector with the same find():

#include <iostream>
#include <list>
#include <string>

using namespace std;

template<class T, class U>
T find(T first, T last, const U&val;)
{
	while(first != last && *first != val) ++first;
	return first;
}

int main()
{
	list<string> myList(3);	// creates a list with 3 elements
	myList.push_back ("A");
	myList.push_front("B");
	myList.push_back("C");
    
	string x1 = "B", x2 = "F";

	list<string>::iterator it;
 
	it = find(myList.begin(),myList.end(),x1);
	if(it != myList.end()) {
		cout << "found " << x1 << endl;
	}
	else {
		cout << "could not find " << x1 << endl;
	}

	it = find(myList.begin(),myList.end(),x2);
	if(it != myList.end()) {
		cout << "found " << x2 << endl;
	}
	else {
		cout << "could not find " << x2 << endl;
	}
	return 0;
}

Output is:

found B
could not find F

Here, the iteration for find() is that of a list<string>::iterator. The operators have the required meaning, so that the underlying logic remains the same as for the vector<int> of the previous example. However, the implementation is different. In other words, ++ of the ++first simply follows a pointer in the link (or next) part of the element to where the next element of the list is stored.

From the examples above, we can easily find that the find() is extremely flexible as long as we obey the simple rules for iterators.

find_if()

Typically, we don't look for a specific value, rather we are more often look for a value that meets a certain criteria. The find() would be more useful if we could set our own find criteria. The standard algorithm that searches based on a criterion of user's choice is find_if():

template<class T, class P>
T find_if(T first, T last, P predicate)
{
	while(first != last && !predicate(*first) ++first;
	return first;
}

When we compare the find_if() with find(), it uses !predicate(*first) as a stop condition rather than *first != val. So, it stops searching once the predicate() succeeds rather than when an element has the same value with val.

A predicate is a function that returns true or false. Our find_if() needs a predicate that takes one argument so that it can say predicate(*first).

Here are the examples of predicates:

bool GreaterThan5(double d) { return 5.0 < d; }
bool FirstEven(int n) { return !(n % 2);}

And the complete code of using the predicates and find_if() is:

#include <iostream>
#include <vector>
#include <list>

using namespace std;

template<class T, class P>
T find_if(T first, T last, P predicate)
{
	while(first != last && !predicate(*first)) ++first;
	return first;
}

bool GreaterThan5(double d) { return 5.0 < d; }
bool FirstEven(int n) { return !(n % 2);}

void f1(vector<double>&v;)
{
	vector<double>::iterator it = find_if(v.begin(), v.end(), GreaterThan5);
	if(it != v.end()) 
		cout << "found " << *it << endl;
	else
		cout << "could not find " << endl;
}

void f2(list<int>&l;)
{
	list<int>::iterator it = find_if(l.begin(), l.end(), FirstEven);
	if(it != l.end()) 
		cout << "found " << *it << endl;
	else
		cout << "could not find " << endl;
}

int main()
{
	vector<double> v;
	for(int i = 1; i <= 10; i++) 
		v.push_back(i*1.0);

	list<int> myList;
	for(int i = 1;  i <= 10; i++) 
		myList.push_back(i);
	f1(v);
	f2(myList);

	return 0;
}

Output from the run is:

found 6
found 2

The find_if() calls GreaterThan5() or FirstEven() for each element until it finds the right element.

Functors

In the previous example, we used a predicate:

bool GreaterThan5(double d) { return 5.0 < d; }

That is ugly because if we want to find an element greater than 10, we have to make another one. So, we need to find a better way! Let's investigate what the property of GreaterThan() predicate. First, we can call it as a predicate like predicate(*first). Second, we can store a value, such as 10 or x so that it can be used when called.

That's the reason why we need a function object. In other words, we need an object that can behave like a function while it can store data. Here is the object we need:

class GreaterThan
{
	int data;
public:
	GreaterThan(int n):data(n) {}
	bool operator()(int n) const { return n > data; }
};

When we say GreaterThan(5), we make an object of class GreaterThan holding 5 as its member data:

find_if(v.begin(), v.end(), GreaterThan(5));

We pass that object to find_if() as its parameter called predicate. For each element of v, find_if() makes a call

predicate(*first)

This will invoke GreaterThan::operator() which is our function object using the argument *first. The result is a comparison of the element's value, *first, with 5.

What do we see here? The function call can be seen as an operator, the () operator, just like any operator. Therefore, () in predicate(*first) is given a meaning by GreaterThan::operator(), just as subscripting in v[i] is given a meaning by vector::operator[].

Here is the complete code:

#include <iostream>
#include <vector>
#include <list>

using namespace std;


template<class T, class P>
T find_if(T first, T last, P predicate)
{
	while(first != last && !predicate(*first)) ++first;
	return first;
}

class GreaterThan
{
	int data;
public:
	GreaterThan(int n):data(n) {}
	bool operator()(int n) const { return n > data; }
};

void f(vector<int>&v;)
{
	vector<int>::iterator it = find_if(v.begin(), v.end(), GreaterThan(5));
	if(it != v.end()) 
		cout << "found " << *it << endl;
	else
		cout << "could not find " << endl;
}

int main()
{
	vector<int> v;
	for(int i = 1; i <= 10; i++) 
		v.push_back(i);

	f(v);

	return 0;
}

Now, we established a way of allowing a function to carry around data that it needs. Obviously, the function objects provide us with a very powerful and convenient mechanism.

accumulate()

accumulate() is one of the simplest and most useful numerical algorithm:

a = accumulate(b,e,val)

It adds a sequence of values, and the type of the result a is the type of the initial value val.
It is found in <numeric>.

#include <iostream>

template <typename T, typename U>
U accumulate(T first, T last, U acc)
{
	while (first != last) {
		acc = acc + *first;
		++first;
	}
	return acc;
}

int main()
{
	int a[] = {1,2,3,4,5,6,7,8,9,10};
	std::cout << accumulate(a, a + sizeof(a)/sizeof(int), 0) << std::endl; // 55	
	return 0;
}

Here is another accumulator() which has four arguments, with the last one, we can specify the operation to be used:

/* acc.cpp */
#include <iostream>
#include <vector>
#include <functional>

template <typename T, typename U, typename Op>
U accumulate(T first, T last, U acc, Op op)
{
	while (first != last) {
		acc = op(acc, *first);
		++first;
	}
	return acc;
}


int main()
{
        double arr[] = {1.0, 2.1, 3.2, 4.3, 5.4};
        std::vector<double> v(arr, arr+5);

	std::cout << accumulate(v.begin(),v.end(), 1.0, std::multiplies<double>()) << std::endl; 
	std::cout << accumulate(v.begin(),v.end(), 1.0, std::divides<double>()) << std::endl;
	std::cout << accumulate(v.begin(),v.end(), 1.0, std::minus<int>()) << std::endl;
	
	return 0;
}

Output is:

$ g++ -o acc acc.cpp
$ ./acc
156.038
0.00640868
-14

Note that unlike other operations, the minus() operation was done against integers.

transform()

template <class InputIterator, class OutputIterator, class UnaryOperator>
  OutputIterator transform ( InputIterator first, InputIterator last,
                             OutputIterator output, UnaryOperator op );

The op will be applied to all the elements in the input range ([first,last)) and stores each returned value in the range beginning at output.

Here is the sample which converts a string to uppercase.

#include <iostream>
#include <string>
#include <algorithm>

std::string toUpperCase(std::string str)
{
    std::string out;
    std::transform(str.begin(), str.end(), std::back_inserter(out), ::toupper);
    return out;
}


int main()
{
    std::string str = "Stravinsky";
    std::cout << "In: " << str << std::endl; 
    std::cout << "Out: " << toUpperCase(str) << std::endl; 
    return 0;
}

List of algorithms

r=find(b,e,v)
r points to the first occurrence of v in [b:e).
r=find_if(b,e,v)
r points to the first occurrence of v in [b:e) so that v(x) is true.
r=count(b,e,v)
r is the number of occurrence of v in [b:e).
r=count_if(b,e,v)
r is the number of occurrence in [b:e) so that v(x) is true.
sort(b,e)
Sort [b,e) using <.
sort(b,e,v)
Sort [b,e) using v.
copy(b,e,b2)
Copy [b,e) to [b2:b2+(e-b)); there had better be enough elements after b2.
unique_copy(b,e,b2)
Copy [b,e) to [b2:b2+(e-b)); don't copy adjacent duplicates.
merge(b,e,b2,e2,r)
Merge two sorted sequencec [b2,e2) and [b,e) into [r:r+(e-b)+(e2-b2)).
r=equal_range(b,e,v)
r is te subsequence of the sorted range [b,e) with the value v, basically, a binary search for v.
equal(b,e,b2)
Do all elements of [b,e) and [b2:b2+(e-b)) compare equal?
x=accumulate(b,e,i)
x is the sum of i and the elements of [b,e).
x=accumulate(b,e,i,op)
Like the other accumulate, but with the sum calculate using op.
x=inner_product(b,e,b2,i)
x is the inner product of [b,e) and [b2:b2+(e-b)).
x=inner_product(b,e,b2,i,op,op2)
Like the othe inner_product, but with op and op2 instead of + and *.

Ph.D. / Golden Gate Ave, San Francisco / Seoul National Univ / Carnegie Mellon / UC Berkeley / DevOps / Deep Learning / Visualization

My YouTube channel

Sponsor Open Source development activities and free contents for everyone.

Thank you.

- K Hong

Sponsor Open Source development activities and free contents for everyone.

Thank you.

- K Hong

C++ Tutorials

C++ Home

Algorithms & Data Structures in C++ ...

Application (UI) - using Windows Forms (Visual Studio 2013/2012)

auto_ptr

Binary Tree Example Code

Blackjack with Qt

Boost - shared_ptr, weak_ptr, mpl, lambda, etc.

Boost.Asio (Socket Programming - Asynchronous TCP/IP)...

Classes and Structs

Constructor

C++11(C++0x): rvalue references, move constructor, and lambda, etc.

C++ API Testing

C++ Keywords - const, volatile, etc.

Debugging Crash & Memory Leak

Design Patterns in C++ ...

Dynamic Cast Operator

Eclipse CDT / JNI (Java Native Interface) / MinGW

Embedded Systems Programming I - Introduction

Embedded Systems Programming II - gcc ARM Toolchain and Simple Code on Ubuntu and Fedora

Embedded Systems Programming III - Eclipse CDT Plugin for gcc ARM Toolchain

Exceptions

Friend Functions and Friend Classes

fstream: input & output

Function Overloading

Functors (Function Objects) I - Introduction

Functors (Function Objects) II - Converting function to functor

Functors (Function Objects) - General

Git and GitHub Express...

GTest (Google Unit Test) with Visual Studio 2012

Inheritance & Virtual Inheritance (multiple inheritance)

Libraries - Static, Shared (Dynamic)

Linked List Basics

Linked List Examples

make & CMake

make (gnu)

Memory Allocation

Multi-Threaded Programming - Terminology - Semaphore, Mutex, Priority Inversion etc.

Multi-Threaded Programming II - Native Thread for Win32 (A)

Multi-Threaded Programming II - Native Thread for Win32 (B)

Multi-Threaded Programming II - Native Thread for Win32 (C)

Multi-Threaded Programming II - C++ Thread for Win32

Multi-Threaded Programming III - C/C++ Class Thread for Pthreads

MultiThreading/Parallel Programming - IPC

Multi-Threaded Programming with C++11 Part A (start, join(), detach(), and ownership)

Multi-Threaded Programming with C++11 Part B (Sharing Data - mutex, and race conditions, and deadlock)

Multithread Debugging

Object Returning

Object Slicing and Virtual Table

OpenCV with C++

Operator Overloading I

Operator Overloading II - self assignment

Pass by Value vs. Pass by Reference

Pointers

Pointers II - void pointers & arrays

Pointers III - pointer to function & multi-dimensional arrays

Preprocessor - Macro

Private Inheritance

Python & C++ with SIP

(Pseudo)-random numbers in C++

References for Built-in Types

Socket - Server & Client

Socket - Server & Client 2

Socket - Server & Client 3

Socket - Server & Client with Qt (Asynchronous / Multithreading / ThreadPool etc.)

Stack Unwinding

Standard Template Library (STL) I - Vector & List

Standard Template Library (STL) II - Maps

Standard Template Library (STL) II - unordered_map

Standard Template Library (STL) II - Sets

Standard Template Library (STL) III - Iterators

Standard Template Library (STL) IV - Algorithms

Standard Template Library (STL) V - Function Objects

Static Variables and Static Class Members

String

String II - sstream etc.

Taste of Assembly

Templates

Template Specialization

Template Specialization - Traits

Template Implementation & Compiler (.h or .cpp?)

The this Pointer

Type Cast Operators

Upcasting and Downcasting

Virtual Destructor & boost::shared_ptr

Virtual Functions

Programming Questions and Solutions ↓

Strings and Arrays

Linked List

Recursion

Bit Manipulation

Small Programs (string, memory functions etc.)

Math & Probability

Multithreading

140 Questions by Google

Qt 5 EXPRESS...

Win32 DLL ...

Articles On C++

What's new in C++11...

C++11 Threads EXPRESS...

Go Tutorial

OpenCV...