[Note: this document is formatted similarly to the SGI STL implementation documentation pages, and refers to concepts and classes defined there. However, neither this document nor the code it describes is associated with SGI, nor is it necessary to have SGI's STL implementation installed in order to use this class.]
dense_hash_map is a Hashed Associative Container that associates objects of type Key with objects of type Data. dense_hash_map is a Pair Associative Container, meaning that its value type is pair<const Key, Data>. It is also a Unique Associative Container, meaning that no two elements have keys that compare equal using EqualKey.
Looking up an element in a dense_hash_map by its key is efficient, so dense_hash_map is useful for "dictionaries" where the order of elements is irrelevant. If it is important for the elements to be in a particular order, however, then map is more appropriate.
dense_hash_map is distinguished from other hash-map implementations by its speed and by the ability to save and restore contents to disk. On the other hand, this hash-map implementation can use significantly more space than other hash-map implementations, and it also has requirements -- for instance, for a distinguished "empty key" -- that may not be easy for all applications to satisfy.
This class is appropriate for applications that need speedy access to relatively small "dictionaries" stored in memory, and for applications that need these dictionaries to be persistent. [implementation note])
#include <google/dense_hash_map> using std::dense_hash_map; // namespace where class lives by default struct eqstr { bool operator()(const char* s1, const char* s2) const { return strcmp(s1, s2) == 0; } }; int main() { dense_hash_map<const char*, int, hash<const char*>, eqstr> months; months.set_empty_key(NULL); months["january"] = 31; months["february"] = 28; months["march"] = 31; months["april"] = 30; months["may"] = 31; months["june"] = 30; months["july"] = 31; months["august"] = 31; months["september"] = 30; months["october"] = 31; months["november"] = 30; months["december"] = 31; cout << "september -> " << months["september"] << endl; cout << "april -> " << months["april"] << endl; cout << "june -> " << months["june"] << endl; cout << "november -> " << months["november"] << endl; }
Parameter | Description | Default |
---|---|---|
Key | The hash_map's key type. This is also defined as dense_hash_map::key_type. | |
Data | The hash_map's data type. This is also defined as dense_hash_map::data_type. | |
HashFcn | The hash function used by the hash_map. This is also defined as dense_hash_map::hasher. | hash<Key> |
EqualKey | The hash_map key equality function: a binary predicate that determines whether two keys are equal. This is also defined as dense_hash_map::key_equal. | equal_to<Key> |
Alloc | Ignored; this is included only for API-compatibility with SGI's STL implementation. | alloc |
Member | Where defined | Description |
---|---|---|
key_type [7] | Associative Container | The dense_hash_map's key type, Key. |
data_type [7] | Pair Associative Container | The type of object associated with the keys. |
value_type | Pair Associative Container | The type of object, pair<const key_type, data_type>, stored in the hash_map. |
hasher | Hashed Associative Container | The dense_hash_map's hash function. |
key_equal | Hashed Associative Container | Function object that compares keys for equality. |
pointer | Container | Pointer to T. |
reference | Container | Reference to T |
const_reference | Container | Const reference to T |
size_type | Container | An unsigned integral type. |
difference_type | Container | A signed integral type. |
iterator | Container | Iterator used to iterate through a dense_hash_map. [1] |
const_iterator | Container | Const iterator used to iterate through a dense_hash_map. |
iterator begin() | Container | Returns an iterator pointing to the beginning of the dense_hash_map. |
iterator end() | Container | Returns an iterator pointing to the end of the dense_hash_map. |
const_iterator begin() const | Container | Returns an const_iterator pointing to the beginning of the dense_hash_map. |
const_iterator end() const | Container | Returns an const_iterator pointing to the end of the dense_hash_map. |
size_type size() const | Container | Returns the size of the dense_hash_map. |
size_type max_size() const | Container | Returns the largest possible size of the dense_hash_map. |
bool empty() const | Container | true if the dense_hash_map's size is 0. |
size_type bucket_count() const | Hashed Associative Container | Returns the number of buckets used by the dense_hash_map. |
size_type max_bucket_count() const | Hashed Associative Container | Returns the largest possible number of buckets used by the dense_hash_map. |
void resize(size_type n) | Hashed Associative Container | Increases the bucket count to at least n. [4] |
hasher hash_funct() const | Hashed Associative Container | Returns the hasher object used by the dense_hash_map. |
key_equal key_eq() const | Hashed Associative Container | Returns the key_equal object used by the dense_hash_map. |
dense_hash_map() | Container | Creates an empty dense_hash_map. |
dense_hash_map(size_type n) | Hashed Associative Container | Creates an empty dense_hash_map that's optimized for holding up to n items. [5] |
dense_hash_map(size_type n, const hasher& h) | Hashed Associative Container | Creates an empty dense_hash_map that's optimized for up to n items, using h as the hash function. |
dense_hash_map(size_type n, const hasher& h, const key_equal& k) | Hashed Associative Container | Creates an empty dense_hash_map that's optimized for up to n items, using h as the hash function and k as the key equal function. |
template <class InputIterator> dense_hash_map(InputIterator f, InputIterator l)[2] |
Unique Hashed Associative Container | Creates a dense_hash_map with a copy of a range. |
template <class InputIterator> dense_hash_map(InputIterator f, InputIterator l, size_type n)[2] |
Unique Hashed Associative Container | Creates a hash_map with a copy of a range that's optimized to hold up to n items. |
template <class InputIterator> dense_hash_map(InputIterator f, InputIterator l, size_type n, const hasher& h)[2] |
Unique Hashed Associative Container | Creates a hash_map with a copy of a range that's optimized to hold up to n items, using h as the hash function. |
template <class InputIterator> dense_hash_map(InputIterator f, InputIterator l, size_type n, const hasher& h, const key_equal& k)[2] |
Unique Hashed Associative Container | Creates a hash_map with a copy of a range that's optimized for holding up to n items, using h as the hash function and k as the key equal function. |
dense_hash_map(const hash_map&) | Container | The copy constructor. |
dense_hash_map& operator=(const hash_map&) | Container | The assignment operator |
void swap(hash_map&) | Container | Swaps the contents of two hash_maps. |
pair<iterator, bool> insert(const value_type& x) |
Unique Associative Container | Inserts x into the dense_hash_map. |
template <class InputIterator> void insert(InputIterator f, InputIterator l)[2] |
Unique Associative Container | Inserts a range into the dense_hash_map. |
void set_empty_key(const key_type& key) [6] | dense_hash_map | See below. |
void set_deleted_key(const key_type& key) [6] | dense_hash_map | See below. |
void clear_deleted_key() [6] | dense_hash_map | See below. |
void erase(iterator pos) | Associative Container | Erases the element pointed to by pos. [6] |
size_type erase(const key_type& k) | Associative Container | Erases the element whose key is k. [6] |
void erase(iterator first, iterator last) | Associative Container | Erases all elements in a range. [6] |
void clear() | Associative Container | Erases all of the elements. |
const_iterator find(const key_type& k) const | Associative Container | Finds an element whose key is k. |
iterator find(const key_type& k) | Associative Container | Finds an element whose key is k. |
size_type count(const key_type& k) const | Unique Associative Container | Counts the number of elements whose key is k. |
pair<const_iterator, const_iterator> equal_range(const key_type& k) const |
Associative Container | Finds a range containing all elements whose key is k. |
pair<iterator, iterator> equal_range(const key_type& k) |
Associative Container | Finds a range containing all elements whose key is k. |
data_type& operator[](const key_type& k) [3] |
dense_hash_map | See below. |
bool write_metadata(FILE *fp) | dense_hash_map | See below. |
bool read_metadata(FILE *fp) | dense_hash_map | See below. |
bool write_nopointer_data(FILE *fp) | dense_hash_map | See below. |
bool read_nopointer_data(FILE *fp) | dense_hash_map | See below. |
bool operator==(const hash_map&, const hash_map&) |
Hashed Associative Container | Tests two hash_maps for equality. This is a global function, not a member function. |
Member | Description |
---|---|
void set_empty_key(const key_type& key) | Sets the distinguished "empty" key to key. This must be called immediately after construct time, before calls to another other dense_hash_map operation. [6] |
void set_deleted_key(const key_type& key) | Sets the distinguished "deleted" key to key. This must be called before any calls to erase(). [6] |
void clear_deleted_key() [6] | Clears the distinguished "deleted" key. After this is called, calls to erase() are not valid on this object. [6] |
data_type& operator[](const key_type& k) [3] |
Returns a reference to the object that is associated with a particular key. If the dense_hash_map does not already contain such an object, operator[] inserts the default object data_type(). [3] |
bool write_metadata(FILE *fp) | Write hashtable metadata to fp. See below. |
bool read_metadata(FILE *fp) | Read hashtable metadata from fp. See below. |
bool write_nopointer_data(FILE *fp) | Write hashtable contents to fp. This is valid only if the hashtable key and value are "plain" data. See below. |
bool read_nopointer_data(FILE *fp) | Read hashtable contents to fp. This is valid only if the hashtable key and value are "plain" data. See below. |
[1] dense_hash_map::iterator is not a mutable iterator, because dense_hash_map::value_type is not Assignable. That is, if i is of type dense_hash_map::iterator and p is of type dense_hash_map::value_type, then *i = p is not a valid expression. However, dense_hash_map::iterator isn't a constant iterator either, because it can be used to modify the object that it points to. Using the same notation as above, (*i).second = p is a valid expression.
[2] This member function relies on member template functions, which may not be supported by all compilers. If your compiler supports member templates, you can call this function with any type of input iterator. If your compiler does not yet support member templates, though, then the arguments must either be of type const value_type* or of type dense_hash_map::const_iterator.
[3] Since operator[] might insert a new element into the dense_hash_map, it can't possibly be a const member function. Note that the definition of operator[] is extremely simple: m[k] is equivalent to (*((m.insert(value_type(k, data_type()))).first)).second. Strictly speaking, this member function is unnecessary: it exists only for convenience.
[4] In order to preserve iterators, erasing hashtable elements does not cause a hashtable to resize. This means that after a string of erase() calls, the hashtable will use more space than is required. At a cost of invalidating all current iterators, you can call resize() to manually compact the hashtable. The hashtable promotes too-small resize() arguments to the smallest legal value, so to compact a hashtable, it's sufficient to call resize(0).
[5] Unlike some other hashtable implementations, the optional n in the constructor size indicates not the desired number of buckets that should be allocated, but instead the expected number of items to be inserted. The class then sizes the hash-map appropriately for the number of items specified. It's not an error to actually insert more or fewer items into the hashtable, but the implementation is most efficient -- does the fewest hashtable resizes -- if the number of inserted items is close to n.
[6] dense_hash_map requires you call set_empty_key() immediately after constructing the hash-map, and before calling any other dense_hash_map method. (This is the largest difference between the dense_hash_map API and other hash-map APIs. See implementation.html for why this is necessary.) The argument to set_empty_key() should be a key-value that is never used for legitimate hash-map entries. If you have no such key value, you will be unable to use dense_hash_map. It is an error to call insert() with an item whose key is the "empty key."
dense_hash_map also requires you call set_deleted_key() before calling erase(). The argument to set_deleted_key() should be a key-value that is never used for legitimate hash-map entries. It must be different from the key-value used for set_empty_key(). It is an error to call erase() without first calling set_deleted_key(), and it is also an error to call insert() with an item whose key is the "deleted key."There is no need to call set_deleted_key if you do not wish to call erase() on the hash-map.
It is acceptable to change the deleted-key at any time by calling set_deleted_key() with a new argument. You can also call clear_deleted_key(), at which point all keys become valid for insertion but no hashtable entries can be deleted until set_deleted_key() is called again.
[7] Both key_type and data_type must be plain old data (see implementation.html for why this is necessary). In addition, there should be no data structures that point directly into parts of key or value, including the key or value itself (for instance, you cannot have a value like struct {int a = 1, *b = &a}. This is because dense_hash_map uses malloc() and free() to allocate space for the key and value, and memmove() to reorganize the key and value in memory.
Almost all common uses fit this criterion: key_type can be a basic C type or an STL type like string, for instance. If value_type is complex, consider storing only the pointer to the data in the dense_hash_map. This will speed up the hashtable as well, since hashtable resizes will have to manipulate less memory.
IMPORTANT IMPLEMENTATION NOTE: In the current version of this code, the input/output routines for dense_hash_map have not yet been implemented. This section explains the API, but note that all calls to these routines will fail (return false). It is a TODO to remedy this situation. |
It is possible to save and restore dense_hash_map objects to disk. Storage takes place in two steps. The first writes the hashtable metadata. The second writes the actual data.
To write a hashtable to disk, first call write_metadata() on an open file pointer. This saves the hashtable information in a byte-order-independent format.
After the metadata has been written to disk, you must write the actual data stored in the hash-map to disk. If both the key and data are "simple" enough, you can do this by calling write_nopointer_data(). "Simple" data is data that can be safely copied to disk via fwrite(). Native C data types fall into this category, as do structs of native C data types. Pointers and STL objects do not.
Note that write_nopointer_data() does not do any endian conversion. Thus, it is only appropriate when you intend to read the data on the same endian architecture as you write the data.
If you cannot use write_nopointer_data() for any reason, you can write the data yourself by iterating over the dense_hash_map with a const_iterator and writing the key and data in any manner you wish.
To read the hashtable information from disk, first you must create a dense_hash_map object. Then open a file pointer to point to the saved hashtable, and call read_metadata(). If you saved the data via write_nopointer_data(), you can follow the read_metadata() call with a call to read_nopointer_data(). This is all that is needed.
If you saved the data through a custom write routine, you must call a custom read routine to read in the data. To do this, iterate over the dense_hash_map with an iterator; this operation is sensical because the metadata has already been set up. For each iterator item, you can read the key and value from disk, and set it appropriately. You will need to do a const_cast on the iterator, since it->first is always const. The code might look like this:
for (dense_hash_map<int*, int>::iterator it = ht.begin(); it != ht.end(); ++it) { const_cast<int*>(it->first) = new int; fread(const_cast<int*>(it->first), sizeof(int), 1, fp); fread(&it->second, sizeof(int), 1, fp); }
insert() invalidates all iterators, as does resize(). erase() is guaranteed not to invalidate any iterators.
This is implemented by making erase() not resize the hashtable. If you desire maximum space efficiency, you can call resize(0) after a string of erase() calls, to force the hashtable to resize to the smallest possible size.
The following are SGI STL concepts and classes related to dense_hash_map.
hash_map, Associative Container, Hashed Associative Container, Pair Associative Container, Unique Hashed Associative Container, set, map multiset, multimap, hash_set, hash_multiset, hash_multimap, sparse_hash_set, sparse_hash_map, dense_hash_set