algorithm-reading
leetcode/lintcode题解/算法学习笔记
1. Part I - Basics
2. Data Structure
- 2.1. Linked List
- 2.2. Binary Tree
- 2.3. Binary Search Tree
- 2.4. Huffman Compression
- 2.5. Priority Queue
3. Basics Sorting
- 3.1. Bubble Sort
- 3.2. Selection Sort
- 3.3. Insertion Sort
- 3.4. Merge Sort
- 3.5. Quick Sort
- 3.6. Heap Sort
- 3.7. Bucket Sort
- 3.8. Counting Sort
- 3.9. Radix Sort
4. Basics Misc
- 4.1. Bit Manipulation
5. Part II - Coding
6. String - 字符串
- 6.1. strStr
- 6.2. Two Strings Are Anagrams
- 6.3. Compare Strings
- 6.4. Anagrams
- 6.5. Longest Common Substring
- 6.6. Rotate String
- 6.7. Reverse Words in a String
7. Integer Array - 整型数组
- 7.1. Remove Element
- 7.2. Zero Sum Subarray
- 7.3. Subarray Sum K
- 7.4. Subarray Sum Closest
- 7.5. Product of Array Exclude Itself
- 7.6. Partition Array
- 7.7. First Missing Positive
- 7.8. 2 Sum
- 7.9. 3 Sum
- 7.10. 3 Sum Closest
- 7.11. Remove Duplicates from Sorted Array
- 7.12. Remove Duplicates from Sorted Array II
- 7.13. Merge Sorted Array
- 7.14. Merge Sorted Array II
- 7.15. Median
8. Binary Search - 二分搜索
- 8.1. Binary Search
- 8.2. Search Insert Position
- 8.3. Search for a Range
- 8.4. First Bad Version
- 8.5. Search a 2D Matrix
- 8.6. Find Peak Element
- 8.7. Search in Rotated Sorted Array
- 8.8. Find Minimum in Rotated Sorted Array
- 8.9. Search a 2D Matrix II
- 8.10. Median of two Sorted Arrays
- 8.11. Sqrt x
- 8.12. Wood Cut
9. Math and Bit Manipulation - 数学技巧与位运算
- 9.1. Single Number
- 9.2. Single Number II
- 9.3. Single Number III
- 9.4. O1 Check Power of 2
- 9.5. Convert Integer A to Integer B
- 9.6. Factorial Trailing Zeroes
- 9.7. Unique Binary Search Trees
- 9.8. Update Bits
- 9.9. Fast Power
10. Linked List - 链表
- 10.1. Remove Duplicates from Sorted List
- 10.2. Remove Duplicates from Sorted List II
- 10.3. Remove Duplicates from Unsorted List
- 10.4. Partition List
- 10.5. Two Lists Sum
- 10.6. Two Lists Sum Advanced
- 10.7. Remove Nth Node From End of List
- 10.8. Linked List Cycle
- 10.9. Linked List Cycle II
- 10.10. Reverse Linked List
- 10.11. Reverse Linked List II
- 10.12. Merge Two Sorted Lists
- 10.13. Merge k Sorted Lists
- 10.14. Reorder List
- 10.15. Copy List with Random Pointer
- 10.16. Sort List
- 10.17. Insertion Sort List
- 10.18. Check if a singly linked list is palindrome
11. Reverse - 翻转法
- 11.1. Recover Rotated Sorted Array
12. Binary Tree - 二叉树
- 12.1. Binary Tree Preorder Traversal
- 12.2. Binary Tree Inorder Traversal
- 12.3. Binary Tree Postorder Traversal
- 12.4. Binary Tree Level Order Traversal
- 12.5. Maximum Depth of Binary Tree
- 12.6. Balanced Binary Tree
- 12.7. Binary Tree Maximum Path Sum
- 12.8. Lowest Common Ancestor
13. Binary Search Tree - 二叉搜索树
- 13.1. Insert Node in a Binary Search Tree
- 13.2. Validate Binary Search Tree
- 13.3. Search Range in Binary Search Tree
- 13.4. Convert Sorted Array to Binary Search Tree
- 13.5. Convert Sorted List to Binary Search Tree
- 13.6. Binary Search Tree Iterator
14. Exhaustive Search - 穷竭搜索
- 14.1. Subsets
- 14.2. Unique Subsets
- 14.3. Permutation
- 14.4. Unique Permutations
- 14.5. Unique Binary Search Trees II
15. Dynamic Programming - 动态规划
- 15.1. Triangle
- 15.2. Knapsack - 背包问题
  - 15.2.1. Backpack
- 15.3. Matrix
  - 15.3.1. Minimum Path Sum
  - 15.3.2. Unique Paths
- 15.4. Sequence
  - 15.4.1. Climbing Stairs
  - 15.4.2. Jump Game
- 15.5. Word Break
- 15.6. Longest Increasing Subsequence
- 15.7. Palindrome Partitioning II
- 15.8. Longest Common Subsequence
- 15.9. Edit Distance
16. Appendix I Interview and Resume
- 16.1. Interview
- 16.2. Resume

algorithm-reading

Anagrams

Source

leetcode: Anagrams | LeetCode OJ
lintcode: (171) Anagrams

Given an array of strings, return all groups of strings that are anagrams.

Example
Given ["lint", "intl", "inlt", "code"], return ["lint", "inlt", "intl"].

Given ["ab", "ba", "cd", "dc", "e"], return ["ab", "ba", "cd", "dc"].
Note
All inputs will be in lower-case

题解1 - 双重`for`循环(TLE)

题 Two Strings Are Anagrams 的升级版，容易想到的方法为使用双重for循环两两判断字符串数组是否互为变位字符串。但显然此法的时间复杂度较高。还需要 $O(n)$ 的数组来记录字符串是否被加入到最终结果中。

C++

class Solution {
public:
    /**
     * @param strs: A list of strings
     * @return: A list of strings
     */
    vector<string> anagrams(vector<string> &strs) {
        if (strs.size() < 2) {
            return strs;
        }

        vector<string> result;
        vector<bool> visited(strs.size(), false);
        for (int s1 = 0; s1 != strs.size(); ++s1) {
            bool has_anagrams = false;
            for (int s2 = s1 + 1; s2 < strs.size(); ++s2) {
                if ((!visited[s2]) && isAnagrams(strs[s1], strs[s2])) {
                    result.push_back(strs[s2]);
                    visited[s2] = true;
                    has_anagrams = true;
                }
            }
            if ((!visited[s1]) && has_anagrams) result.push_back(strs[s1]);
        }

        return result;
    }

private:
    bool isAnagrams(string &s, string &t) {
        if (s.size() != t.size()) {
            return false;
        }

        const int AlphabetNum = 26;
        int letterCount[AlphabetNum] = {0};
        for (int i = 0; i != s.size(); ++i) {
            ++letterCount[s[i] - 'a'];
            --letterCount[t[i] - 'a'];
        }
        for (int i = 0; i != t.size(); ++i) {
            if (letterCount[t[i] - 'a'] < 0) {
                return false;
            }
        }

        return true;
    }
};

源码分析

strs 长度小于等于1时直接返回。
使用与 strs 等长的布尔数组表示其中的字符串是否被添加到最终的返回结果中。
双重循环遍历字符串数组，注意去重即可。
私有方法isAnagrams用于判断两个字符串是否互为变位词。

复杂度分析

私有方法isAnagrams最坏的时间复杂度为 $O(2L)$ , 其中 $L$ 为字符串长度。双重for循环时间复杂度近似为 $\frac {1}{2} O(n^2)$ , $n$ 为给定字符串数组数目。总的时间复杂度近似为 $O(n^2 L)$ . 使用了含有26个元素的 int 数组，空间复杂度可认为是 $O(1)$ .

题解2 - 排序 + hashmap

在题 Two Strings Are Anagrams 中曾介绍过使用排序和 hashmap 两种方法判断变位词。这里我们将这两种方法同时引入！只不过此时的 hashmap 的 key 为字符串，value 为该字符串在 vector 中出现的次数。两次遍历字符串数组，第一次遍历求得排序后的字符串数量，第二次遍历将排序后相同的字符串取出放入最终结果中。

C++

class Solution {
public:
    /**
     * @param strs: A list of strings
     * @return: A list of strings
     */
    vector<string> anagrams(vector<string> &strs) {
        unordered_map<string, int> hash;

        for (int i = 0; i < strs.size(); i++) {
            string str = strs[i];
            sort(str.begin(), str.end());
            ++hash[str];
        }

        vector<string> result;
        for (int i = 0; i < strs.size(); i++) {
            string str = strs[i];
            sort(str.begin(), str.end());
            if (hash[str] > 1) {
                result.push_back(strs[i]);
            }
        }

        return result;
    }
};

源码分析

建立 key 为字符串，value 为相应计数器的hashmap, unordered_map为 C++ 11中引入的哈希表数据结构^{unordered_map}, 这种新的数据结构和之前的 map 有所区别，详见^{map-unordered_map}。

第一次遍历字符串数组获得排序后的字符串计数器信息，第二次遍历字符串数组将哈希表中计数器值大于1的字符串取出。

复杂度分析

遍历一次字符串数组，复杂度为 $O(n)$ , 对单个字符串排序复杂度近似为 $O(L \log L)$ . 两次遍历字符串数组，故总的时间复杂度近似为 $O(nL \log L)$ . 使用了哈希表，空间复杂度为 $O(K)$ , 其中 K 为排序后不同的字符串个数。

Reference

^{unordered_map}. unordered_map - C++ Reference ↩
^{map-unordered_map}. c++ - Choosing between std::map and std::unordered_map - Stack Overflow ↩
Anagrams | 九章算法

algorithm-reading

Anagrams

Source

题解1 - 双重for循环(TLE)

C++

源码分析

复杂度分析

题解2 - 排序 + hashmap

C++

源码分析

复杂度分析

Reference

题解1 - 双重`for`循环(TLE)