Word Subsets
Problem Description
You are given two string arrays words1 and words2.
A string b is a subset of string a if every letter in b occurs in a including multiplicity.
- For example,
"wrr"is a subset of"warrior"but is not a subset of"world".
A string a from words1 is universal if for every string b in words2, b is a subset of a.
Return an array of all the universal strings in words1. You may return the answer in any order.
Examples
Example 1:
Input: words1 = ["amazon","apple","facebook","google","leetcode"], words2 = ["e","o"]
Output: ["facebook","google","leetcode"]
Example 2:
Input: words1 = ["amazon","apple","facebook","google","leetcode"], words2 = ["l","e"]
Output: ["apple","google","leetcode"]
Constraints
words1[i]andwords2[i]consist only of lowercase English letters.- All the strings of
words1are unique.
Solution for Word Subsets
Approach: Reduce to Single Word in B
Intuition
If b is a subset of a, then say a is a superset of b. Also, say is the count of the number of 's in the word.
When we check whether a word wordA in A is a superset of wordB, we are individually checking the counts of letters: that for each , we have .
Now, if we check whether a word wordA is a superset of all words , we will check for each letter and each i, that . This is the same as checking .
For example, when checking whether "warrior" is a superset of words B = ["wrr", "wa", "or"], we can combine these words in B to form a "maximum" word "arrow", that has the maximum count of every letter in each word in B.
Algorithm
Reduce B to a single word bmax as described above, then compare the counts of letters between words a in A, and bmax.
Code in Different Languages
- C++
- Java
- Python
#include <vector>
#include <string>
#include <unordered_map>
class Solution {
public:
std::vector<std::string> wordSubsets(std::vector<std::string>& A, std::vector<std::string>& B) {
std::vector<int> bmax(26, 0);
for (const auto& b : B) {
std::unordered_map<char, int> bCount = count(b);
for (int i = 0; i < 26; ++i)
bmax[i] = std::max(bmax[i], bCount[static_cast<char>('a' + i)]);
}
std::vector<std::string> ans;
for (const auto& a : A) {
std::unordered_map<char, int> aCount = count(a);
bool universal = true;
for (int i = 0; i < 26; ++i) {
if (aCount[static_cast<char>('a' + i)] < bmax[i]) {
universal = false;
break;
}
}
if (universal)
ans.push_back(a);
}
return ans;
}
private:
std::unordered_map<char, int> count(const std::string& S) {
std::unordered_map<char, int> ans;
for (char c : S)
ans[c]++;
return ans;
}
};
class Solution {
public List<String> wordSubsets(String[] A, String[] B) {
int[] bmax = count("");
for (String b: B) {
int[] bCount = count(b);
for (int i = 0; i < 26; ++i)
bmax[i] = Math.max(bmax[i], bCount[i]);
}
List<String> ans = new ArrayList();
search: for (String a: A) {
int[] aCount = count(a);
for (int i = 0; i < 26; ++i)
if (aCount[i] < bmax[i])
continue search;
ans.add(a);
}
return ans;
}
public int[] count(String S) {
int[] ans = new int[26];
for (char c: S.toCharArray())
ans[c - 'a']++;
return ans;
}
}
class Solution(object):
def wordSubsets(self, A, B):
def count(word):
ans = [0] * 26
for letter in word:
ans[ord(letter) - ord('a')] += 1
return ans
bmax = [0] * 26
for b in B:
for i, c in enumerate(count(b)):
bmax[i] = max(bmax[i], c)
ans = []
for a in A:
if all(x >= y for x, y in zip(count(a), bmax)):
ans.append(a)
return ans
Complexity Analysis
Time Complexity:
Reason: where and is the total amount of information in A and B respectively.
Space Complexity:
Video Solution
References
-
LeetCode Problem: Word Subsets
-
Solution Link: Word Subsets