Number of Valid Words in a Sentence

In this tutorial, we solve the Number of Valid Words problem with two approaches, brute force and optimized, each implemented in C++, Java, and Python.

Problem Description

Given a sentence, count the number of valid words. A valid word is defined by the following rules:

It only contains lowercase letters, hyphens, and/or punctuation (no digits).
There is at most one hyphen ('-'). If present, it must be surrounded by lowercase characters ("a-b" is valid, but "-ab" and "ab-" are not valid).
There is at most one punctuation mark ('!', '.', ','). If present, it must be at the end of the token ("ab,", "cd!", and "." are valid, but "a!b" and "c.," are not valid).
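
The three rules can also be condensed into a single regular expression. The sketch below is only an illustration of the rules, not one of the solutions presented later; the names `VALID_TOKEN` and `is_valid_token` are hypothetical, and the pattern assumes it is applied to one non-empty token at a time.

```python
import re

# Optional run of letters (with at most one hyphen between two letter runs),
# followed by at most one trailing punctuation mark.
VALID_TOKEN = re.compile(r"(?:[a-z]+(?:-[a-z]+)?)?[!.,]?")


def is_valid_token(token: str) -> bool:
    """Return True if a non-empty token satisfies all three rules."""
    return bool(token) and VALID_TOKEN.fullmatch(token) is not None


# The tokens quoted in the rules above.
for token in ["a-b", "-ab", "ab-", "ab,", "cd!", ".", "a!b", "c.,"]:
    print(token, is_valid_token(token))
```

Because digits appear nowhere in the pattern, any token containing one fails to match, which covers the first rule.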

Examples

Example 1:

Input: sentence = "cat and  dog"
Output: 3
Explanation: The valid words in the sentence are "cat", "and", and "dog".

Example 2:

Input: sentence = "!this  1-s b8d!"
Output: 0
Explanation: There are no valid words in the sentence. "!this" is invalid because it starts with a punctuation mark. "1-s" and "b8d" are invalid because they contain digits.

Example 3:

Input: sentence = "alice and  bob are playing stone-game10"
Output: 5
Explanation: The valid words in the sentence are "alice", "and", "bob", "are", and "playing". "stone-game10" is invalid because it contains digits.

Constraints

1 <= sentence.length <= 1000
sentence only contains lowercase English letters, digits, ' ', '-', '!', '.', and ','.
There will be at least 1 token.
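
Because the sentence may contain several spaces in a row (as in Example 1), the tokenizer must not produce empty tokens. A minimal sketch of the difference between splitting on runs of whitespace and splitting on every single space, using the Example 1 input:

```python
sentence = "cat and  dog"

# split() with no argument splits on runs of whitespace and drops empty strings.
tokens = sentence.split()
print(tokens)  # ['cat', 'and', 'dog']

# split(" ") splits on every single space, so the double space yields an empty string.
naive_tokens = sentence.split(" ")
print(naive_tokens)  # ['cat', 'and', '', 'dog']
```

An empty string contains no digits, hyphens, or punctuation, so a validator that only looks for rule violations would wrongly count it as a valid word.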

Solution for Number of Valid Words Problem

Intuition and Approach

Both approaches split the sentence into whitespace-separated tokens and check each token against the three rules in a single left-to-right scan. The brute force approach does this directly; the optimized approach keeps the same logic and only avoids recomputing the token length inside the loop.

Approach 1: Brute Force

The brute force approach iterates through each token of the sentence, validates it according to the given rules, and counts the number of valid tokens.

Code in Different Languages

C++

Written by @ImmidiSivani

#include <cctype>   // isdigit, islower
#include <sstream>
#include <string>

class Solution {
public:
    int countValidWords(std::string sentence) {
        std::istringstream iss(sentence);
        std::string word;
        int validWordCount = 0;
        
        while (iss >> word) {
            if (isValid(word)) {
                validWordCount++;
            }
        }
        
        return validWordCount;
    }
    
private:
    bool isValid(const std::string& word) {
        int hyphenCount = 0;
        int punctuationCount = 0;
        
        for (int i = 0; i < word.length(); i++) {
            if (isdigit(word[i])) {
                return false;
            }
            if (word[i] == '-') {
                hyphenCount++;
                if (hyphenCount > 1 || i == 0 || i == word.length() - 1 || !islower(word[i-1]) || !islower(word[i+1])) {
                    return false;
                }
            }
            if (word[i] == '!' || word[i] == '.' || word[i] == ',') {
                punctuationCount++;
                if (punctuationCount > 1 || i != word.length() - 1) {
                    return false;
                }
            }
        }
        
        return true;
    }
};

Java

class Solution {
    public int countValidWords(String sentence) {
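        // trim() first: a leading space would otherwise yield an empty first token,
        // which isValid() would accept because it contains no violations.
        String[] tokens = sentence.trim().split("\\s+");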
        int validWordCount = 0;
        
        for (String token : tokens) {
            if (isValid(token)) {
                validWordCount++;
            }
        }
        
        return validWordCount;
    }
    
    private boolean isValid(String word) {
        int hyphenCount = 0;
        int punctuationCount = 0;
        
        for (int i = 0; i < word.length(); i++) {
            char c = word.charAt(i);
            
            if (Character.isDigit(c)) {
                return false;
            }
            if (c == '-') {
                hyphenCount++;
                if (hyphenCount > 1 || i == 0 || i == word.length() - 1 || !Character.isLowerCase(word.charAt(i - 1)) || !Character.isLowerCase(word.charAt(i + 1))) {
                    return false;
                }
            }
            if (c == '!' || c == '.' || c == ',') {
                punctuationCount++;
                if (punctuationCount > 1 || i != word.length() - 1) {
                    return false;
                }
            }
        }
        
        return true;
    }
}

Python

class Solution:
    def countValidWords(self, sentence: str) -> int:
        tokens = sentence.split()
        valid_word_count = 0
        
        for token in tokens:
            if self.is_valid(token):
                valid_word_count += 1
                
        return valid_word_count
    
    def is_valid(self, word: str) -> bool:
        hyphen_count = 0
        punctuation_count = 0
        
        for i, c in enumerate(word):
            if c.isdigit():
                return False
            if c == '-':
                hyphen_count += 1
                if hyphen_count > 1 or i == 0 or i == len(word) - 1 or not (word[i - 1].islower() and word[i + 1].islower()):
                    return False
            if c in "!.,":
                punctuation_count += 1
                if punctuation_count > 1 or i != len(word) - 1:
                    return False
        
        return True

Complexity Analysis

Time Complexity: $O(n)$
Space Complexity: $O(n)$
Where n is the length of the sentence.
The time complexity is $O(n)$ because each character is examined a constant number of times: once while splitting the sentence into tokens and once while validating its token.
The space complexity is $O(n)$ because the tokens (or, in C++, the stream's copy of the sentence) are stored separately from the input.

Approach 2: Optimized Approach

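The optimized approach uses the same validation logic as the brute force approach. The only change in the code below is that the token length is computed once and stored in n, instead of being re-evaluated in the loop condition and in each boundary check.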

Code in Different Languages

C++

#include <cctype>   // isdigit, islower
#include <sstream>
#include <string>

class Solution {
public:
    int countValidWords(std::string sentence) {
        std::istringstream iss(sentence);
        std::string word;
        int validWordCount = 0;
        
        while (iss >> word) {
            if (isValid(word)) {
                validWordCount++;
            }
        }
        
        return validWordCount;
    }
    
private:
    bool isValid(const std::string& word) {
        int hyphenCount = 0;
        int punctuationCount = 0;
        int n = word.length();
        
        for (int i = 0; i < n; i++) {
            char c = word[i];
            
            if (isdigit(c)) {
                return false;
            }
            if (c == '-') {
                hyphenCount++;
                if (hyphenCount > 1 || i == 0 || i == n - 1 || !islower(word[i-1]) || !islower(word[i+1])) {
                    return false;
                }
            }
            if (c == '!' || c == '.' || c == ',') {
                punctuationCount++;
                if (punctuationCount > 1 || i != n - 1) {
                    return false;
                }
            }
        }
        
        return true;
    }
};

Java

class Solution {
    public int countValidWords(String sentence) {
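        // trim() first: a leading space would otherwise yield an empty first token,
        // which isValid() would accept because it contains no violations.
        String[] tokens = sentence.trim().split("\\s+");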
        int validWordCount = 0;
        
        for (String token : tokens) {
            if (isValid(token)) {
                validWordCount++;
            }
        }
        
        return validWordCount;
    }
    
    private boolean isValid(String word) {
        int hyphenCount = 0;
        int punctuationCount = 0;
        int n = word.length();
        
        for (int i = 0; i < n; i++) {
            char c = word.charAt(i);
            
            if (Character.isDigit(c)) {
                return false;
            }
            if (c == '-') {
                hyphenCount++;
                if (hyphenCount > 1 || i == 0 || i == n - 1 || !Character.isLowerCase(word.charAt(i - 1)) || !Character.isLowerCase(word.charAt(i + 1))) {
                    return false;
                }
            }
            if (c == '!' || c == '.' || c == ',') {
                punctuationCount++;
                if (punctuationCount > 1 || i != n - 1) {
                    return false;
                }
            }
        }
        
        return true;
    }
}

Python

class Solution:
    def countValidWords(self, sentence: str) -> int:
        tokens = sentence.split()
        valid_word_count = 0
        
        for token in tokens:
            if self.is_valid(token):
                valid_word_count += 1
                
        return valid_word_count
    
    def is_valid(self, word: str) -> bool:
        hyphen_count = 0
        punctuation_count = 0
        n = len(word)
        
        for i, c in enumerate(word):
            if c.isdigit():
                return False
            if c == '-':
                hyphen_count += 1
                if hyphen_count > 1 or i == 0 or i == n - 1 or not (word[i - 1].islower() and word[i + 1].islower()):
                    return False
            if c in "!.,":
                punctuation_count += 1
                if punctuation_count > 1 or i != n - 1:
                    return False
        
        return True

Complexity Analysis

Time Complexity: $O(n)$
Space Complexity: $O(n)$
Where n is the length of the sentence.
The time complexity is $O(n)$ because caching the token length does not change the amount of work: each character is still examined a constant number of times.
The space complexity is $O(n)$ because the tokens (or, in C++, the stream's copy of the sentence) are still stored separately from the input.
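
The $O(n)$ space comes from materializing the tokens. As a sketch of one way to avoid building the full token list (an alternative, not part of the solutions above), the tokens can be visited lazily with `re.finditer`, so only one token is held at a time. The names `is_valid` and `count_valid_words_lazy` are illustrative.

```python
import re


def is_valid(word: str) -> bool:
    """Apply the three rules from the problem description to one token."""
    n = len(word)
    hyphen_seen = False
    for i, c in enumerate(word):
        if c.isdigit():
            return False
        if c == "-":
            # Only one hyphen, and it needs a lowercase letter on both sides.
            if hyphen_seen or i == 0 or i == n - 1:
                return False
            if not (word[i - 1].islower() and word[i + 1].islower()):
                return False
            hyphen_seen = True
        elif c in "!.,":
            # Punctuation is only allowed as the final character.
            if i != n - 1:
                return False
    return True


def count_valid_words_lazy(sentence: str) -> int:
    # finditer yields one match at a time instead of building a token list.
    return sum(1 for m in re.finditer(r"\S+", sentence) if is_valid(m.group()))


print(count_valid_words_lazy("cat and  dog"))                             # 3
print(count_valid_words_lazy("!this  1-s b8d!"))                          # 0
print(count_valid_words_lazy("alice and  bob are playing stone-game10"))  # 5
```

The extra space is then bounded by the longest token rather than by the whole sentence; the time complexity stays $O(n)$.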
