H-Index - Google Top Interview Questions


Problem Statement :


In academia, the h-index is a metric used to measure the impact of a researcher's papers. It is calculated as follows:

A researcher has index h if at least h papers have at least h citations each. If multiple values of h satisfy this condition, the maximum is chosen.

Given a list citations containing the citation count of each of a researcher's papers, calculate their h-index.
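
The definition above can be checked directly on small inputs. The following is a minimal brute-force sketch, not one of the page's solutions; the function name h_index_by_definition is hypothetical, and it assumes citation counts are non-negative integers. It runs in O(n^2) time, so it is meant for illustrating the definition rather than for the full constraint range.

```python
def h_index_by_definition(citations):
    """Return the largest h such that at least h papers have >= h citations."""
    best = 0
    # h can never exceed the number of papers, so try every candidate 0..n.
    for h in range(len(citations) + 1):
        papers_with_h = sum(1 for c in citations if c >= h)
        if papers_with_h >= h:
            best = h
    return best


if __name__ == "__main__":
    # Inputs taken from the examples on this page.
    print(h_index_by_definition([4, 3, 0, 1, 5]))
    print(h_index_by_definition([4, 4, 0, 1, 5, 9]))
    print(h_index_by_definition([9, 10, 0, 1, 6]))
```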

Constraints

n ≤ 100,000, where n is the length of citations

Example 1
Input
citations = [4, 3, 0, 1, 5]
Output
3
Explanation
At least 3 papers have at least 3 citations each: 3, 4, 5.

Example 2
Input
citations = [4, 4, 0, 1, 5, 9]
Output
4
Explanation
At least 4 papers have at least 4 citations each: 4, 4, 5, 9.

Example 3
Input
citations = [9, 10, 0, 1, 6]
Output
3
Explanation
At least 3 papers have at least 3 citations each: 6, 9, 10.



Solution :







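Solution in C++ :

#include <algorithm>
#include <vector>
using namespace std;

// Sorts citations in place (ascending) and returns the h-index.
int solve(vector<int>& citations) {
    int h = 0, N = static_cast<int>(citations.size());
    sort(citations.begin(), citations.end());
    for (int i = 0; i < N; i++) {
        // The N - i papers at indices i..N-1 each have at least citations[i] citations,
        // so min(citations[i], N - i) is a valid h; keep the largest one seen.
        h = max(h, min(citations[i], N - i));
    }
    return h;
}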
                    


Solution in Java :

import java.util.*;

class Solution {
    public int solve(int[] citations) {
        // Sort ascending so that the n - i papers from index i onward
        // all have at least citations[i] citations.
        Arrays.sort(citations);
        int n = citations.length;
        // Start at 0 so that an empty input yields an h-index of 0.
        int hIndex = 0;
        for (int i = 0; i < n; i++) {
            int hPapers = n - i;
            int hCitations = citations[i];
            hIndex = Math.max(hIndex, Math.min(hPapers, hCitations));
        }
        return hIndex;
    }
}
                    


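Solution in Python :

class Solution:
    def solve(self, citations):
        # Binary search on the answer: h lies between 0 and the number of papers.
        lo, hi = 0, len(citations)
        while lo < hi:
            # Round up so that lo = mid always makes progress.
            mid = (lo + hi + 1) // 2
            # Count the papers with at least mid citations.
            cnt = 0
            for citation in citations:
                if citation >= mid:
                    cnt += 1
            if cnt >= mid:
                lo = mid  # mid is achievable; try a larger h
            else:
                hi = mid - 1  # mid is too large
        return lo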
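
The solutions above run in O(n log n) time, either by sorting or by binary searching on the answer. As an alternative, here is a hedged sketch of a counting (bucket) approach that runs in O(n) time and O(n) extra space. It is not one of the page's solutions; the name h_index_counting is hypothetical, and the sketch assumes citation counts are non-negative integers. It relies on the fact that the h-index can never exceed n, so any count above n can be capped at n.

```python
def h_index_counting(citations):
    """Compute the h-index in linear time using citation-count buckets."""
    n = len(citations)
    # buckets[k] = number of papers with exactly k citations,
    # with every count above n folded into buckets[n].
    buckets = [0] * (n + 1)
    for c in citations:
        buckets[min(c, n)] += 1

    # Walk h downward, accumulating how many papers have at least h citations.
    papers = 0
    for h in range(n, -1, -1):
        papers += buckets[h]
        if papers >= h:
            return h
    return 0  # Not reached: h = 0 always satisfies papers >= h.


if __name__ == "__main__":
    # Inputs taken from the examples on this page.
    print(h_index_counting([4, 3, 0, 1, 5]))
    print(h_index_counting([4, 4, 0, 1, 5, 9]))
    print(h_index_counting([9, 10, 0, 1, 6]))
```

Unlike the C++ and Java solutions, this sketch does not reorder the input list.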
                    

