
Natural Language Processing (NLP) is an amalgamation of machine learning, computer science and linguistics that gives machines the ability to understand natural language in the manner it is spoken and written.
The study of NLP has been around for over 50 years. Before powerful computers were available, NLP implementations were limited to heuristic-based rules, which often led to inaccurate results and limited the scope of its use cases. Modern computing has made tasks like text summarization, language translation, chatbots, spelling correction, text auto-completion and image captioning possible.
Usage of Natural Language Processing can be seen all around us. Two very commonly occurring use cases are the spelling correction and auto-completion built into search engines and messaging apps, and voice assistants that understand and respond to spoken language.
Some of the commonly performed NLP tasks include tokenization, stopword removal, stemming, lemmatization, part-of-speech (POS) tagging and Named Entity Recognition (NER), all of which are covered later in this article.
Two of the most used NLP libraries, both demonstrated below, are NLTK and spaCy.
When performing feature engineering on text data, we can extract features such as Bag of Words (BOW) counts, TF-IDF scores, and n-gram or collocation statistics.
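To make the first of these concrete, here is a minimal Bag of Words sketch using scikit-learn's CountVectorizer; the two toy documents are invented for illustration:
from sklearn.feature_extraction.text import CountVectorizer
# Toy corpus, invented purely for illustration
docs = ["natural language processing is fun",
        "language models process natural text"]
# Bag of Words: one column per vocabulary term, one count per document
bow = CountVectorizer()
X = bow.fit_transform(docs)
print(bow.get_feature_names_out())
print(X.toarray())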
Collocations are phrases or expressions containing multiple words that are highly likely to co-occur, such as 'ice cream', 'machine learning' or 'natural language processing'. Collocations are different from plain bi-grams or tri-grams: any pair of adjacent words forms a bi-gram, but bi-grams do not always form meaningful phrases. The Pointwise Mutual Information (PMI) score is used for identifying collocations in text. It is calculated as below.
PMI(a, b) = log [ p(a, b) / (p(a) * p(b)) ]
In the above formula, p(a,b) is the probability that the two tokens 'a' and 'b' occur together in a piece of text, while p(a) and p(b) are the probabilities that tokens 'a' and 'b' occur individually in the same text. We can then choose a PMI threshold above which word pairs are treated as collocations and filtered from the document.
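As a sketch of how this can be done in practice, NLTK's collocations module can score bi-grams by PMI; the sample sentence here is an arbitrary choice for illustration:
from nltk.collocations import BigramAssocMeasures, BigramCollocationFinder
tokens = ("machine learning and natural language processing "
          "rely on machine learning models").split()
# Score every adjacent word pair by its PMI
finder = BigramCollocationFinder.from_words(tokens)
scored = finder.score_ngrams(BigramAssocMeasures().pmi)
# Inspect the scores, or keep only pairs above a chosen threshold
for pair, score in scored:
    print(pair, round(score, 2))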
Phonetic Hashing is a lexical processing technique used to reduce different pronunciations (and hence spellings) of the same word to a common base form. A common example of this problem occurs with the capital of India, New Delhi: 'Delhi' is also pronounced as 'Dilli', so it is not surprising to find both variants in an uncleaned text corpus.
Phonetic Hashing buckets words with a similar sound or pronunciation (i.e., made up of similar phonemes) into a single bucket and gives all these variations a single hash code, so the words 'Dilli' and 'Delhi' receive the same code. It is performed using the Soundex algorithm: keep the first letter of the word; map the remaining consonants to digits (b, f, p, v → 1; c, g, j, k, q, s, x, z → 2; d, t → 3; l → 4; m, n → 5; r → 6); drop vowels; collapse runs of identical adjacent digits; and pad or truncate the result to one letter plus three digits. Let's compute the Soundex hash code of the word "Mississippi": we keep 'M', the two runs of 's' map to 2 and 2 (separated by a vowel, so both are kept) and the run of 'p' maps to 1, giving M221.
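Below is a minimal sketch of Soundex in Python; it is a simplified variant that treats 'h', 'w' and 'y' like vowels, which is sufficient for these examples:
def soundex(word):
    # Soundex digit for each consonant; unmapped letters (vowels, h, w, y) separate runs
    mapping = {}
    for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")]:
        for ch in letters:
            mapping[ch] = digit
    word = word.lower()
    code = word[0].upper()
    prev = mapping.get(word[0], "")
    for ch in word[1:]:
        digit = mapping.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit  # a vowel resets the run, so repeated sounds after it count again
    return (code + "000")[:4]  # pad or truncate to one letter plus three digits

print(soundex("Delhi"), soundex("Dilli"))  # D400 D400 - same hash code
print(soundex("Mississippi"))              # M221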
The Zero Probability Problem occurs when an instance in the test dataset contains a category value that was absent from the training dataset. In such a case the conditional probability P(x|Ci) becomes 0, which in turn makes the overall probability estimate P(Ci|x) equal to zero, and we are unable to estimate any probability for the classes. A commonly used technique to overcome this problem is Laplace Smoothing, in which we add a small number such as 1 to each count in our dataset.
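Here is a toy sketch of Laplace (add-one) smoothing for the word-given-class probabilities of a Naive Bayes text classifier; the class names and counts are invented for illustration:
from collections import Counter
# Invented word counts per class
class_word_counts = {"spam": Counter({"offer": 3, "free": 5}),
                     "ham":  Counter({"meeting": 4, "offer": 1})}
vocabulary = {"offer", "free", "meeting", "prize"}  # "prize" never seen in training

def smoothed_prob(word, cls, alpha=1):
    counts = class_word_counts[cls]
    total = sum(counts.values())
    # Add alpha to every count so unseen words get a small non-zero probability
    return (counts[word] + alpha) / (total + alpha * len(vocabulary))

print(smoothed_prob("prize", "spam"))  # non-zero despite a zero training count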
Linear Discriminant Analysis (LDA) is a dimensionality reduction technique primarily used for supervised classification problems. It creates a linear combination of features such that the new features separate the two or more classes in the original data. The two criteria used by LDA are maximizing the distance between the means of the classes (the between-class variance) and minimizing the spread within each class (the within-class variance). In other words, it tries to find a lower dimensional space in which the ratio of between-class variance to within-class variance is maximized.
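As a quick sketch, scikit-learn's LinearDiscriminantAnalysis implements this; the Iris dataset is used here purely as a convenient example:
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)  # 4 features, 3 classes
# Supervised: fit_transform uses the class labels y, unlike PCA
lda = LinearDiscriminantAnalysis(n_components=2)
X_lda = lda.fit_transform(X, y)
print(X_lda.shape)  # (150, 2) - LDA yields at most (n_classes - 1) components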
There are 4 main types of RNN architectures, classified by how input sequences map to output sequences: one-to-one, one-to-many (e.g. image captioning), many-to-one (e.g. sentiment classification) and many-to-many (e.g. machine translation).
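The sketch below, using Keras as one possible framework choice, shows how a many-to-one layer differs from a many-to-many layer on a random input sequence:
import tensorflow as tf
# One random batch: 1 sequence, 10 timesteps, 8 features per timestep
x = tf.random.normal((1, 10, 8))

many_to_one = tf.keras.layers.SimpleRNN(16)                          # final state only
many_to_many = tf.keras.layers.SimpleRNN(16, return_sequences=True)  # one output per step

print(many_to_one(x).shape)   # (1, 16)
print(many_to_many(x).shape)  # (1, 10, 16)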
Named Entity Recognition (NER) is a subtask of Information Extraction. The term 'Named Entity' refers to a real-world object such as a person, location, organization or monetary value; Narendra Modi, India and KnowledgeHut are all examples of Named Entities. NER is the process of extracting these entities from documents and classifying them into pre-defined categories like person, location, quantity and organization. For example, in the sentence "Narendra Modi is the Prime Minister of India", NER would identify 'Narendra Modi' as a Person and 'India' as a Location.
spaCy is an open-source Python library which can be used to perform NER. Below is a code snippet which demonstrates how we can perform NER using spaCy.
import spacy
# Load the spaCy model to be used in the program
ner_model = spacy.load("en_core_web_sm")
# Sample text to run NER on
text = "Narendra Modi is the Prime Minister of India"
text_doc = ner_model(text)
# Iterate through each entity in the document and print the NER label
for entity in text_doc.ents:
    print(entity.text, entity.label_)
The output of the above code would look like this:
Narendra Modi PERSON
India GPE
NLTK, or the "Natural Language Toolkit", is an open-source Python library. We can perform a host of NLP tasks using this library, such as:
Tokenization
# Importing the dependencies
import nltk
from nltk.tokenize import word_tokenize
nltk.download("punkt")  # tokenizer models (first run only)
# Sample text to tokenize
text = "This is a sample text which is to be tokenized"
# Print the list of tokens generated
print(word_tokenize(text))
Stopword Removal
import nltk
from nltk.corpus import stopwords
nltk.download("stopwords")  # stopword lists (first run only)
# Print the stopwords available in the package
print(stopwords.words("english"))
Stemming
from nltk.stem.porter import PorterStemmer
from nltk.tokenize import word_tokenize
text = "This is a sample text which is to be stemmed"
text_tokens = word_tokenize(text)
# Reduce words to their stems
stemmed = [PorterStemmer().stem(w) for w in text_tokens]
print(stemmed)
Lemmatization
import nltk
from nltk.stem.wordnet import WordNetLemmatizer
from nltk.tokenize import word_tokenize
nltk.download("wordnet")  # lemmatizer data (first run only)
text = "This is a sample text which is to be lemmatized"
text_tokens = word_tokenize(text)
# Reduce words to their root form
lemmed = [WordNetLemmatizer().lemmatize(w) for w in text_tokens]
print(lemmed)
POS Tagging
import nltk
from nltk.tokenize import word_tokenize
text = "This sentence is used for performing POS Tagging using NLTK"
text_tokens = word_tokenize(text)
# pos_tag expects the full list of tokens, not one token at a time
print(nltk.pos_tag(text_tokens))
Just like NLTK, we can also use spaCy to perform a host of NLP-related tasks:
Tokenization
import spacy
nlp = spacy.load('en_core_web_sm')
# Create an nlp object
doc = nlp("This is a sample sentence which is to be tokenized.")
# Print the list of tokens generated
print([token for token in doc])
Stopword Identification
import spacy
nlp = spacy.load('en_core_web_sm')
# Create an nlp object
doc = nlp("This is a sample sentence which is to be tokenized.")
# Iterate over the tokens
for token in doc:
    # Print the token and whether it is a stopword
    print(token.text, token.is_stop)
Lemmatization
import spacy
nlp = spacy.load('en_core_web_sm')
# Create an nlp object
doc = nlp("This is a sample sentence which is to be tokenized.")
# Iterate over the tokens
for token in doc:
    # Print the token and its lemma
    print(token.text, token.lemma_)
Named Entity Recognition
import spacy
# Load the spaCy model to be used in the program
ner_model = spacy.load("en_core_web_sm")
# Sample text to run NER on
text = "Narendra Modi is the Prime Minister of India"
text_doc = ner_model(text)
# Iterate through each entity in the document and print the NER label
for entity in text_doc.ents:
    print(entity.text, entity.label_)
Principal Component Analysis (PCA) is a dimensionality reduction or feature extraction technique. It is a statistical process that converts observations containing correlated features into a set of orthogonal, uncorrelated features called "Principal Components" (PCs). If the original dataset contains 'n' features, then PCA will create 'n' Principal Components. Consider the example given below:
(Figure 1: the original features X1 and X2; Figure 2: the principal components Z1 and Z2.)
In the above example, Figure 1 shows the two features X1 and X2 in the original dataset. PCA tries to find directions that capture as much variance as possible from the original data; once the algorithm is run, it yields the two Principal Components Z1 and Z2 shown in Figure 2. These Principal Components have the following properties: each is a linear combination of the original features; they are orthogonal to one another, so the new features are uncorrelated; and they are ordered by the variance they capture, with Z1 capturing the maximum variance in the data and Z2 the next highest.
In NLP, feature extraction techniques like BOW and TF-IDF result in high-dimensional datasets. PCA can help us identify the Principal Components that carry the maximum amount of information; we can then project the original data onto these components to obtain a lower-dimensional dataset.
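As a sketch of this workflow on an invented three-document corpus (note that for large sparse TF-IDF matrices, a truncated SVD is the more common choice, since PCA requires a dense array):
from sklearn.decomposition import PCA
from sklearn.feature_extraction.text import TfidfVectorizer
# Toy corpus, invented purely for illustration
docs = ["the cat sat on the mat",
        "the dog chased the cat",
        "dogs and cats are pets"]
# TF-IDF produces one feature per vocabulary term - high-dimensional for real corpora
tfidf = TfidfVectorizer().fit_transform(docs)
# Project onto the top 2 Principal Components
pca = PCA(n_components=2)
reduced = pca.fit_transform(tfidf.toarray())
print(reduced.shape)                  # (3, 2)
print(pca.explained_variance_ratio_)  # variance captured by each component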
Sequence data are data points in which the observations are ordered in a meaningful manner, such as time series data, where the observations are ordered by time. An audio clip is another example of sequence data, in which the words occur in the order in which they are spoken.
Sequence Models are machine learning models whose inputs or outputs are sequential data; Recurrent Neural Networks (RNNs) are a popular example. A couple of use cases of these models are speech recognition, where an audio clip is mapped to its transcript, and machine translation, where a sentence in one language is mapped to its equivalent in another.
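To make this concrete, here is a minimal many-to-one sequence model sketch in Keras; the vocabulary size and layer sizes are arbitrary illustration values:
import numpy as np
import tensorflow as tf
# Hypothetical many-to-one setup: a sequence of token ids -> one sentiment score
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=10000, output_dim=32),  # token ids -> vectors
    tf.keras.layers.SimpleRNN(64),                # reads the sequence, keeps last state
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")
# Run a dummy batch of 2 sequences of 10 random token ids through the model
dummy = np.random.randint(0, 10000, size=(2, 10))
print(model(dummy).shape)  # (2, 1) - one score per input sequence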